■ Document 클래스의 metadata 속성을 사용해 유튜브 동영상 정보를 구하는 방법을 보여준다.
▶ main.py
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 |
from langchain_community.document_loaders import YoutubeLoader youtubeLoader = YoutubeLoader.from_youtube_url("https://www.youtube.com/watch?v=HAn9vnJy6S4", add_video_info = True) documentList = youtubeLoader.load() document = documentList[0] metadataDictionary = document.metadata print(metadataDictionary) """ { 'source' : 'o7C9ld6Ln-M', 'title' : 'LangServe and LangChain Templates Webinar', 'description' : 'Unknown', 'view_count' : 5363, 'thumbnail_url' : 'https://i.ytimg.com/vi/o7C9ld6Ln-M/hqdefault.jpg?sqp=-oaymwEXCJADEOABSFryq4qpAwkIARUAAIhCGAE=&rs=AOn4CLDf7gvV8D3I2UFy0UsA2Wh0qUhA-A', 'publish_date' : '2023-11-02 00:00:00', 'length' : 2441, 'author' : 'LangChain' } """ |
▶ requirements.txt
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 |
aiohttp==3.9.5 aiosignal==1.3.1 annotated-types==0.7.0 async-timeout==4.0.3 attrs==23.2.0 certifi==2024.6.2 charset-normalizer==3.3.2 dataclasses-json==0.6.7 frozenlist==1.4.1 greenlet==3.0.3 idna==3.7 jsonpatch==1.33 jsonpointer==3.0.0 langchain==0.2.3 langchain-community==0.2.4 langchain-core==0.2.5 langchain-text-splitters==0.2.1 langsmith==0.1.77 marshmallow==3.21.3 multidict==6.0.5 mypy-extensions==1.0.0 numpy==1.26.4 orjson==3.10.4 packaging==23.2 pydantic==2.7.4 pydantic_core==2.18.4 pytube==15.0.0 PyYAML==6.0.1 requests==2.32.3 SQLAlchemy==2.0.30 tenacity==8.3.0 typing-inspect==0.9.0 typing_extensions==4.12.2 urllib3==2.2.1 yarl==1.9.4 youtube-transcript-api==0.6.2 |
※ pip install langchain-community pytube youtube-transcript-api 명령을 실행했다.