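■ Shows how to pass multimodal data to a model using the HumanMessage class.
※ An image URL can be supplied directly in a content block of type "image_url".
※ Only some model providers support this.
※ Multiple images can also be passed, one "image_url" block per image.
※ The OPENAI_API_KEY environment variable is defined in the .env file.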
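※ The content list described in the notes above can be assembled for any number of images. The following is a minimal sketch, not part of the original example: the helper name build_multimodal_content and the example.com URLs are hypothetical, and only the block shapes ("text" and "image_url") come from this post.

```python
from typing import Any


def build_multimodal_content(prompt: str, image_urls: list[str]) -> list[dict[str, Any]]:
    """Return one text block followed by one image_url block per URL."""
    # Hypothetical helper: the name and signature are assumptions for illustration.
    content: list[dict[str, Any]] = [{"type": "text", "text": prompt}]
    for url in image_urls:
        content.append({"type": "image_url", "image_url": {"url": url}})
    return content


# Placeholder URLs, used only to show the resulting structure.
blocks = build_multimodal_content(
    "describe the weather in these images",
    ["https://example.com/a.jpg", "https://example.com/b.jpg"],
)
for block in blocks:
    print(block["type"])
```

The returned list can be passed as the content argument of HumanMessage, in the same way as the hand-written list in main.py.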
▶ main.py
from dotenv import load_dotenv
from langchain_core.messages import HumanMessage
from langchain_openai import ChatOpenAI

# Load OPENAI_API_KEY from the .env file into the environment.
load_dotenv()

imageURL1 = "https://upload.wikimedia.org/wikipedia/commons/thumb/d/dd/Gfp-wisconsin-madison-the-nature-boardwalk.jpg/2560px-Gfp-wisconsin-madison-the-nature-boardwalk.jpg"
imageURL2 = "https://upload.wikimedia.org/wikipedia/commons/thumb/d/dd/Gfp-wisconsin-madison-the-nature-boardwalk.jpg/2560px-Gfp-wisconsin-madison-the-nature-boardwalk.jpg"

# One text block followed by one image_url block per image.
humanMessage = HumanMessage(
    content=[
        {
            "type": "text",
            "text": "describe the weather in this image"
        },
        {
            "type": "image_url",
            "image_url": {"url": imageURL1}
        },
        {
            "type": "image_url",
            "image_url": {"url": imageURL2}
        }
    ],
)

chatOpenAI = ChatOpenAI(model="gpt-4o")

responseAIMessage = chatOpenAI.invoke([humanMessage])

print(responseAIMessage.content)

"""
The weather in the image appears to be clear and sunny. The sky is mostly blue with some scattered, thin clouds, suggesting a pleasant day with good visibility. The sunlight is bright, casting clear shadows and illuminating the vibrant green of the grass and foliage.
"""
▶ requirements.txt
annotated-types==0.7.0
anyio==4.6.0
certifi==2024.8.30
charset-normalizer==3.3.2
colorama==0.4.6
distro==1.9.0
h11==0.14.0
httpcore==1.0.6
httpx==0.27.2
idna==3.10
jiter==0.5.0
jsonpatch==1.33
jsonpointer==3.0.0
langchain-core==0.3.8
langchain-openai==0.2.1
langsmith==0.1.131
openai==1.51.0
orjson==3.10.7
packaging==24.1
pydantic==2.9.2
pydantic_core==2.23.4
python-dotenv==1.0.1
PyYAML==6.0.2
regex==2024.9.11
requests==2.32.3
requests-toolbelt==1.0.0
sniffio==1.3.1
tenacity==8.5.0
tiktoken==0.8.0
tqdm==4.66.5
typing_extensions==4.12.2
urllib3==2.2.3
※ The packages above were installed by running pip install python-dotenv langchain-openai.
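※ Since only some providers accept a remote image URL, a common alternative is to embed the image bytes as a base64 data URL in the same "image_url" block. The following is a sketch under assumptions: the helper names to_data_url and image_block_from_bytes are hypothetical, the MIME type defaults to image/jpeg, and whether a given provider accepts data URLs should be checked in its documentation.

```python
import base64
from typing import Any


def to_data_url(image_bytes: bytes, mime_type: str = "image/jpeg") -> str:
    """Encode raw image bytes as a base64 data URL."""
    encoded = base64.b64encode(image_bytes).decode("ascii")
    return f"data:{mime_type};base64,{encoded}"


def image_block_from_bytes(image_bytes: bytes, mime_type: str = "image/jpeg") -> dict[str, Any]:
    """Build an image_url content block that carries the image inline."""
    # Hypothetical helper: same block shape as in main.py, with a data URL
    # in place of an https URL.
    return {"type": "image_url", "image_url": {"url": to_data_url(image_bytes, mime_type)}}


# Dummy bytes stand in for real image data read from a file.
block = image_block_from_bytes(b"abc")
print(block["image_url"]["url"])
```

In practice the bytes would come from reading a local image file in binary mode, and the resulting block would replace one of the URL-based blocks in the content list.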