All functionality related to Google Cloud Platform and other Google products.
LLMs
Vertex AI
Access PaLM LLMs like text-bison and code-bison via Google Vertex AI.
We need to install google-cloud-aiplatform python package.
pip install google-cloud-aiplatform
See a usage example.
from langchain.llms import VertexAI
Model Garden
Access PaLM and hundreds of OSS models via Vertex AI Model Garden.
We need to install google-cloud-aiplatform python package.
pip install google-cloud-aiplatform
See a usage example.
from langchain.llms import VertexAIModelGarden
Chat models
Vertex AI
Access PaLM chat models like chat-bison and codechat-bison via Google Cloud.
We need to install google-cloud-aiplatform python package.
pip install google-cloud-aiplatform
See a usage example.
from langchain.chat_models import ChatVertexAI
Document Loaders
Google BigQuery
Google BigQuery is a serverless and cost-effective enterprise data warehouse that works across clouds and scales with your data.
BigQueryis a part of theGoogle Cloud Platform.
We need to install google-cloud-bigquery python package.
pip install google-cloud-bigquery
See a usage example.
from langchain.document_loaders import BigQueryLoader
Google Cloud Storage
Google Cloud Storage is a managed service for storing unstructured data.
We need to install google-cloud-storage python package.
pip install google-cloud-storage
There are two loaders for the Google Cloud Storage: the Directory and the File loaders.
See a usage example.
from langchain.document_loaders import GCSDirectoryLoader
See a usage example.
from langchain.document_loaders import GCSFileLoader
Google Drive
Google Drive is a file storage and synchronization service developed by Google.
Currently, only Google Docs are supported.
We need to install several python packages.
pip install google-api-python-client google-auth-httplib2 google-auth-oauthlib
See a usage example and authorization instructions.
from langchain.document_loaders import GoogleDriveLoader
Speech-to-Text
Google Cloud Speech-to-Text is an audio transcription API powered by Google's speech recognition models.
This document loader transcribes audio files and outputs the text results as Documents.
First, we need to install the python package.
pip install google-cloud-speech
See a usage example and authorization instructions.
from langchain.document_loaders import GoogleSpeechToTextLoader
Vector Stores
Google Vertex AI Vector Search
Google Vertex AI Vector Search, formerly known as
Vertex AI Matching Engine, provides the industry's leading high-scale low latency vector database. These vector databases are commonly referred to as vector similarity-matching or an approximate nearest neighbor (ANN) service.
We need to install several python packages.
pip install tensorflow google-cloud-aiplatform tensorflow-hub tensorflow-text
See a usage example.
from langchain.vectorstores import MatchingEngine
Google ScaNN
Google ScaNN (Scalable Nearest Neighbors) is a python package.
ScaNNis a method for efficient vector similarity search at scale.
ScaNNincludes search space pruning and quantization for Maximum Inner Product Search and also supports other distance functions such as Euclidean distance. The implementation is optimized for x86 processors with AVX2 support. See its Google Research github for more details.
We need to install scann python package.
pip install scann
See a usage example.
from langchain.vectorstores import ScaNN
Retrievers
Google Drive
We need to install several python packages.
pip install google-api-python-client google-auth-httplib2 google-auth-oauthlib
See a usage example and authorization instructions.
from langchain_googledrive.retrievers import GoogleDriveRetriever
Vertex AI Search
Google Cloud Vertex AI Search allows developers to quickly build generative AI powered search engines for customers and employees.
We need to install the google-cloud-discoveryengine python package.
pip install google-cloud-discoveryengine
See a usage example.
from langchain.retrievers import GoogleVertexAISearchRetriever
Document AI Warehouse
Google Cloud Document AI Warehouse allows enterprises to search, store, govern, and manage documents and their AI-extracted data and metadata in a single platform.
from langchain.retrievers import GoogleDocumentAIWarehouseRetriever
docai_wh_retriever = GoogleDocumentAIWarehouseRetriever(
    project_number=...
)
query = ...
documents = docai_wh_retriever.get_relevant_documents(
    query, user_ldap=...
)
Tools
Google Drive
We need to install several python packages.
pip install google-api-python-client google-auth-httplib2 google-auth-oauthlib
See a usage example and authorization instructions.
from langchain.utilities.google_drive import GoogleDriveAPIWrapper
from langchain.tools.google_drive.tool import GoogleDriveSearchTool
Google Places
We need to install a python package.
pip install googlemaps
See a usage example and authorization instructions.
from langchain.tools import GooglePlacesTool
Google Search
We need to install a python package.
pip install google-api-python-client
- Set up a Custom Search Engine, following these instructions
- Get an API Key and Custom Search Engine ID from the previous step, and set them as environment variables GOOGLE_API_KEYandGOOGLE_CSE_IDrespectively
from langchain.utilities import GoogleSearchAPIWrapper
For a more detailed walkthrough of this wrapper, see this notebook.
We can easily load this wrapper as a Tool (to use with an Agent). We can do this with:
from langchain.agents import load_tools
tools = load_tools(["google-search"])
Document Transformers
Google Document AI
Document AI is a
Google Cloud Platformservice that transforms unstructured data from documents into structured data, making it easier to understand, analyze, and consume.
We need to set up a GCS bucket and create your own OCR processor
The GCS_OUTPUT_PATH should be a path to a folder on GCS (starting with gs://)
and a processor name should look like projects/PROJECT_NUMBER/locations/LOCATION/processors/PROCESSOR_ID.
We can get it either programmatically or copy from the Prediction endpoint section of the Processor details
tab in the Google Cloud Console.
pip install google-cloud-documentai
pip install google-cloud-documentai-toolbox
See a usage example.
from langchain.document_loaders.blob_loaders import Blob
from langchain.document_loaders.parsers import DocAIParser
Google Translate
Google Translate is a multilingual neural machine translation service developed by Google to translate text, documents and websites from one language into another.
The GoogleTranslateTransformer allows you to translate text and HTML with the Google Cloud Translation API.
To use it, you should have the google-cloud-translate python package installed, and a Google Cloud project with the Translation API enabled. This transformer uses the Advanced edition (v3).
First, we need to install the python package.
pip install google-cloud-translate
See a usage example and authorization instructions.
from langchain.document_transformers import GoogleTranslateTransformer
Toolkits
GMail
Gmail is a free email service provided by Google. This toolkit works with emails through the
Gmail API.
We need to install several python packages.
pip install google-api-python-client google-auth-oauthlib google-auth-httplib2
See a usage example and authorization instructions.
from langchain.agents.agent_toolkits import GmailToolkit
Google Drive
This toolkit uses the Google Drive API.
We need to install several python packages.
pip install google-api-python-client google-auth-httplib2 google-auth-oauthlib
See a usage example and authorization instructions.
from langchain_googledrive.utilities.google_drive import GoogleDriveAPIWrapper
from langchain_googledrive.tools.google_drive.tool import GoogleDriveSearchTool
Chat Loaders
GMail
Gmail is a free email service provided by Google. This loader works with emails through the
Gmail API.
We need to install several python packages.
pip install google-api-python-client google-auth-oauthlib google-auth-httplib2
See a usage example and authorization instructions.
from langchain.chat_loaders.gmail import GMailLoader
3rd Party Integrations
SearchApi
SearchApi provides a 3rd-party API to access Google search results, YouTube search & transcripts, and other Google-related engines.
See usage examples and authorization instructions.
from langchain.utilities import SearchApiAPIWrapper
SerpAPI
SerpApi provides a 3rd-party API to access Google search results.
See a usage example and authorization instructions.
from langchain.utilities import SerpAPIWrapper
YouTube
YouTube Search package searches
YouTubevideos avoiding using their heavily rate-limited API.It uses the form on the YouTube homepage and scrapes the resulting page.
We need to install a python package.
pip install youtube_search
See a usage example.
from langchain.tools import YouTubeSearchTool
YouTube audio
YouTube is an online video sharing and social media platform created by
Use YoutubeAudioLoader to fetch / download the audio files.
Then, use OpenAIWhisperParser to transcribe them to text.
We need to install several python packages.
pip install yt_dlp pydub librosa
See a usage example and authorization instructions.
from langchain.document_loaders.blob_loaders.youtube_audio import YoutubeAudioLoader
from langchain.document_loaders.parsers import OpenAIWhisperParser, OpenAIWhisperParserLocal
YouTube transcripts
YouTube is an online video sharing and social media platform created by
We need to install youtube-transcript-api python package.
pip install youtube-transcript-api
See a usage example.
from langchain.document_loaders import YoutubeLoader