Skip to content

This repository demonstrates the construction of a state-of-the-art multimodal search engine, leveraging Amazon Titan Embeddings, Amazon Bedrock, and LangChain.

License

Notifications You must be signed in to change notification settings

build-on-aws/langchain-embeddings

πŸš€ Multimodal Search Learning Experience

Getting started with Amazon Bedrock, RAG, and Vector database in Python

GitHub stars GitHub forks License Python AWS LangChain


Search across text, images, and video content with natural language queries

⭐ Star this repository

🎯 Learning Path: Explore β†’ Build β†’ Scale

πŸ› οΈ Component πŸ“ What You'll Learn ⏱️ Time πŸ“Š Level
πŸ““ Jupyter Notebooks Multimodal AI fundamentals with interactive tutorials 30-120 min Beginner
πŸ—„οΈ Aurora PostgreSQL Vector Database Vector database setup with pgvector extension 15 min Intermediate
⚑ Serverless Lambda Vector Database System Multi-modal document processing with Lambda 10 min Intermediate
πŸŽ₯ Ask Your Video: Audio/Video Processing Pipeline Video analysis with ECS and vector search 25 min Advanced

πŸ““ Learning Notebooks

πŸ““ Notebook 🎯 Focus & Key Learning ⏱️ Time πŸ“Š Level πŸ–ΌοΈ Diagram
01 - Semantic Search with LangChain, Amazon Titan Embeddings, and FAISS Text embeddings and PDF processing - Document chunking, embeddings generation, FAISS vector store operations 30 min Beginner PDF Vector DB
02 - Building a Multimodal Image Search App with Titan Embeddings Visual search capabilities - Image embeddings, multimodal search, natural language image queries 45 min Intermediate Image Vector DB
03 - Supercharging Vector Similarity Search with Amazon Aurora and pgvector Production database setup - PostgreSQL vector operations, pgvector extension, scalable similarity search 60 min Intermediate Aurora Setup
04 - Video Understanding Video content analysis - Nova models for video processing, content extraction, video understanding workflows 45 min Advanced Video Understanding
05 - Video and Audio Content Analysis with Amazon Bedrock Audio processing workflows - Transcription, audio embeddings, multimedia content analysis 40 min Advanced Video Analysis
06 - Building Agentic Video RAG with Strands Agents - Local AI agents for video analysis - Local agent implementation, memory-enhanced agents, persistent context storage 90 min Expert Local Agent
07 - Building Agentic Video RAG with Strands Agents - Cloud Production agent deployment - Cloud-based agent architecture, ECS deployment, scalable agent workflows 120 min Expert Cloud Agent

☁️ Demo Applications

πŸ—οΈ Application πŸ“ Description ⏱️ Deploy Time πŸ“Š Complexity πŸ–ΌοΈ Diagram
πŸ—„οΈ Aurora PostgreSQL Vector Database CDK stack for vector database setup 15 min Intermediate Aurora Architecture
⚑ Serverless Lambda Vector Database System Multi-modal processing with Lambda functions 10 min Intermediate Serverless Architecture
πŸŽ₯ Ask Your Video Processing Pipeline ECS-based video analysis system 25 min Advanced Video Pipeline

πŸ”§ AWS Services Demonstrated

Learn AWS's Advanced AI and Database Services

πŸ”§ Service 🎯 Purpose ⚑ Key Capabilities
Amazon Bedrock AI model access Titan Embeddings, Nova models for multimodal processing
Amazon Aurora PostgreSQL Vector database pgvector extension for similarity search operations
AWS Lambda Serverless compute Event-driven document and image processing
Amazon ECS Container orchestration Scalable video processing workflows
Amazon S3 Object storage Document, image, and video content storage
Amazon Transcribe Speech-to-text Audio content extraction from video files
AWS Step Functions Workflow orchestration Complex multi-step video processing
Amazon API Gateway API management RESTful endpoints for search operations

πŸ’° Cost Estimation

πŸ’° Service πŸ’΅ Approximate Cost πŸ“Š Usage Pattern πŸ”— Pricing Link
Amazon Bedrock ~$0.10 per 1K tokens Text/image embeddings Bedrock Pricing
Aurora PostgreSQL ~$0.08/hour Vector database operations Aurora Pricing
AWS Lambda ~$0.0001/request API endpoint calls Included in AWS Free Tier
Amazon S3 ~$0.023/GB/month Content storage S3 Pricing
Amazon Transcribe ~$0.024/minute Audio processing Transcribe Pricing

πŸ’‘ Start with notebooks for local development at no cost, then explore AWS services within Free Tier limits.


🎯 Prerequisites

Before You Begin:

  • AWS Account with Amazon Bedrock access enabled
  • Python 3.8+ installed locally
  • AWS CLI configured with appropriate permissions
  • Docker installed (for container-based demos)

AWS Credentials Setup: Follow the AWS credentials configuration guide to configure your environment.


πŸš€ Quick Start Guide

1. Start Learning

git clone https://github.com/build-on-aws/langchain-embeddings.git
cd langchain-embeddings/notebooks

2. Deploy Vector Database (15 minutes)

cd create-aurora-pgvector
cdk deploy

3. Build Serverless APIs (10 minutes)

cd serveless-embeddings
cdk deploy

4. Scale with Containers (25 minutes)

cd container-video-embeddings
cdk deploy

πŸ“š Additional Learning Resources


⭐ Star this repository β€’ πŸ“– Start Learning

Star History


πŸ‡»πŸ‡ͺπŸ‡¨πŸ‡± Β‘Gracias!

πŸ“„ License

This library is licensed under the MIT-0 License. See the LICENSE file for details.

About

This repository demonstrates the construction of a state-of-the-art multimodal search engine, leveraging Amazon Titan Embeddings, Amazon Bedrock, and LangChain.

Topics

Resources

License

Code of conduct

Contributing

Security policy

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 3

  •  
  •  
  •