Add bedrock latency optimized inference support + Vertex AI Multimodal embedding cost tracking #9623

krrishdholakia · 2025-03-28T20:08:44Z

fix(converse_transformation.py): add performanceConfig param support on bedrock

Closes #7606

fix(converse_transformation.py): refactor to use more flexible single getter for params which are separate config blocks
test(test_main.py): add e2e mock test for bedrock performance config
build(model_prices_and_context_window.json): add versioned multimodal embedding
refactor(multimodal_embeddings/): migrate to config pattern
feat(vertex_ai/multimodalembeddings): calculate usage for multimodal embedding calls

Enables cost calculation for multimodal embeddings

feat(vertex_ai/multimodalembeddings): get usage object for embedding calls

ensures accurate cost tracking for vertexai multimodal embedding calls

fix(embedding_handler.py): remove unused imports
fix: fix linting errors
fix: handle response api usage calculation
test(test_vertex_ai_multimodal_embedding_transformation.py): update tests
test: mark flaky test
feat(vertex_ai/multimodal_embeddings/transformation.py): support text+image+video input
docs(vertex.md): document sending text + image to vertex multimodal embeddings
test: remove incorrect file
fix(multimodal_embeddings/transformation.py): fix linting error
style: remove unused import

…on bedrock Closes #7606

… getter for params which are separate config blocks

vercel · 2025-03-28T20:08:49Z

The latest updates on your projects. Learn more about Vercel for Git ↗︎

Name	Status	Preview	Comments	Updated (UTC)
litellm	✅ Ready (Inspect)	Visit Preview	💬 Add feedback	Mar 29, 2025 3:19am

… embedding

…embedding calls Enables cost calculation for multimodal embeddings

…calls ensures accurate cost tracking for vertexai multimodal embedding calls

litellm/cost_calculator.py

+        return Usage(**usage_obj.model_dump())
+    else:
+        verbose_logger.debug(
+            f"Unknown usage object type: {type(usage_obj)}, usage_obj: {usage_obj}"


To fix the problem, we need to ensure that sensitive information is not logged in clear text. The best way to fix this without changing existing functionality is to redact or mask the sensitive information before logging it. Specifically, we should modify the logging statement on line 492 in litellm/cost_calculator.py to exclude or mask the sensitive data.

…ests

…+image+video input

…mbeddings

codecov · 2025-03-29T07:18:59Z

Codecov Report

Attention: Patch coverage is 82.03883% with 37 lines in your changes missing coverage. Please review.

Files with missing lines	Patch %	Lines
.../vertex_ai/multimodal_embeddings/transformation.py	71.87%	36 Missing ⚠️
...rtex_ai/multimodal_embeddings/embedding_handler.py	87.50%	1 Missing ⚠️

📢 Thoughts on this report? Let us know!

krrishdholakia added 2 commits March 28, 2025 13:05

fix(converse_transformation.py): add performanceConfig param support …

4055449

…on bedrock Closes #7606

fix(converse_transformation.py): refactor to use more flexible single…

a5fabef

… getter for params which are separate config blocks

vercel bot deployed to Preview March 28, 2025 20:08 View deployment

test(test_main.py): add e2e mock test for bedrock performance config

510756b

vercel bot deployed to Preview March 28, 2025 20:14 View deployment

build(model_prices_and_context_window.json): add versioned multimodal…

9654545

… embedding

vercel bot deployed to Preview March 28, 2025 22:20 View deployment

refactor(multimodal_embeddings/): migrate to config pattern

155e2c6

vercel bot deployed to Preview March 28, 2025 23:04 View deployment

feat(vertex_ai/multimodalembeddings): calculate usage for multimodal …

93f3386

…embedding calls Enables cost calculation for multimodal embeddings

vercel bot deployed to Preview March 29, 2025 00:18 View deployment

feat(vertex_ai/multimodalembeddings): get usage object for embedding …

8712d76

…calls ensures accurate cost tracking for vertexai multimodal embedding calls

vercel bot deployed to Preview March 29, 2025 00:33 View deployment

fix(embedding_handler.py): remove unused imports

3499c8b

vercel bot deployed to Preview March 29, 2025 00:35 View deployment

github-advanced-security bot found potential problems Mar 29, 2025

View reviewed changes

fix: fix linting errors

5b15ba8

vercel bot deployed to Preview March 29, 2025 00:47 View deployment

fix: handle response api usage calculation

cff3a76

vercel bot deployed to Preview March 29, 2025 02:21 View deployment

krrishdholakia added 2 commits March 28, 2025 19:39

test(test_vertex_ai_multimodal_embedding_transformation.py): update t…

1683cd8

…ests

test: mark flaky test

6fd40b4

vercel bot deployed to Preview March 29, 2025 02:40 View deployment

vercel bot deployed to Preview March 29, 2025 02:41 View deployment

feat(vertex_ai/multimodal_embeddings/transformation.py): support text…

4b1afd8

…+image+video input

vercel bot deployed to Preview March 29, 2025 02:56 View deployment

docs(vertex.md): document sending text + image to vertex multimodal e…

79f1af3

…mbeddings

vercel bot deployed to Preview March 29, 2025 03:04 View deployment

test: remove incorrect file

14b9b5c

vercel bot deployed to Preview March 29, 2025 03:05 View deployment

fix(multimodal_embeddings/transformation.py): fix linting error

6479a65

vercel bot deployed to Preview March 29, 2025 03:16 View deployment

style: remove unused import

65dbfde

vercel bot deployed to Preview March 29, 2025 03:19 View deployment

krrishdholakia merged commit 5ac61a7 into main Mar 29, 2025
41 of 42 checks passed

krrishdholakia deleted the litellm_dev_03_28_2025_p2 branch March 29, 2025 07:23

krrishdholakia changed the title ~~Add bedrock latency optimized inference support~~ Add bedrock latency optimized inference support + Vertex AI Multimodal embedding cost tracking Mar 29, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Add bedrock latency optimized inference support + Vertex AI Multimodal embedding cost tracking #9623

Add bedrock latency optimized inference support + Vertex AI Multimodal embedding cost tracking #9623

Uh oh!

krrishdholakia commented Mar 28, 2025 •

edited

Loading

Uh oh!

vercel bot commented Mar 28, 2025 •

edited

Loading

Uh oh!

Check failure

Copilot Autofix

codecov bot commented Mar 29, 2025

Uh oh!

Uh oh!

Uh oh!

@@ -490,4 +490,5 @@
                 else:
+                    redacted_usage_obj = {k: (v if k != "client_secret" else "REDACTED") for k, v in usage_obj.items()} if isinstance(usage_obj, dict) else "REDACTED"
                     verbose_logger.debug(
-                        f"Unknown usage object type: {type(usage_obj)}, usage_obj: {usage_obj}"
+                        f"Unknown usage object type: {type(usage_obj)}, usage_obj: {redacted_usage_obj}"
                     )

Uh oh!

Add bedrock latency optimized inference support + Vertex AI Multimodal embedding cost tracking #9623

Add bedrock latency optimized inference support + Vertex AI Multimodal embedding cost tracking #9623

Uh oh!

Conversation

krrishdholakia commented Mar 28, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

vercel bot commented Mar 28, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Check failure

Uh oh!

Uh oh!

Copilot Autofix

codecov bot commented Mar 29, 2025

Codecov Report

Uh oh!

Uh oh!

Uh oh!

krrishdholakia commented Mar 28, 2025 •

edited

Loading

vercel bot commented Mar 28, 2025 •

edited

Loading