Skip to content

[Feature]: Add support for CompactifAI #14476

@ikaadil

Description

@ikaadil

The Feature

Add support for CompactifAI as a LLM provider.

Motivation, pitch

CompactifAI offers highly compressed versions of leading language models, delivering up to 70% lower inference costs, 4x throughput gains, and low-latency inference with minimal quality loss (<5%).
Its OpenAI-compatible API makes integration straightforward, while enabling developers to build ultra-efficient, scalable AI apps with superior concurrency and resource efficiency.

Adding CompactifAI would give users a cost-effective, high-performance provider option.

Website / Social links

Twitter / LinkedIn details

Metadata

Metadata

Assignees

No one assigned

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions