cheap cluely that sorta sucks

A local-first AI desktop assistant that listens to meeting audio and reads on-screen content, then answers questions in real time using Google's Gemini API. It is built with Python and designed to be invisible to other participants in video calls.

Features

Screen Context Capture: OCR technology reads text from your screen (presentations, code, chat, etc.)
Audio Transcription: Real-time speech-to-text using Whisper for meeting audio
AI-Powered Responses: Google Gemini API provides context-aware answers
Translucent Overlay UI: Always-on-top, draggable interface that never blocks your work
Global Hotkeys: Quick access with customizable keyboard shortcuts
Privacy-First: All processing happens locally except AI API calls
Invisible to Others: Never joins meetings or appears in screen shares

Screenshots

The assistant appears as a translucent overlay in the top-right corner of your screen, providing instant AI-powered responses based on your current context.

Installation

Prerequisites

Python 3.9 or higher
Windows 10/11 (primary support)
Google Gemini API key (free tier available)

Step 1: Clone and Setup

git clone <repository-url>
cd hintly

Step 2: Install Dependencies

pip install -r requirements.txt

Step 3: Install Additional Dependencies

For OCR (Choose one):

Option A: Install Tesseract OCR
- Windows: Download from GitHub
- Add to PATH: C:\Program Files\Tesseract-OCR
Option B: Use screen-ocr (automatic fallback)
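
To check which OCR option is available on your machine, a small probe like the one below can help. This is a sketch, not the project's actual detection logic: the function name pick_ocr_backend is hypothetical, and it assumes the screen-ocr package is imported as screen_ocr.

```python
import importlib.util
import shutil


def pick_ocr_backend() -> str:
    """Return 'tesseract', 'screen-ocr', or 'none' depending on what is installed."""
    # Option A: the tesseract executable must be discoverable on PATH.
    if shutil.which("tesseract"):
        return "tesseract"
    # Option B: the screen-ocr package (assumed import name: screen_ocr).
    if importlib.util.find_spec("screen_ocr") is not None:
        return "screen-ocr"
    # Neither option was found; OCR will not work until one is installed.
    return "none"


if __name__ == "__main__":
    print("OCR backend:", pick_ocr_backend())
```

If this prints "none" after installing Tesseract, the install directory is probably missing from PATH.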

For Audio:

Ensure your microphone is working
For system audio capture on Windows, enable "Stereo Mix" in sound settings

Step 4: Set Up Gemini API

Get a free API key from Google AI Studio
Set the environment variable:

Windows:

set GEMINI_API_KEY=your_api_key_here

Linux/Mac:

export GEMINI_API_KEY=your_api_key_here
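
For reference, reading the key from Python might look like the sketch below. The helper name load_api_key is hypothetical and the real code in gemini_client.py may differ; the only thing taken from this README is the variable name GEMINI_API_KEY.

```python
import os


def load_api_key(env_var: str = "GEMINI_API_KEY") -> str:
    """Return the API key from the environment, or raise a clear error."""
    # Strip whitespace so a key pasted with stray spaces still works.
    key = os.environ.get(env_var, "").strip()
    if not key:
        raise RuntimeError(
            f"{env_var} not set. Set it in your shell and restart the terminal."
        )
    return key
```

Failing early with a readable message is easier to debug than an authentication error from the API later on.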

Usage

Starting the Assistant

python cluely_assistant.py

Controls

Toggle Overlay: Ctrl + Alt + C (default)
Voice Trigger: Ctrl + Alt + V (optional)
Drag: Click and drag the overlay to reposition
Minimize: Click the − button to hide
Close: Click the × button to exit
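
If you change the shortcuts, it helps to write them in one consistent form. The sketch below shows one way to normalize a combination such as "Ctrl + Alt + C". The function normalize_hotkey and the modifier list are assumptions for illustration; the format that hotkey_manager.py actually expects is not documented here.

```python
# Modifier names in the order they should appear in the normalized form.
MODIFIERS = ("ctrl", "alt", "shift")


def normalize_hotkey(combo: str) -> str:
    """Turn 'Ctrl + Alt + C' into 'ctrl+alt+c' with modifiers in a fixed order."""
    parts = [p.strip().lower() for p in combo.split("+") if p.strip()]
    mods = [m for m in MODIFIERS if m in parts]
    keys = [p for p in parts if p not in MODIFIERS]
    # Exactly one non-modifier key is required for a usable shortcut.
    if len(keys) != 1:
        raise ValueError(f"expected exactly one non-modifier key in {combo!r}")
    return "+".join(mods + keys)
```

With this sketch, "Ctrl + Alt + C" and "alt+CTRL+c" both normalize to the same string, so duplicates are easy to spot.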

How to Use

Start the assistant - It will appear in the top-right corner
Join a meeting - The assistant listens to audio and captures screen content
Ask questions - Type queries about the meeting, presentation, or screen content
Get instant answers - AI provides context-aware responses

Example Queries

"What's the main topic of this meeting?"
"Summarize the key points from the presentation"
"What code is currently on my screen?"
"What questions should I ask about this topic?"
"Help me understand this technical discussion"

Configuration

Edit config.py to customize:

Hotkeys: Change keyboard shortcuts
UI Settings: Adjust overlay size, opacity, position
Audio Settings: Modify recording duration, sample rate
OCR Settings: Change capture intervals, confidence thresholds
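
As an illustration of how these four groups could be organized, here is a sketch using dataclasses. Every field name and default value is an assumption, except the two default hotkeys and the top-right position, which come from this README. Check config.py itself for the real names.

```python
from dataclasses import dataclass, field


@dataclass
class HotkeyConfig:
    toggle_overlay: str = "ctrl+alt+c"  # default from the Controls section
    voice_trigger: str = "ctrl+alt+v"   # default from the Controls section


@dataclass
class UIConfig:
    width: int = 400            # assumed value, in pixels
    height: int = 300           # assumed value, in pixels
    opacity: float = 0.85       # assumed value, 0 < opacity <= 1
    position: str = "top-right"


@dataclass
class AudioConfig:
    record_seconds: float = 5.0  # assumed value
    sample_rate: int = 16000     # assumed value, in Hz


@dataclass
class OCRConfig:
    capture_interval_seconds: float = 2.0  # assumed value
    min_confidence: int = 60               # assumed value, percent


@dataclass
class Config:
    hotkeys: HotkeyConfig = field(default_factory=HotkeyConfig)
    ui: UIConfig = field(default_factory=UIConfig)
    audio: AudioConfig = field(default_factory=AudioConfig)
    ocr: OCRConfig = field(default_factory=OCRConfig)

    def validate(self) -> None:
        """Reject values that cannot work, before the app starts."""
        if not 0.0 < self.ui.opacity <= 1.0:
            raise ValueError("ui.opacity must be in (0, 1]")
        if self.audio.sample_rate <= 0 or self.audio.record_seconds <= 0:
            raise ValueError("audio settings must be positive")
        if not 0 <= self.ocr.min_confidence <= 100:
            raise ValueError("ocr.min_confidence must be between 0 and 100")
```

Grouping settings this way keeps related values together and lets one validate() call catch typos such as an opacity of 85 instead of 0.85.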

Architecture

┌─────────────────┐    ┌─────────────────┐    ┌─────────────────┐
│   Screen Capture│    │  Audio Capture  │    │   Gemini API    │
│   (OCR)         │    │   (Whisper)     │    │   (AI)          │
└─────────────────┘    └─────────────────┘    └─────────────────┘
         │                       │                       │
         └───────────────────────┼───────────────────────┘
                                 │
                    ┌─────────────────┐
                    │  Main Assistant │
                    │   (Orchestrator)│
                    └─────────────────┘
                                 │
                    ┌─────────────────┐
                    │  Overlay UI     │
                    │  (PyQt5)        │
                    └─────────────────┘
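
The data flow in the diagram can be sketched in a few lines. This is a simplified illustration, not the code in cluely_assistant.py: the names Assistant and build_prompt, the prompt layout, and the max_chars limit are all assumptions. The three inputs are passed in as plain callables so the sketch runs without OCR, Whisper, or a Gemini key.

```python
from typing import Callable


def build_prompt(query: str, screen_text: str, transcript: str,
                 max_chars: int = 4000) -> str:
    """Combine the user's query with trimmed screen and audio context."""
    # Keep only the most recent characters of each source to bound prompt size.
    screen = screen_text.strip()[-max_chars:]
    audio = transcript.strip()[-max_chars:]
    return (
        f"Screen text:\n{screen or '(none)'}\n\n"
        f"Meeting transcript:\n{audio or '(none)'}\n\n"
        f"Question: {query.strip()}"
    )


class Assistant:
    """Orchestrator: gathers context from both sources and asks the model."""

    def __init__(self,
                 read_screen: Callable[[], str],
                 read_transcript: Callable[[], str],
                 ask_model: Callable[[str], str]) -> None:
        self.read_screen = read_screen
        self.read_transcript = read_transcript
        self.ask_model = ask_model

    def answer(self, query: str) -> str:
        prompt = build_prompt(query, self.read_screen(), self.read_transcript())
        return self.ask_model(prompt)


if __name__ == "__main__":
    # Stand-ins for the OCR, Whisper, and Gemini components.
    demo = Assistant(
        read_screen=lambda: "Q3 roadmap: ship overlay v2",
        read_transcript=lambda: "Let's review the roadmap first.",
        ask_model=lambda prompt: f"(model reply to {len(prompt)} chars)",
    )
    print(demo.answer("What's the main topic of this meeting?"))
```

The overlay UI would call answer() with the typed query and display the returned string.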

Privacy & Security

Local Processing: Screen capture and audio transcription happen locally
Selective Sharing: Only your query and necessary context are sent to Gemini
No Meeting Participation: The assistant never joins calls or appears in recordings
Data Retention: Audio transcripts are kept only in memory and cleared regularly
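
To make the data retention point concrete, here is a sketch of an in-memory transcript buffer that drops old entries. The class name TranscriptBuffer and the limits (five minutes, 200 entries) are assumptions; the actual retention policy in audio_capture.py may differ.

```python
import time
from collections import deque
from typing import Callable, Deque, Tuple


class TranscriptBuffer:
    """Holds recent transcript snippets in memory only; nothing is written to disk."""

    def __init__(self, max_age_seconds: float = 300.0, max_items: int = 200,
                 clock: Callable[[], float] = time.monotonic) -> None:
        # deque(maxlen=...) silently discards the oldest entry when full.
        self._items: Deque[Tuple[float, str]] = deque(maxlen=max_items)
        self._max_age = max_age_seconds
        self._clock = clock

    def add(self, text: str) -> None:
        self._items.append((self._clock(), text))

    def _prune(self) -> None:
        # Drop entries older than max_age_seconds, oldest first.
        cutoff = self._clock() - self._max_age
        while self._items and self._items[0][0] < cutoff:
            self._items.popleft()

    def text(self) -> str:
        """Return the remaining snippets joined into one string."""
        self._prune()
        return " ".join(snippet for _, snippet in self._items)

    def clear(self) -> None:
        self._items.clear()
```

The clock parameter exists only so the expiry logic can be tested without waiting; in normal use the default is fine.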

Troubleshooting

Common Issues

"GEMINI_API_KEY not set"
- Ensure the environment variable is set correctly
- Restart your terminal after setting the variable
OCR not working
- Install Tesseract OCR or ensure screen-ocr is available
- Check that text is visible and not too small
Audio not capturing
- Check microphone permissions
- Enable "Stereo Mix" for system audio capture
- Ensure Whisper model downloaded successfully
Overlay not appearing
- Check if another application is blocking it
- Try the hotkey Ctrl + Alt + C
- Restart the application

Logs

Check cluely_assistant.log for detailed error information.
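
If you are extending the project and want your own module to write to the same file, a setup along these lines should work. It is a sketch using the standard logging module: the logger name "cluely_assistant" and the message format are assumptions, and only the file name comes from this README.

```python
import logging


def setup_logging(path: str = "cluely_assistant.log",
                  level: int = logging.INFO) -> logging.Logger:
    """Return a logger that appends timestamped records to the given file."""
    logger = logging.getLogger("cluely_assistant")  # assumed logger name
    logger.setLevel(level)
    # Avoid attaching a second handler if setup_logging is called twice.
    if not logger.handlers:
        handler = logging.FileHandler(path, encoding="utf-8")
        handler.setFormatter(
            logging.Formatter("%(asctime)s %(levelname)s %(name)s: %(message)s")
        )
        logger.addHandler(handler)
    return logger
```

Calling logger.exception(...) inside an except block records the full traceback, which is usually what you need when reporting an issue.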

Development

Project Structure

hintly/
├── cluely_assistant.py    # Main application entry point
├── config.py             # Configuration settings
├── screen_capture.py     # OCR and screen capture
├── audio_capture.py      # Audio recording and transcription
├── gemini_client.py      # Google Gemini API integration
├── overlay_ui.py         # PyQt5 overlay interface
├── hotkey_manager.py     # Global hotkey handling
├── requirements.txt      # Python dependencies
└── README.md            # This file

Contributing

Fork the repository
Create a feature branch
Make your changes
Test thoroughly
Submit a pull request

License

This project is open source. Please respect the privacy and security guidelines when using or modifying this software.

Acknowledgments

Inspired by Cluely
Built with Google Gemini API
Uses OpenAI Whisper for speech recognition
PyQt5 for the user interface

Support

For issues and questions:

Check the troubleshooting section
Review the logs
Open an issue on GitHub

Note: This is a local-first implementation designed for personal use. Always respect privacy and security best practices when using AI assistants.
