AI Skills
Voice & Speech AI
Speech-to-text, text-to-speech, voice cloning, real-time voice agents.
All 76AI Agents8AI Frameworks8Coding Assistants2Code Generation6MCP Servers4RAG & Vector DBs4Prompt Engineering5LLM Evals4Voice & Speech AI4Image & Vision AI4Chatbots & Companions4AI Apps4Workflow Automation3Browser Automation2CLI Tools3IDE Extensions3DevOps & MLOps1Fine-Tuning3Local LLM Runtimes3Data Extraction1
Voice & Speech AI
β 75.0kWhisper
by openai
OpenAI's open-source multilingual speech recognition model. State-of-the-art transcription, runs locally with the right hardware.
Python
Voice & Speech AI
β 39.0kwhisper.cpp
by ggerganov
High-performance C/C++ port of OpenAI Whisper. Real-time transcription on a Mac, no GPU required.
C++
Voice & Speech AI
β 36.0kCoqui TTS
by coqui-ai
Deep-learning toolkit for text-to-speech, including 1100+ languages, voice cloning from 6-second samples, and many pretrained models.
Python
Voice & Speech AI
β 7.0kLiveKit Agents
by livekit
Build real-time voice and video AI agents. Plug in any LLM + TTS + STT β LiveKit handles the WebRTC plumbing.
Python