audio

Tag

Hugging Face API wrapper for Delphi
Hugging Face API wrapper for Delphi 1.0.1
by MaxiDonkey (Galactic911) in IDE Plugins for Delphi

The Hugging Face API wrapper for Delphi leverages cutting-edge models to deliver powerful features, including object detection, music generation, text classification, sentiment analysis, image segmentation, speech-to-text transcription, and text generation.

9 Jan 2025
MIT license

MakerAI Suite: Advanced AI Components for Delphi
MakerAI Suite: Advanced AI Components for Delphi 1.3.0
by CimaMaker in Components for Delphi

The MakerAI Suite is a comprehensive set of Delphi components designed to integrate artificial intelligence into your applications. With support for state-of-the-art models and functionalities, the suite includes tools for natural language processing, audio transcription, image generation and other.

6 Jan 2025
MIT License

Trial - RVMedia for FMX
Trial - RVMedia for FMX 11.0
by Sergey Tkachenko in Trial for RAD Studio

RVMedia is a set of components for displaying video from various sources, controlling IP cameras, organizing video chats, recording audio and video files (Windows, macOS and Linux platforms).

12 Dec 2024
Trial

Trial - RVMedia for VCL
Trial - RVMedia for VCL 11.0
by Sergey Tkachenko in Trial for RAD Studio

RVMedia is a set of components for displaying video from various sources, controlling IP cameras, organizing video chats, recording audio and video files.

12 Dec 2024
Trial

GroqCloud API wrapper for Delphi
GroqCloud API wrapper for Delphi 1.0
by MaxiDonkey (Galactic911) in IDE Plugins for Delphi

The GroqCloud API wrapper for Delphi provides access to models from Meta, OpenAI, MistralAI and Google on Groq’s LPUs, offering chat, text generation, image analysis, audio transcription, JSON output, tool integration, and content moderation capabilities.

29 Nov 2024
MIT license

Gemini API wrapper for Delphi
Gemini API wrapper for Delphi 1.0
by MaxiDonkey (Galactic911) in IDE Plugins for Delphi

The Gemini API wrapper for Delphi utilizes advanced models developed by Google to provide robust capabilities, including interactive chat, text embeddings, code generation, image and video prompting, audio analysis and transcription, fine-tuning, caching, and integration with Google Search.

12 Nov 2024
MIT license