Explore our collection of cutting-edge AI models available on ModelHarbor. From ultra-fast lightweight models to advanced reasoning engines — all accessible via our OpenAI-compatible API.
Looking for max value? This is the ULTIMATE budget king! 💸 Qwen3.6-35B-A3B is a Mixture-of-Experts model with only 3B active parameters out of 35B total — meaning it's crazy efficient while still delivering solid performance. It supports vision, function calling, AND computer use at the lowest price point on the platform. No cap, this is the move for high-volume tasks that need to stay wallet-friendly!
Need AI-generated images? Gemini 2.5 Flash Image is your go-to! 🎨 This model specializes in creating and understanding images with Google's latest Gemini technology. It supports vision, computer use, and web search — all at an incredibly affordable price point. Perfect for creative projects, visual analysis, and multimodal applications!
Need blazing-fast responses without breaking the bank? Gemini 3.1 Flash Lite is the speed champion! 🏎️ This lightweight model delivers Gemini-quality responses at lightning speed with a massive 1M token context window. It supports vision, computer use, and web search — making it incredibly versatile for its price. The ultimate choice for high-throughput applications!
GLM-5 is the value champion for reasoning tasks! 🧩 This model from Zhipu AI delivers impressive thinking capabilities with built-in reasoning support and prompt caching for even better efficiency. With a massive 202K context window and function calling support, it's perfect for complex analytical tasks that don't require vision. The prompt caching feature means repeated queries cost even less!
The next generation of image generation is here! ✨ Gemini 3.1 Flash Image Preview brings Google's latest multimodal capabilities with a 65K token context and 65K max output. It supports vision and web search, making it perfect for creative workflows that need both image understanding and generation. This preview model gives you early access to cutting-edge image AI technology!
When you need maximum performance from the GLM family, GLM-5.1 (glm-max) delivers! 💎 This is Zhipu AI's most capable model with 202K context, prompt caching, and parallel function calling. It's designed for demanding text-based tasks that require deep understanding and precise outputs. The perfect choice when you need top-tier reasoning without multimodal overhead.
Need GLM power at turbo speed? GLM-5-Turbo is your answer! ⚡ Same 202K context and function calling capabilities as glm-max, but optimized for faster responses. Perfect for production workloads that need both quality and speed. When you want GLM-level intelligence with minimal latency, this is the one!
DeepSeek V4 Flash is the NEW budget champion! ⚡ Same price as Qwen3.6 but with a jaw-dropping 3.84M token max output — that's not a typo! This model can generate massively long responses, making it perfect for code generation, long-form content, and complex multi-step tasks. Plus it supports vision, computer use, and function calling at the cheapest price on the platform!
Gemini 3 Flash Preview is the balanced powerhouse in Google's lineup! ⚖️ With a massive 1M token context, vision support, PDF processing, web search, and computer use — it's got everything you need for serious multimodal work. The premium pricing reflects its top-tier capabilities across all modalities. Perfect when you need the full Gemini experience with deep document understanding!
DeepSeek V4 Pro is the premium DeepSeek model with the same jaw-dropping 3.84M token max output! 🔋 This is DeepSeek's most capable model — combining advanced reasoning with vision, computer use, and function calling. When you need DeepSeek's best quality output with massive generation length, this is the one. A serious contender in the pro-tier model space!
The king of the hill has arrived! 👑 Gemini 3.1 Pro Preview is Google's most powerful AI model, delivering state-of-the-art performance across all benchmarks. With a 1M token context window, vision, computer use, and web search — this is the ultimate model for the most demanding tasks. When you need the absolute best, no compromises, Gemini 3.1 Pro is the answer!