Models

Qwen2-VL 7B Instruct

Qwen2 VL 7B is a multimodal LLM from the Qwen Team with the following key enhancements:SoTA understanding of images of various resolution & ratio: Qwen2-VL achieves state-of-the-art performance o...

Qwen 32K context $0.1/M input tokens $0.1/M output tokens $0.144/K image tokens

The latest and strongest model family from OpenAI, o1 is designed to spend more time thinking before responding. The o1 models are optimized for math, science, programming, and other STEM-related ta ...

OpenAI 125K context $15/M input tokens $60/M output tokens

AI21: Jamba 1.5 Large

Text 2 text

Jamba 1.5 Large is part of AI21's new family of open models, offering superior speed, efficiency, and quality. It features a 256K effective context window, the longest among open models, enabling im ...

Ai21 250K context $2/M input tokens $8/M output tokens

Llama 3.1 Euryale 70B v2.2

Text 2 text

Euryale L3.1 70B v2.2 is a model focused on creative roleplay from Sao10k. It is the successor of Euryale L3 70B v2.1. ...

Rifx.Online 8K context $0.35/M input tokens $0.4/M output tokens

AI21: Jamba 1.5 Mini

Text 2 text

Jamba 1.5 Mini is the world's first production-grade Mamba-based model, combining SSM and Transformer architectures for a 256K context window and high efficiency. It works with 9 languages and can h ...

Ai21 250K context $0.2/M input tokens $0.4/M output tokens

Nous: Hermes 3 70B Instruct

Text 2 text

Hermes 3 is a generalist language model with many improvements over Hermes 2, including advanced agentic capabilities, much better roleplaying, reasoning ...

NousreSearch 128K context $0.4/M input tokens $0.4/M output tokens

Nous: Hermes 3 405B Instruct

Text 2 text

Hermes 3 is a generalist language model with many improvements over Hermes 2, including advanced agentic capabilities, much better roleplaying, reasoning, multi-turn conversation, long context cohere ...

NousreSearch 128K context $1.79/M input tokens $2.49/M output tokens

FREE

Meta: Llama 3.2 11B Vision Instruct (free)

Text image 2 text

# Free

Llama 3.2 11B Vision is a multimodal model with 11 billion parameters, designed to handle tasks combining visual and textual data. It excels in tasks such as image captioning and visual question answ ...

Meta Llama 128K context $0 input tokens $0 output tokens $0.079/K image tokens

Meta: Llama 3.2 11B Vision Instruct

Text image 2 text

Llama 3.2 11B Vision is a multimodal model with 11 billion parameters, designed to handle tasks combining visual and textual data. It excels in tasks such as image captioning and visual question answ ...

Meta Llama 128K context $0.055/M input tokens $0.055/M output tokens $0.079/K image tokens

Lumimaid v0.2 8B

Text 2 text

Lumimaid v0.2 8B is a finetune of Llama 3.1 8B with a "HUGE step up dataset wise" compared to Lumimaid v0.1. Sloppy chats output were purged. Usage of this model ...

Meta Llama 128K context $0.188/M input tokens $1.125/M output tokens

GPT-4o

Text image 2 text

GPT-4o ("o" for "omni") is OpenAI's latest AI model, supporting both text and image inputs with text outputs. It maintains the intelligence level of GPT-4 Turbo while being twi ...

OpenAI 125K context $2.5/M input tokens $10/M output tokens $0.004/M image tokens

OpenAI: GPT-4o

Text image 2 text

GPT-4o ("o" for "omni") is OpenAI's latest AI model, supporting both text and image inputs with text outputs. It maintains the intelligence level of GPT-4 Turbo while being twi ...

OpenAI 125K context $2.5/M input tokens $10/M output tokens $0.004/M image tokens

OpenAI: GPT-4o-mini

Text image 2 text

GPT-4o mini is OpenAI's newest model after GPT-4 Omni, supporting both text and image inputs with text outputs. As their most advanced small model, it is many multiples more afford ...

OpenAI 125K context $0.15/M input tokens $0.6/M output tokens $0.007/M image tokens

Google: Gemini 1.5 Flash-8B

Text image 2 text

Gemini 1.5 Flash-8B is optimized for speed and efficiency, offering enhanced performance in small prompt tasks like chat, transcription, and translation. With reduced latency, it is highly effective ...

Google 976.56K context $0.037/M input tokens $0.15/M output tokens

Inflection: Inflection 3 Productivity

Text 2 text

Inflection 3 Productivity is optimized for following instructions. It is better for tasks requiring JSON output or precise adherence to provided guidelines For emotional intelligence similar to Pi, ...

Inflection 7.81K context $2.5/M input tokens $10/M output tokens

Models

Qwen2-VL 7B Instruct

OpenAI: o1-preview

AI21: Jamba 1.5 Large

Llama 3.1 Euryale 70B v2.2

AI21: Jamba 1.5 Mini

Nous: Hermes 3 70B Instruct

Nous: Hermes 3 405B Instruct

Meta: Llama 3.2 11B Vision Instruct (free)

Meta: Llama 3.2 11B Vision Instruct

Lumimaid v0.2 8B

GPT-4o

OpenAI: GPT-4o

OpenAI: GPT-4o-mini

Google: Gemini 1.5 Flash-8B

Inflection: Inflection 3 Productivity

Categories

Tags