
Models

Qwen2-VL 7B is a multimodal LLM from the Qwen Team with the following key enhancements: SoTA understanding of images of various resolution & ratio: Qwen2-VL achieves state-of-the-art performance o...

Qwen2-VL 7B Instruct
Qwen
32K context $0.1/M input tokens $0.1/M output tokens $0.144/K image tokens

The latest and strongest model family from OpenAI, o1 is designed to spend more time thinking before responding. The o1 models are optimized for math, science, programming, and other STEM-related ta ...

OpenAI: o1-preview
OpenAI
125K context $15/M input tokens $60/M output tokens

Jamba 1.5 Large is part of AI21's new family of open models, offering superior speed, efficiency, and quality. It features a 256K effective context window, the longest among open models, enabling im ...

AI21: Jamba 1.5 Large
AI21
250K context $2/M input tokens $8/M output tokens

Euryale L3.1 70B v2.2 is a model focused on creative roleplay from Sao10k. It is the successor of Euryale L3 70B v2.1. ...

Llama 3.1 Euryale 70B v2.2
Rifx.Online
8K context $0.35/M input tokens $0.4/M output tokens

Jamba 1.5 Mini is the world's first production-grade Mamba-based model, combining SSM and Transformer architectures for a 256K context window and high efficiency. It works with 9 languages and can h ...

AI21: Jamba 1.5 Mini
AI21
250K context $0.2/M input tokens $0.4/M output tokens

Hermes 3 is a generalist language model with many improvements over Hermes 2, including advanced agentic capabilities, much better roleplaying, reasoning ...

Nous: Hermes 3 70B Instruct
Nous Research
128K context $0.4/M input tokens $0.4/M output tokens

Hermes 3 is a generalist language model with many improvements over Hermes 2, including advanced agentic capabilities, much better roleplaying, reasoning, multi-turn conversation, long context cohere ...

Nous: Hermes 3 405B Instruct
Nous Research
128K context $1.79/M input tokens $2.49/M output tokens

Llama 3.2 11B Vision is a multimodal model with 11 billion parameters, designed to handle tasks combining visual and textual data. It excels in tasks such as image captioning and visual question answ ...

Meta: Llama 3.2 11B Vision Instruct (free)
Meta Llama
128K context $0 input tokens $0 output tokens $0.079/K image tokens

Llama 3.2 11B Vision is a multimodal model with 11 billion parameters, designed to handle tasks combining visual and textual data. It excels in tasks such as image captioning and visual question answ ...

Meta: Llama 3.2 11B Vision Instruct
Meta Llama
128K context $0.055/M input tokens $0.055/M output tokens $0.079/K image tokens

Lumimaid v0.2 8B is a finetune of Llama 3.1 8B with a "HUGE step up dataset wise" compared to Lumimaid v0.1. Sloppy chat outputs were purged. Usage of this model ...

Lumimaid v0.2 8B
Meta Llama
128K context $0.188/M input tokens $1.125/M output tokens

GPT-4o ("o" for "omni") is OpenAI's latest AI model, supporting both text and image inputs with text outputs. It maintains the intelligence level of GPT-4 Turbo while being twi ...

GPT-4o
OpenAI
125K context $2.5/M input tokens $10/M output tokens $0.004/M image tokens

GPT-4o ("o" for "omni") is OpenAI's latest AI model, supporting both text and image inputs with text outputs. It maintains the intelligence level of GPT-4 Turbo while being twi ...

OpenAI: GPT-4o
OpenAI
125K context $2.5/M input tokens $10/M output tokens $0.004/M image tokens

GPT-4o mini is OpenAI's newest model after GPT-4 Omni, supporting both text and image inputs with text outputs. As their most advanced small model, it is many multiples more afford ...

OpenAI: GPT-4o-mini
OpenAI
125K context $0.15/M input tokens $0.6/M output tokens $0.007/M image tokens

Gemini 1.5 Flash-8B is optimized for speed and efficiency, offering enhanced performance in small prompt tasks like chat, transcription, and translation. With reduced latency, it is highly effective ...

Google: Gemini 1.5 Flash-8B
Google
976.56K context $0.037/M input tokens $0.15/M output tokens

Inflection 3 Productivity is optimized for following instructions. It is better for tasks requiring JSON output or precise adherence to provided guidelines. For emotional intelligence similar to Pi, ...

Inflection: Inflection 3 Productivity
Inflection
7.81K context $2.5/M input tokens $10/M output tokens