Adding New Models from Llama, Mistral, GPT, and Gemini
Jul 25, 2024

Toni Lopez
Software Engineering at Stack AI
New models available
Llama 3.1 (via Groq, Together AI and Meta nodes):
World's largest and most capable open-source foundation model with 405 billion parameters (smaller versions are also available in Stack AI -> 70B and 8B for simpler tasks and faster execution)
Supports advanced use cases like long-form text summarization, multilingual conversational agents, and coding assistants, with a context length of 128K tokens.
Mistral Large 2
123 billion-parameter model with a 128K-token context window** (less parameters than Llama 3.1 an similar performance 🤯)
Excels in code generation, mathematics, and multilingual tasks, outperforming many leading models in these areas.
GPT 4o mini
Cost-efficient small model that surpasses GPT-3.5 Turbo.
It offers superior textual intelligence and multimodal reasoning, with 128K-token context window.
Enables a broad range of tasks, including real-time text responses and applications requiring large context handling. Best for every day simple tasks
Gemini 1.5 Flash
Lightweight and fast model optimized for speed and efficiency (comparable to GPT 4o mini)
Breakthrough one-million-token context window, making it ideal for processing extensive video, audio, and large codebases.
Mistral nemo
Specialized model designed for high-precision tasks in scientific and technical domains, with a 128K-token context window.
It focuses on reducing hallucinations and improving reasoning capabilities, making it suitable for research and development applications.
Other features
Images to LLMs with vision: upload or copy paste images in the chat assistant for processing by GPT 4o or Anthropic Claude 3.5 Sonnet.
Google Search: select the country in which to perform the search.
Make your organization smarter with AI.
Deploy custom AI Assistants, Chatbots, and Workflow Automations to make your company 10x more efficient.