Back to Home
Instella

Instella

Instella is a fully open 3B-parameter language model from AMD that generates and understands text, with openly released weights, training code, data, and hyperparameters.

Open Source
Try Now
7 views
0 comments

Instella is a fully open 3-billion-parameter language model family designed to deliver strong general-purpose language understanding and generation. Developed and trained by AMD on Instinctโ„ข MI300X GPUs, it targets researchers, developers, and organizations that require transparent, reproducible, and high-performing models without restrictive licensing. Instella-3B provides a foundation for tasks such as text generation, summarization, code assistance, and conversational AI in both research and production environments.

Instella stands out through its complete openness: AMD releases model weights, training hyperparameters, datasets, and code, enabling full inspection, customization, and extension. The models are trained from scratch using the AMD ROCm software stack and modern efficiency techniques including FlashAttention-2, Torch Compile, and Fully Sharded Data Parallelism (FSDP) with hybrid sharding for scalable training across large GPU clusters. Benchmarks show that Instella-3B significantly outperforms existing fully open models of similar size and delivers competitive performance relative to open-weight models such as Llama-3.2-3B, Gemma-2-2B, and Qwen-2.5-3B, including their instruction-tuned variants.

Tags

open source language modelllm developmentmodel training and fine tuningmachine learning researchamd instinct mi300x optimization

Launch Team

Alternatives & Similar Tools

Explore 453+ top alternatives to Instella

HipoCap

HipoCap

Securely govern AI agents by enforcing RBAC, blocking prompt injection, and monitoring all tool executions in real time with open-source observability and controls.

โ˜…0.0 (0 ratings)
AI AgentsLLM Models
0
4
OPEN_SOURCETry Now โ†’
Llama

Llama

Llama is a family of open-source large language models for text and multimodal understanding, generation, and reasoning, designed for integration into applications and services.

โ˜…0.0 (0 ratings)
App BuildersCustomer SupportLLM Models
0
42
OPEN_SOURCETry Now โ†’
Kimi K2 Thinking

Kimi K2 Thinking

Kimi K2 Thinking is an open-source AI reasoning and experimentation framework designed to explore an

โ˜…0.0 (0 ratings)
LLM Models
0
59
OPEN_SOURCETry Now โ†’
ChatArena

ChatArena

ChatArena is a framework for creating and running multi-agent language game environments that evaluate and develop communication and collaboration behaviors in large language models.

โ˜…0.0 (0 ratings)
LLM Models
0
77
OPEN_SOURCETry Now โ†’
OpenAGI

OpenAGI

OpenAGI is a research framework that integrates large language models with domain-specific tools and expert knowledge to build, evaluate, and improve task-oriented AI agents.

โ˜…0.0 (0 ratings)
AI AgentsLLM Models
0
100
OPEN_SOURCETry Now โ†’
BraveGPT

BraveGPT

BraveGPT is a browser extension that integrates AI-generated chat responses and search result summaries directly into Brave Search using large language models.

โ˜…0.0 (0 ratings)
LLM Models
0
66
OPEN_SOURCETry Now โ†’
H2o AI

H2o AI

H2o AI is a generative AI platform for building, deploying, and managing custom large language model applications across airgapped, onโ€‘premises, and cloud VPC environments.

โ˜…0.0 (0 ratings)
LLM ModelsSupply Chain ManagementData Analytics
0
91
OPEN_SOURCETry Now โ†’
Camel AI

Camel AI

Camel AI is a multi-agent large language model framework and open-source community for developing, coordinating, and studying interactions between specialized AI agents.

โ˜…0.0 (0 ratings)
AI AgentsLLM ModelsAPI Management
0
58
OPEN_SOURCETry Now โ†’

Comments (0)

Please sign in to comment

๐Ÿ’ฌ No comments yet

Be the first to share your thoughts!