
Instella
Instella is a fully open 3B-parameter language model from AMD that generates and understands text, with openly released weights, training code, data, and hyperparameters.
Instella is a fully open 3-billion-parameter language model family designed to deliver strong general-purpose language understanding and generation. Developed and trained by AMD on Instinct™ MI300X GPUs, it targets researchers, developers, and organizations that require transparent, reproducible, and high-performing models without restrictive licensing. Instella-3B provides a foundation for tasks such as text generation, summarization, code assistance, and conversational AI in both research and production environments.
Instella stands out through its complete openness: AMD releases model weights, training hyperparameters, datasets, and code, enabling full inspection, customization, and extension. The models are trained from scratch using the AMD ROCm software stack and modern efficiency techniques including FlashAttention-2, Torch Compile, and Fully Sharded Data Parallelism (FSDP) with hybrid sharding for scalable training across large GPU clusters. Benchmarks show that Instella-3B significantly outperforms existing fully open models of similar size and delivers competitive performance relative to open-weight models such as Llama-3.2-3B, Gemma-2-2B, and Qwen-2.5-3B, including their instruction-tuned variants.
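To make the "FSDP with hybrid sharding" idea concrete, here is a minimal sketch in plain Python of how ranks are typically grouped under hybrid sharding: parameters are sharded across the GPUs inside one node and replicated across nodes. The function names, the 16-GPU cluster, and the 8-GPUs-per-node layout are illustrative assumptions, not details of AMD's actual training setup.

```python
from math import ceil


def hybrid_shard_groups(world_size, gpus_per_node):
    """Return (shard_groups, replicate_groups) for a hybrid-sharded layout.

    Hypothetical helper for illustration only: ranks in the same node form a
    shard group, and ranks with the same local index on different nodes form
    a replicate group.
    """
    if world_size % gpus_per_node != 0:
        raise ValueError("world_size must be a multiple of gpus_per_node")
    num_nodes = world_size // gpus_per_node
    # Each node shards the full parameter set across its own GPUs.
    shard_groups = [
        list(range(node * gpus_per_node, (node + 1) * gpus_per_node))
        for node in range(num_nodes)
    ]
    # GPUs holding the same shard on different nodes keep replicas in sync.
    replicate_groups = [
        list(range(local, world_size, gpus_per_node))
        for local in range(gpus_per_node)
    ]
    return shard_groups, replicate_groups


def params_per_gpu(total_params, gpus_per_node):
    """Parameters held by each GPU when one node shards the whole model."""
    return ceil(total_params / gpus_per_node)


if __name__ == "__main__":
    # Assumed cluster shape: 2 nodes with 8 GPUs each.
    shard, replicate = hybrid_shard_groups(world_size=16, gpus_per_node=8)
    print("shard groups:", shard)
    print("replicate groups:", replicate)
    print("params per GPU:", params_per_gpu(3_000_000_000, 8))
```

The design trade-off this illustrates: sharding stays inside a node, where interconnect bandwidth is highest, while only gradient synchronization crosses node boundaries.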
Alternatives & Similar Tools

HipoCap
Securely govern AI agents by enforcing RBAC, blocking prompt injection, and monitoring all tool executions in real time with open-source observability and controls.

Llama
Llama is a family of open-source large language models for text and multimodal understanding, generation, and reasoning, designed for integration into applications and services.
Kimi K2 Thinking
Kimi K2 Thinking is an open-source AI reasoning and experimentation framework designed to explore an
ChatArena
ChatArena is a framework for creating and running multi-agent language game environments that evaluate and develop communication and collaboration behaviors in large language models.

OpenAGI
OpenAGI is a research framework that integrates large language models with domain-specific tools and expert knowledge to build, evaluate, and improve task-oriented AI agents.

BraveGPT
BraveGPT is a browser extension that integrates AI-generated chat responses and search result summaries directly into Brave Search using large language models.
H2o AI
H2o AI is a generative AI platform for building, deploying, and managing custom large language model applications across air-gapped, on-premises, and cloud VPC environments.

Camel AI
Camel AI is a multi-agent large language model framework and open-source community for developing, coordinating, and studying interactions between specialized AI agents.