MetaLlama 3.1Open SourceSep 15, 2024
NeverSleep: Lumimaid v0.2 8B
Lumimaid v0.2 8B is a finetune of Llama 3.1 8B with a "HUGE step up dataset wise" compared to Lumimaid v0.1. Sloppy chats output were purged. Usage of this model is subject to Meta's Acceptable Use Policy.
Context Window
33K
tokens
Max Output
4K
tokens
Released
Sep 15, 2024
Arena Rank
—
Capabilities
👁Vision
🧠Reasoning
🔧Tool Calling
⚡Prompt Caching
🖥Computer Use
🎨Image Generation
Supported Parameters
Frequency Penalty
Reduce repetition
Max Tokens
Output length limit
Presence Penalty
Encourage new topics
Response Format
JSON mode / structured output
Stop Sequences
Custom stop tokens
structured_outputs
Temperature
Controls randomness
Top P
Nucleus sampling
Pricing Comparison
| Router | Input / 1M | Output / 1M | Cached Input / 1M |
|---|---|---|---|
| OpenRouter | $0.09 | $0.60 | — |
| Martian | $0.09 | $0.60 | — |
Benchmarks
Open LLM Leaderboard
AverageOpen LLM Leaderboard
24.41/100IFEvalOpen LLM Leaderboard
50.38/100BBHOpen LLM Leaderboard
31.96/100MATH Lvl 5Open LLM Leaderboard
14.35/100GPQAOpen LLM Leaderboard
8.17/100MUSROpen LLM Leaderboard
12.32/100MMLU-PROOpen LLM Leaderboard
29.29/100Model IDs
OpenRouter
neversleep/llama-3.1-lumimaid-8bHugging FaceNeverSleep/Lumimaid-v0.2-8B ↗
Related Models
Meta#133
Meta: Llama 3.1 405B (base)
33K ctx$0.40/1M in
Meta#178
NVIDIA: Llama 3.1 Nemotron 70B Instruct
131K ctx$1.20/1M in
Meta#180
Meta: Llama 3.1 70B Instruct
131K ctx$0.40/1M in
Meta#232
Meta: Llama 3.1 8B Instruct
16K ctx$0.02/1M in
Meta
Meta Llama 3.1 70B Instruct Turbo
131K ctx$0.88/1M in
Meta
Meta Llama 3.1 8B Instruct Turbo
131K ctx$0.18/1M in