NVIDIA: Llama 3.1 Nemotron 70B Instruct

nvidia/llama-3.1-nemotron-70b-instruct

Released Oct 15, 2024131,072 context

$1.20/M input tokens$1.20/M output tokens

NVIDIA's Llama 3.1 Nemotron 70B is a language model designed for generating precise and useful responses. Leveraging Llama 3.1 70B architecture and Reinforcement Learning from Human Feedback (RLHF), it excels in automatic alignment benchmarks. This model is tailored for applications requiring high accuracy in helpfulness and response generation, suitable for diverse user queries across multiple domains.

Usage of this model is subject to Meta's Acceptable Use Policy.

OpenRouter

Product

Chat
Rankings
Models
Providers
Pricing
Enterprise

Company

About
Announcements
CareersHiring
Privacy
Terms of Service
Support
State of AI
Works With OR

Developer

Documentation
API Reference
SDK
Status

Connect

Discord
GitHub
LinkedIn
X
YouTube

NVIDIA: Llama 3.1 Nemotron 70B Instruct

nvidia/llama-3.1-nemotron-70b-instruct

Released Oct 15, 2024131,072 context

$1.20/M input tokens$1.20/M output tokens

Usage of this model is subject to Meta's Acceptable Use Policy.

Recent activity on Llama 3.1 Nemotron 70B Instruct

Total usage per day on OpenRouter

Prompt

8.88M

Completion

1.58M