NVIDIA: Llama 3.1 Nemotron 70B Instruct
nvidia/llama-3.1-nemotron-70b-instruct
Created Oct 15, 2024
131,072 context
$1.20/M input tokens
$1.20/M output tokens
Prompt tokens measure input size. Reasoning tokens count the internal thinking generated before a response. Completion tokens reflect total output length.
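As a rough illustration of how the listed rates and context size translate into numbers, the sketch below estimates the cost of a single request and checks whether it fits in the context window. The function names `estimate_cost_usd` and `fits_context` are hypothetical helpers, not part of any provider API. It assumes that the completion token count already includes any reasoning tokens, and that the 131,072-token context is shared between prompt and completion; neither assumption is confirmed by the listing.

```python
# Rates and context size taken from the listing above.
INPUT_USD_PER_MILLION = 1.20
OUTPUT_USD_PER_MILLION = 1.20
CONTEXT_TOKENS = 131_072


def estimate_cost_usd(prompt_tokens: int, completion_tokens: int) -> float:
    """Estimate the cost of one request in US dollars.

    Assumption: completion_tokens already includes any reasoning tokens.
    """
    input_cost = prompt_tokens / 1_000_000 * INPUT_USD_PER_MILLION
    output_cost = completion_tokens / 1_000_000 * OUTPUT_USD_PER_MILLION
    return input_cost + output_cost


def fits_context(prompt_tokens: int, completion_tokens: int) -> bool:
    """Check whether prompt plus completion stays within the context size.

    Assumption: the context window is shared by input and output.
    """
    return prompt_tokens + completion_tokens <= CONTEXT_TOKENS


if __name__ == "__main__":
    # Example: a 2,000-token prompt with a 500-token completion.
    cost = estimate_cost_usd(2_000, 500)
    print(f"Estimated cost: ${cost:.6f}")
    print(f"Fits in context: {fits_context(2_000, 500)}")
```

Because the input and output rates are the same for this model, the estimate depends only on the total number of tokens; the two rates are kept separate so the sketch still applies if they differ.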