A commercial-friendly small language model by NVIDIA optimized for roleplay, RAG QA, and function calling.
tools
4b
45.7K Pulls Updated 3 months ago
Updated 3 months ago
3 months ago
9e36e563dbdd · 3.1GB
model
archnemotron
·
parameters4.19B
·
quantizationQ5_K_M
3.1GB
template
{{- if (or .Tools .System) }}<extra_id_0>System
{{ if .System }}{{ .System }}
{{ end }}
{{- if .To
773B
license
NVIDIA AI Foundation Models Community License Agreement
IMPORTANT NOTICE – PLEASE READ AND AGREE B
15kB
Readme
Nemotron-Mini-4B-Instruct is a model for generating responses for roleplaying, retrieval augmented generation, and function calling. It is a small language model (SLM) optimized through distillation, pruning and quantization for speed and on-device deployment.
This instruct model is optimized for roleplay, RAG QA, and function calling in English. It supports a context length of 4,096 tokens. This model is ready for commercial use.