General use chat model based on Llama and Llama 2 with 2K to 16K context sizes.
7b
13b
33b
166.8K Pulls Updated 14 months ago
Updated 14 months ago
14 months ago
86f0704901a4 · 18GB
model
archllama
·
parameters32.5B
·
quantizationQ4_0
18GB
params
{
"stop": [
"USER:",
"ASSISTANT:"
]
}
31B
template
{{ .System }}
USER: {{ .Prompt }}
ASSISTANT:
45B
system
A chat between a curious user and an artificial intelligence assistant. The assistant gives helpful,
154B
Readme
Vicuna is a chat assistant model. It includes 3 different variants in 3 different sizes. v1.3 is trained by fine-tuning Llama and has a context size of 2048 tokens. v1.5 is trained by fine-tuning Llama 2 and has a context size of 2048 tokens. v1.5-16k is trained by fine-tuning Llama 2 and has a context size of 16k tokens. All three variants are trained using conversations collected from ShareGPT.
Example prompts
What is the meaning of life? Explain it in 5 paragraphs.