Athene-V2 is a 72B parameter model which excels at code completion, mathematics, and log extraction tasks.

tools 72b

59.4K 5 weeks ago

Readme

Athene-V2

Nexusflow’s Athene-V2 chat model, built on Qwen 2.5’s 72B foundation, achieves GPT-4o-level performance across key benchmarks while demonstrating how targeted optimization can enhance specific capabilities beyond traditional scaling approaches.

Model Features

  • 72B parameters fine-tuned from Qwen 2.5
  • State-of-the-art chat performance matching or exceeding GPT-4o
  • Superior code completion (ranking #2 on bigcode-bench-hard)
  • Enhanced mathematics capabilities (MATH benchmark)
  • Precise long-form log extraction
  • Advanced post-training pipeline pushing the Pareto frontier

References

Blog post

HuggingFace