Fin-R1 — Open-Source Financial Reasoning LLM
Fin-R1 is a 7B-parameter open-source financial reasoning model based on Qwen2.5-7B-Instruct, designed for complex reasoning tasks across banking, securities, insurance and trust businesses.
Approach
- Distilled 60K financial chain-of-thought (CoT) samples from DeepSeek-R1.
- Two-round quality filtering: rule-based + model scoring + logical-consistency verification.
- Two-stage training: SFT for behavior cloning, then GRPO for reasoning refinement.
- Verifier-based reward signal to improve reliability.
Results
- FinEval 75.2 — 2nd place overall.
- Near-SOTA on FinQA (76.0) and ConvFinQA (85.0).
- Outperforms all comparable open models at the 7B scale.
- One-click deployment via vLLM.
