Quit Emailing Yourself

# skills → evaluation → dataset → language-models

1 link tagged with all of: skills + evaluation + dataset + language-models

Click any tag below to further narrow down your results

Links

From Scores to Skills: A Cognitive Diagnosis Framework for Evaluating Financial Large Language Models

This article introduces FinCDM, a framework for assessing financial large language models (LLMs) by evaluating their knowledge and skills rather than relying on a single score. It highlights the creation of a new dataset, CPA-KQA, based on CPA exam questions, which allows for a more nuanced analysis of LLM capabilities in financial contexts. The framework aims to uncover knowledge gaps and enhance model development for real-world applications.

Saved by tldr-importer · Last saved February 14, 2026 · 2 min read

+ financial language-models ✓ evaluation ✓ skills ✓ dataset ✓