1 link tagged with all of: open-source + evaluation + ai + research
Click any tag below to further narrow down your results
Links
Bloom is an open source framework that automates the evaluation of AI model behaviors, allowing researchers to specify a desired behavior and generate relevant scenarios for assessment. The tool produces evaluations quickly and offers flexibility in measuring different behavioral traits, complementing existing tools like Petri.