Project
ask-eval
Test LLM outputs with Minitest-native assertions. LLM-as-judge for faithfulness, hallucination, bias, toxicity. Deterministic assertions (contains, regex, JSON). CI-native: GitHub annotations, JUnit output, cost tracking.
2005
2006
2007
2008
2009
2010
2011
2012
2013
2014
2015
2016
2017
2018
2019
2020
2021
2022
2023
2024
2025
2026
Development
Licenses
MIT