You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Our goal is to create a plugin-based service that can wrap over Large Language Models to enable transparency and interpretation of results.
We leverage Apache MLFlow LLM Eval APIs for auto-evaluation based on a pre-defined set of metrics based on the task,
This work is still in progress and the full code is not public as of now.