Use the vitals package with ellmer to evaluate and compare the accuracy of LLMs, including writing evals to test local models.
The demo opens a Neutralino app. Clicking the blue link sends a Ping to Python, which replies with Pong, illustrating data flow in both directions. Before running the demo, adapt the path ...
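
The Ping/Pong round trip can be modelled on the Python side roughly as shown here. This is only a sketch of the message handling, not the demo's actual code: the JSON message shape, the `event` and `data` keys, and the `handle_message` function are all assumptions made for illustration, and the transport that carries messages between the Neutralino app and Python is left out.

```python
import json


def handle_message(raw: str) -> str:
    """Answer one JSON-encoded message from the frontend.

    The message format ({"event": ..., "data": ...}) is hypothetical;
    the real demo may use a different structure.
    """
    msg = json.loads(raw)
    event = msg.get("event")
    if event == "ping":
        # Frontend -> Python was a Ping, so Python -> frontend is a Pong.
        reply = {"event": "pong", "data": "Pong"}
    else:
        reply = {"event": "error", "data": f"unknown event: {event}"}
    return json.dumps(reply)


# Simulate the click on the link: the app sends Ping, Python answers Pong.
request = json.dumps({"event": "ping", "data": "Ping"})
print(handle_message(request))
```

Keeping the handler a pure function of the incoming string makes it easy to test without starting the app; whichever transport the demo uses would only need to pass the received text in and send the returned text back.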