Deepmark AI - LLM benchmarking tool for task-specific metrics on your data

in #steemhunt6 months ago

Deepmark AI

LLM benchmarking tool for task-specific metrics on your data


Screenshots

Screenshot (3).png


Hunter's comment

Deepmark AI is a benchmarking tool that enables assessment of several large language models (LLM) on various extrinsic (task-specific) metrics (e.g. accuracy, relevance, failure rate, latency, etc) on your own data, so your AI apps have reliable performance.


Link

https://github.com/IngestAI/deepmark



Steemhunt.com

This is posted on Steemhunt - A place where you can dig products and earn STEEM.
View on Steemhunt.com

Sort:  

Congratulations!

We have upvoted your post for your contribution within our community.
Thanks again and look forward to seeing your next hunt!

Want to chat? Join us on:

Coin Marketplace

STEEM 0.27
TRX 0.11
JST 0.030
BTC 69093.99
ETH 3768.57
USDT 1.00
SBD 3.44