Eblogtip.com
  • Categories
    • News
    • Technology
    • Domains
    • Hosting
    • Promotions

Archives

  • September 2023
  • August 2023
  • July 2023
  • June 2023
  • May 2023
  • December 2022

Categories

  • News
  • Technology
  • Uncategorized
eBlogTip
  • Categories
    • News
    • Technology
    • Domains
    • Hosting
    • Promotions
  • Technology

Arthur releases open source tool to help companies find the best LLM for a job

  • August 17, 2023
Total
0
Shares
0
0
0

Arthur, a machine learning morning monitoring startup, has benefited from the interest in generative AI this year, and it has been developing tools to help companies work with LLMs more effectively. Today it is releasing Arthur Bench, an open source tool to help users find the best LLM for a particular set of data.

Adam Wenchel, CEO and co-founder at Arthur says that the company has seen a lot of interest in generative AI and LLMs, and so they have been putting a lot of effort into creating products.

He says that today, and granted we are less than a year since the release of ChatGPT, that companies don’t have an organized way to measure the effectiveness of one tool against another, and that’s why they created Arthur Bench.

“Arthur Bench solves one of the critical problems that we just hear with every customer which is [with all of the model choices], which one is best for your particular application,” Wenchel told TechCrunch.

It comes with a suite of tools you can use to methodically test the performance, but the real value is that it allows you to test and measure how the types of prompts your users would use for your particular application will perform against different LLMs.

Arthur Bench LLM comparison test suite hedging test.

Image Credits: Arthur

“You could potentially test 100 different prompts, and then see how two different LLMs – like how Anthropic compares to OpenAI – on the kinds of prompts that your users are likely to use,” Wenchel said. What’s more, he says that you can do that at scale and make a better decision on which model is best for your particular use case.

Arthur Bench is being released today as an open source tool. There will also be a SaaS version for customers who don’t want to deal with complexity of managing the open source version, or who have larger test requirements, and are willing to pay for that. But for now, Wenchel said they are concentrating on the open source project.

The new tool comes on the heels of the release of Arthur Shield in May, a kind of LLM firewall that is designed to detect hallucinations in models, while protecting against toxic information and private data leaks.


Source link

Total
0
Shares
Share 0
Tweet 0
Pin it 0
Previous Article
  • Technology

Yard Stick lands $10.6M Series A to measure soil carbon

  • August 17, 2023
View Post
Next Article
  • News

The Texas Chain Saw Massacre publisher CEO says demanding players “ain’t it”

  • August 17, 2023
View Post
You May Also Like
View Post
  • Technology

Disability tech startups kill the cynic in me

  • September 23, 2023
View Post
  • Technology

Walmart’s PhonePe launches app store with zero fee in challenge to Google

  • September 23, 2023
View Post
  • Technology

How CFOs can reduce SaaS spend by 30% in these tough times

  • September 22, 2023
View Post
  • Technology

LimeLoop’s sleek reusable mailers seek to replace cardboard boxes

  • September 22, 2023
View Post
  • Technology

AquaLith might have an answer to the US battery material shortage problem

  • September 22, 2023
View Post
  • Technology

Bay Area baby belly beholding Battlefield bounty

  • September 22, 2023
View Post
  • Technology

Unity U-turns on controversial runtime fee and begs forgiveness

  • September 22, 2023
View Post
  • Technology

Pitch Deck Teardown: Transcend’s $20M Series B deck

  • September 22, 2023

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

eBlogTip.com
  • Categories

Input your search keywords and press Enter.