July 24, 2023

Stability AI and CarperAI Lab Introduce Open-Source LLM FreeWilly with Enhanced Reasoning Capabilities

Stability AI and its CarperAI lab have unveiled two new open-source Large Language Models (LLMs), FreeWilly1 and FreeWilly2. The models stand out among LLMs for their enhanced reasoning capabilities.

FreeWilly1 is built on the LLaMA 65B model and fine-tuned on a synthetically generated dataset. FreeWilly2 is built on the LLaMA 2 70B model and performs comparably to GPT-3.5 on certain tasks. The training methodology for both models was influenced by Microsoft’s research, as detailed in the paper “Orca: Progressive Learning from Complex Explanation Traces of GPT-4.” Stability AI prompted language models with high-quality instructions to create a dataset of 600,000 data points, approximately 10% of the size of the dataset used in the original Orca research. Despite the smaller dataset, the FreeWilly models have shown exceptional performance across various benchmarks.

The data generation process involved creating 500,000 examples with a simpler LLM and an additional 100,000 examples with a more capable LLM, for a total of 600,000. To ensure valid comparisons, the datasets were carefully screened to remove examples that originated from evaluation benchmarks. The effectiveness of this synthetically generated dataset is evident in the FreeWilly models’ performance, even though they were trained on a dataset only a tenth the size of the one used in the original Orca paper.
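The screening step can be illustrated with a minimal sketch. The article does not describe how benchmark overlap was actually detected, so the approach below (exact match after lowercasing and whitespace normalization), the helper names normalize and remove_benchmark_overlap, and the sample data are all assumptions made for illustration only.

```python
import re


def normalize(text: str) -> str:
    """Lowercase and collapse whitespace so trivial formatting differences do not hide a match."""
    return re.sub(r"\s+", " ", text.strip().lower())


def remove_benchmark_overlap(examples, benchmark_prompts):
    """Return the examples whose prompt does not match any benchmark prompt.

    Hypothetical helper: exact match after normalization is an assumed,
    simplified stand-in for whatever screening the authors actually used.
    """
    seen = {normalize(p) for p in benchmark_prompts}
    return [ex for ex in examples if normalize(ex["prompt"]) not in seen]


# Made-up data for illustration only.
benchmark_prompts = ["What is 2 + 2?"]
examples = [
    {"prompt": "what is  2 + 2?", "response": "4"},
    {"prompt": "Explain why the sky is blue.", "response": "Rayleigh scattering ..."},
]

clean = remove_benchmark_overlap(examples, benchmark_prompts)
print(len(clean))  # number of examples kept after screening
```

Exact matching only catches verbatim leaks; a real pipeline would probably also need fuzzy or n-gram matching to catch paraphrased benchmark items.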

To evaluate the models, the researchers used EleutherAI’s lm-eval-harness, supplemented with AGIEval. The findings indicate that both FreeWilly models excel at challenging problems in specialized fields such as law and mathematics. They also demonstrate intricate reasoning and a keen understanding of language nuances. The CarperAI team is optimistic about the potential of these models to enhance natural language understanding and is eager to see their innovative applications in the field of artificial intelligence.

For a comprehensive understanding of FreeWilly1 and FreeWilly2, the Reference Article and Project Page provide detailed insights.

LLaMa-2: A New Era in Publicly Available Language Models

LLaMa-2 stands as the leading publicly available language model today, paving the way for the continued evolution and deployment of Large Language Models (LLMs) across various products. Its predecessor, LLaMa-1, laid the foundation by inspiring numerous impactful projects. With LLaMa-2, the prospects for use in diverse applications are even greater, especially because it is free for commercial use.

In a recent interview with the BBC, Nick Clegg, Meta’s president of global affairs, discussed the decision to release LLMs as open-source. According to Clegg, such a move makes these models safer, primarily because it allows in-depth research and analysis by external parties.

Some key observations from Clegg include:

  • LLaMa-2 sets a new standard for safety among open-source models. This assertion is supported by the benchmarks mentioned in the linked article.
  • Addressing concerns about potential existential threats posed by AI, Clegg opined that the discourse might be slightly ahead of the actual technological capabilities. He underlined that most concerns are tied to hypothetical ultra-advanced AI models — those that possess unparalleled intelligence, autonomy, and self-replicating abilities. In stark contrast, Clegg described the open-sourced models from Meta, including LLaMa-2, as markedly rudimentary.
  • While he firmly believes in the regulation of AI, Clegg emphasized that it’s not imperative for every AI model to be open-source.

Meta’s commitment to transparency and to contributing to the broader community is evident in its track record. Over the last ten years, the company has made more than 1,000 models, libraries, and datasets available for public use. Prominent releases include React, PyTorch, and the more recent ‘Segment Anything’ model.

Recently, Meta released the LLaMa-2-Chat models, a significant step for open-source AI. The largest of these models, with 70 billion parameters, is described as comparable to GPT-3.5 on benchmarks. The chat models are fine-tuned using RLHF (Reinforcement Learning from Human Feedback) and have been assessed through human evaluation and on mathematical problem-solving tasks. The 70-billion-parameter model is described as the first of its size to be fine-tuned using RLHF, which makes it especially notable. Meta has made the model entirely free for commercial use. One significant advantage of LLaMa-2-Chat is that it can be used to build ChatGPT analogues without sharing any data with OpenAI, allowing developers and researchers to harness the model’s power while keeping complete control over their data.

About The Author

Damir Yalalov is the team leader, product manager, and editor at Metaverse Post, covering topics such as AI/ML, AGI, LLMs, Metaverse, and Web3-related fields. His articles attract an audience of over a million users every month. He has 10 years of experience in SEO and digital marketing. Damir has been mentioned in Mashable, Wired, Cointelegraph, The New Yorker, Inside.com, Entrepreneur, BeInCrypto, and other publications. He travels between the UAE, Turkey, Russia, and the CIS as a digital nomad. Damir earned a bachelor's degree in physics, which he believes has given him the critical thinking skills needed to be successful in the ever-changing landscape of the internet.
