July 24, 2023

StabilityAI and CarperAI Lab Introduce Open-Source LLM FreeWilly with Enhanced Reasoning Capabilities

by Damir Yalalov

Published: July 24, 2023 at 4:34 am Updated: July 24, 2023 at 4:34 am

by Danil Myakin

Edited and fact-checked: July 24, 2023 at 4:34 am

StabilityAI and CarperAI team has unveiled two new open-source Large Language Models (LLMs) named FreeWilly1 and FreeWilly2. These models stand out in the field of LLMs due to their enhanced reasoning capabilities.

Stability AI and CarperAI Lab Introduce FreeWilly with Enhanced Reasoning Capabilities — Credit: PR Newswire

FreeWilly1 is constructed on the LLaMA 65B model and has undergone fine-tuning with a synthetically generated dataset. FreeWilly2 is built on the LLaMA 2 70B model and exhibits performance comparable to GPT-3.5 for certain tasks. The training methodologies for these models were influenced by Microsoft’s research, as detailed in their paper titled “Orca: Progressive Learning from Complex Explanation Traces of GPT-4.” Stability AI’s approach involved prompting language models with high-quality instructions to create a dataset containing 600,000 data points. This dataset size is approximately 10% of what was used in the original Orca research. Despite this reduced dataset size, the FreeWilly models have shown exceptional performance across various benchmarks.

The data generation process involved creating 500,000 cases using a less intricate LLM model and an additional 100,000 cases with a more complex LLM model. To ensure valid comparisons, the datasets were meticulously screened to remove cases that originated from evaluation benchmarks. The effectiveness of this synthetically generated dataset is evident in the FreeWilly models’ performance, even though they were trained on a dataset only a tenth the size of the original Orca paper.

For the evaluation of these models, the researchers employed EleutherAI, supplemented with AGIEval. The findings indicate that both FreeWilly models excel in addressing challenging issues in specialized fields such as law and mathematics. They also demonstrate intricate reasoning and a keen understanding of language nuances. The CarperAI team is optimistic about the potential of these models to enhance our comprehension of spoken language and is eager to witness their innovative applications in the field of artificial intelligence.

For a comprehensive understanding of FreeWilly1 and FreeWilly2, the Reference Article and Project Page provide detailed insights.

LLaMa-2: A New Era in Public Domain Language Models

LLaMa-2 stands as the premier language model in the public domain today, paving the way for the continued evolution and deployment of Large Language Models (LLMs) across various products. Its predecessor, LLaMa-1, laid the foundation by inspiring numerous impactful projects. With the introduction of LLaMa-2, the prospects for utilization in diverse applications are even greater, especially given its provision for free commercial use.

In a recent dialogue with the BBC, Nick Clegg, a notable figure from Meta, discussed the decision to release LLMs as open-source. According to Clegg, such a move enhances the safety of these models, primarily because it facilitates in-depth research and analysis from external entities.

Some key observations from Clegg include:

LLaMa-2 sets a new standard in security amongst open-source models. This assertion finds support in the benchmarks mentioned in the linked article.
Addressing concerns about potential existential threats posed by AI, Clegg opined that the discourse might be slightly ahead of the actual technological capabilities. He underlined that most concerns are tied to hypothetical ultra-advanced AI models — those that possess unparalleled intelligence, autonomy, and self-replicating abilities. In stark contrast, Clegg described the open-sourced models from Meta, including LLaMa-2, as markedly rudimentary.
While he firmly believes in the regulation of AI, Clegg emphasized that it’s not imperative for every AI model to be open-source.

Meta’s commitment to transparency and contribution to the broader community is evident in their decade-long track record. Over the last ten years, the company has made available over 1000 models, libraries, and datasets for public use. Prominent releases include React, PyTorch, and the more recent ‘Segment Anything‘ model.

Recently, Meta has released LLaMa-2-Chat models, a significant breakthrough in open-source AI. These models, with 70 billion parameters, are comparable to GPT-3.5 and surpass benchmarks. They are fine-tuned using RLHF (Reinforcement Learning from Human Feedback) and offer personalized ChatGPT equivalents, human evaluation metrics, and mathematical problem-solving capabilities. The model is the first of its size to be fine-tuned using RLHF, making it even more notable. Meta has made this model entirely free for commercial use. One significant advantage of LLaMa-2-Chat is its potential to create ChatGPT analogues without sharing any data with OpenAI, allowing developers and researchers to harness the model’s power while maintaining complete control over their data.

Read more about AI:

Tags:

Disclaimer

In line with the Trust Project guidelines, please note that the information provided on this page is not intended to be and should not be interpreted as legal, tax, investment, financial, or any other form of advice. It is important to only invest what you can afford to lose and to seek independent financial advice if you have any doubts. For further information, we suggest referring to the terms and conditions as well as the help and support pages provided by the issuer or advertiser. MetaversePost is committed to accurate, unbiased reporting, but market conditions are subject to change without notice.

About The Author

Damir is the team leader, product manager, and editor at Metaverse Post, covering topics such as AI/ML, AGI, LLMs, Metaverse, and Web3-related fields. His articles attract a massive audience of over a million users every month. He appears to be an expert with 10 years of experience in SEO and digital marketing. Damir has been mentioned in Mashable, Wired, Cointelegraph, The New Yorker, Inside.com, Entrepreneur, BeInCrypto, and other publications. He travels between the UAE, Turkey, Russia, and the CIS as a digital nomad. Damir earned a bachelor's degree in physics, which he believes has given him the critical thinking skills needed to be successful in the ever-changing landscape of the internet.

Damir Yalalov

Hot Stories

News Report Technology

CODESPECT Rolls Out SpecSiege, A Curated Audit Contest Platform For Web3 Security

by Alisa Davidson

July 27, 2026

Opinion Technology

At 30× Less Compute, Induction Labs’ ‘Imagination Model’ Outperforms Google By Watching

by Alisa Davidson

July 27, 2026

News Report Technology

Beyond The Signal: KuCoin Marks 9th Anniversary With Multi-Track Competition Offering Up To 650,000 USDT

by Alisa Davidson

July 27, 2026

Digest News Report Technology

Gate Update: CXMT Futures Launch On 470% IPO Surge, Exchange Leads CEX Inflows With $288M

by Alisa Davidson

July 27, 2026

StabilityAI and CarperAI Lab Introduce Open-Source LLM FreeWilly with Enhanced Reasoning Capabilities

LLaMa-2: A New Era in Public Domain Language Models

Disclaimer

About The Author

CODESPECT Rolls Out SpecSiege, A Curated Audit Contest Platform For Web3 Security

At 30× Less Compute, Induction Labs’ ‘Imagination Model’ Outperforms Google By Watching

Beyond The Signal: KuCoin Marks 9th Anniversary With Multi-Track Competition Offering Up To 650,000 USDT

Gate Update: CXMT Futures Launch On 470% IPO Surge, Exchange Leads CEX Inflows With $288M

CODESPECT Rolls Out SpecSiege, A Curated Audit Contest Platform For Web3 Security

Beyond The Signal: KuCoin Marks 9th Anniversary With Multi-Track Competition Offering Up To 650,000 USDT

Gate Update: CXMT Futures Launch On 470% IPO Surge, Exchange Leads CEX Inflows With $288M

Strict 2x Crypto Leverage Limits Driving Capital Away From Japan, Senior LDP Official Warns

How Minmax Is Building The Professional AI Trading Terminal Prediction Markets Still Lack In 2026

The Calm Before The Solana Storm: What Charts, Whales, And On-Chain Signals Are Saying Now