Bitcoin

Bitcoin

$111,132.20

BTC 2.18%

Ethereum

Ethereum

$2,778.16

ETH 6.37%

  • Login
  • Register
Metaverse Media Group
  • Home
  • Crypto
  • NFTs
  • Artificial Intelligence
  • More
    • Technology
    • Business
    • Newsletter
No Result
View All Result
  • Home
  • Crypto
  • NFTs
  • Artificial Intelligence
  • More
    • Technology
    • Business
    • Newsletter
No Result
View All Result
Metaverse Media Group

A New Kind of AI Model Lets Data Owners Take Control

A New Kind of AI Model Lets Data Owners Take Control

Wiredby Wired
9 July 2025
A new kind of large language model, developed by researchers at the Allen Institute for AI (Ai2), makes it possible to control how training data is used even after a model has been built.The new model, called FlexOlmo, could challenge the current industry paradigm of big artificial intelligence companies slurping up data from the web, books, and other sources—often with…
image

A new kind of large language model, developed by researchers at the Allen Institute for AI (Ai2), makes it possible to control how training data is used even after a model has been built.

The new model, called FlexOlmo, could challenge the current industry paradigm of big artificial intelligence companies slurping up data from the web, books, and other sources—often with little regard for ownership—and then owning the resulting models entirely. Once data is baked into an AI model today, extracting it from that model is a bit like trying to recover the eggs from a finished cake.

“Conventionally, your data is either in or out,” says Ali Farhadi, CEO of Ai2, based in Seattle, Washington. “Once I train on that data, you lose control. And you have no way out, unless you force me to go through another multi-million-dollar round of training.”

Ai2’s avant-garde approach divides up training so that data owners can exert control. Those who want to contribute data to a FlexOlmo model can do so by first copying a publicly shared model known as the “anchor.” They then train a second model using their own data, combine the result with the anchor model, and contribute the result back to whoever is building the third and final model.

Contributing in this way means that the data itself never has to be handed over. And because of how the data owner’s model is merged with the final one, it is possible to extract the data later on. A magazine publisher might, for instance, contribute text from its archive of articles to a model but later remove the sub-model trained on that data if there is a legal dispute or if the company objects to how a model is being used.

“The training is completely asynchronous,” says Sewon Min, a research scientist at Ai2 who led the technical work. “Data owners do not have to coordinate, and the training can be done completely independently.”

The FlexOlmo model architecture is what’s known as a “mixture of experts,” a popular design that is normally used to simultaneously combine several sub-models into a bigger, more capable one. A key innovation from Ai2 is a way of merging sub-models that were trained independently. This is achieved using a new scheme for representing the values in a model so that its abilities can be merged with others when the final combined model is run.

To test the approach, the FlexOlmo researchers created a dataset they call Flexmix from proprietary sources including books and websites. They used the FlexOlmo design to build a model with 37 billion parameters, about a tenth of the size of the largest open source model from Meta. They then compared their model to several others. They found that it outperformed any individual model on all tasks and also scored 10 percent better at common benchmarks than two other approaches for merging independently trained models.

The result is a way to have your cake—and get your eggs back, too. “You could just opt out of the system without any major damage and inference time,” Farhadi says. “It’s a whole new way of thinking about how to train these models.”

Percy Liang, an AI researcher at Stanford, says the Ai2 approach seems like a promising idea. “Providing more modular control over data—especially without retraining—is a refreshing direction that challenges the status quo of thinking of language models as monolithic black boxes,” he says. “Openness of the development process—how the model was built, what experiments were run, how decisions were made—is something that’s missing.”

Farhadi and Min say that the FlexOlmo approach might also make it possible for AI firms to access sensitive private data in a more controlled way, because that data does not need to be disclosed in order to build the final model. However, they warn that it may be possible to reconstruct data from the final model, so a technique like differential privacy, which allows data to be contributed with mathematically guaranteed privacy, might be required to ensure data is kept safe.

Ownership of the data used to train large AI models has become a big legal issue in recent years. Some publishers are suing large AI companies while others are cutting deals to grant access to their content. (WIRED parent company Condé Nast has a deal in place with OpenAI.)

In June, Meta won a major copyright infringement case when a federal judge ruled that the company did not violate the law by training its open source model on text from books by 13 authors.

Min says it may well be possible to build new kinds of open models using the FlexOlmo approach. “I really think the data is the bottleneck in building the state of the art models,” she says. “This could be a way to have better shared models where different data owners can codevelop, and they don’t have to sacrifice their data privacy or control.”

Read the full article on Wired.com
in Business
Reading Time: 4 mins read
0
0
21
VIEWS
Share on TwitterShare on Facebook

Subscribe to our newsletter

For the latest news & monthly prize giveaways
Join Now

Subscribe to our newsletter

For the latest news & monthly prize giveaways
Join Now
ADVERTISEMENT

Related Posts

Elon Musk Unveils Grok 4 Amid Controversy Over Chatbot’s Antisemitic Posts
Business

Elon Musk Unveils Grok 4 Amid Controversy Over Chatbot’s Antisemitic Posts

3 hours ago
21
Linda Yaccarino Tried to Tame X. Now She’s Out as CEO
Business

Linda Yaccarino Tried to Tame X. Now She’s Out as CEO

18 hours ago
21
‘People Are Going to Die’: A Malnutrition Crisis Looms in the Wake of USAID Cuts
Business

‘People Are Going to Die’: A Malnutrition Crisis Looms in the Wake of USAID Cuts

1 day ago
21

Comments

Please login to join discussion
ADVERTISEMENT

Latest News

  • All
  • Crypto
  • NFTs
  • Technology
  • Business
Crypto Takes Flight: Emirates and Dubai Duty Free Announce Crypto Payment Plans
Crypto

Crypto Takes Flight: Emirates and Dubai Duty Free Announce Crypto Payment Plans

Bitcoin.com News
by Bitcoin.com News
32 minutes ago
19
This tool strips away anti-AI protections from digital art
Technology

This tool strips away anti-AI protections from digital art

Techonolgy Review
by Techonolgy Review
1 hour ago
19
Tron’s MAGA Moment? Justin Sun Commits $100M to TRUMP Meme Coin
Crypto

Tron’s MAGA Moment? Justin Sun Commits $100M to TRUMP Meme Coin

Bitcoin.com News
by Bitcoin.com News
2 hours ago
21
KULR Mining Hits 750 PH/s With New Bitmain Mining Rigs Stationed in Paraguay
Crypto

KULR Mining Hits 750 PH/s With New Bitmain Mining Rigs Stationed in Paraguay

Bitcoin.com News
by Bitcoin.com News
3 hours ago
20
Elon Musk Unveils Grok 4 Amid Controversy Over Chatbot’s Antisemitic Posts
Business

Elon Musk Unveils Grok 4 Amid Controversy Over Chatbot’s Antisemitic Posts

Wired
by Wired
3 hours ago
21
Remixpoint Commits $215 Million to Bitcoin, Targets 3,000 BTC Reserve
Crypto

Remixpoint Commits $215 Million to Bitcoin, Targets 3,000 BTC Reserve

Bitcoin.com News
by Bitcoin.com News
4 hours ago
21
Load More
Next Post
Meta Acquires $3.5 Billion Stake in Ray-Ban Maker to Expand AI Glasses Strategy

Meta Acquires $3.5 Billion Stake in Ray-Ban Maker to Expand AI Glasses Strategy

ADVERTISEMENT

Follow Us

Categories

  • Crypto
  • NFTs
  • AI
  • Technology
  • Business
  • Crypto
  • NFTs
  • AI
  • Technology
  • Business
Subscribe to our Newsletter

© 2022 Metaverse Media Group – The Metaverse Mecca

Privacy and Cookie Policy | Sitemap

Welcome Back!

Sign In with Google
OR

Login to your account below

Forgotten Password? Sign Up

Create New Account!

Sign Up with Google
OR

Fill the forms below to register

*By registering into our website, you agree to the Terms & Conditions and Privacy Policy.
All fields are required. Log In

Retrieve your password

Please enter your username or email address to reset your password.

Log In
No Result
View All Result
  • Home
  • Crypto
  • NFTs
  • Artificial Intelligence
  • More
    • Technology
    • Business
    • Newsletter
Bitcoin

Bitcoin

$111,132.20

BTC 2.18%

Ethereum

Ethereum

$2,778.16

ETH 6.37%

  • Login
  • Sign Up
This website uses cookies. By continuing to use this website you are giving consent to cookies being used. Visit our Privacy and Cookie Policy.

Subscribe to our newsletter

Get the latest news & win monthly prizes

Subscribe to our newsletter

For the Latest News and Monthly Prize Giveaways

Join Now
Join Now