Metaverse Media Group

Google’s Gemini 2.5 now supports “conversational image segmentation”

by The Decoder
22 July 2025

Google has introduced a new feature for its Gemini 2.5 AI model that allows users to analyze and highlight image content directly through natural language prompts.

This “conversational image segmentation” goes beyond traditional image segmentation, which typically identifies objects using fixed categories like “dog,” “car,” or “chair.” Now, Gemini can understand more complex language and apply it to specific parts of an image. The model handles relational queries such as “the person with the umbrella,” logic-based instructions like “all people who are not sitting,” and even abstract concepts such as “clutter” or “damage” that don’t have a clear visual outline. Gemini can also identify image elements that require reading on-screen text – for example, “the pistachio baklava” in a display case – thanks to built-in text recognition. The feature supports multilingual prompts and can provide object labels in other languages, such as French, if needed.

Practical applications

According to Google, this technology can be used in a range of fields. In image editing, for example, designers no longer need to use a mouse or selection tools; they can simply say what they want to select, such as “select the building’s shadow.”

For workplace safety, Gemini can scan photos or videos for violations, like “all people on the construction site without a helmet.”

The feature is also useful in insurance: an adjuster could issue a command like “highlight all houses with storm damage” to automatically tag damaged buildings in aerial images, saving time compared to checking each property manually.

No special models required

Developers can access the feature through the Gemini API. No separate segmentation model is needed: the Gemini model itself has this capability and handles all requests directly.

Results are returned in JSON format, including the coordinates of the selected image area (box_2d), a pixel mask (mask), and the descriptive label (label).
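To make the response format concrete, here is a minimal sketch of how such a JSON result might be parsed. Only the field names box_2d, mask and label come from the article. The sample values, the parse_segments helper, the [y0, x0, y1, x1] ordering on a 0-1000 scale, and the base64 PNG encoding of the mask are assumptions that should be checked against Google's documentation.

```python
import base64
import json

# Hypothetical response text shaped like the fields the article names.
# The placeholder mask bytes are NOT a real image.
_fake_png = base64.b64encode(b"\x89PNG\r\n\x1a\n placeholder").decode("ascii")
SAMPLE_RESPONSE = json.dumps([
    {
        "box_2d": [250, 100, 750, 400],
        "mask": "data:image/png;base64," + _fake_png,
        "label": "the person with the umbrella",
    }
])


def parse_segments(response_text, image_width, image_height):
    """Convert each returned item into pixel coordinates and raw mask bytes."""
    segments = []
    for item in json.loads(response_text):
        # Assumption: box_2d is [y0, x0, y1, x1] on a 0-1000 scale.
        y0, x0, y1, x1 = item["box_2d"]
        box_px = (
            round(x0 / 1000 * image_width),
            round(y0 / 1000 * image_height),
            round(x1 / 1000 * image_width),
            round(y1 / 1000 * image_height),
        )
        # Assumption: mask is a base64 PNG, optionally with a data-URI prefix.
        mask_b64 = item["mask"].split("base64,", 1)[-1]
        segments.append({
            "label": item["label"],
            "box_px": box_px,  # (left, top, right, bottom)
            "mask_png": base64.b64decode(mask_b64),
        })
    return segments


segments = parse_segments(SAMPLE_RESPONSE, image_width=2000, image_height=1000)
print(segments[0]["label"], segments[0]["box_px"])
```

In a real pipeline the decoded mask bytes would typically be handed to an image library, resized to the bounding box, and overlaid on the original picture.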

For best results, Google recommends using the gemini-2.5-flash model and setting the “thinkingBudget” parameter to zero to trigger an immediate response.
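As a rough illustration of that recommendation, the sketch below builds a request body without sending it. The model name and the thinkingBudget setting come from the article. The endpoint URL, the nesting under generationConfig and thinkingConfig, the inline_data image part, the prompt wording, and the build_request helper are assumptions based on the public Gemini REST API and may differ from what Google documents for this feature.

```python
import base64
import json

# Assumption: endpoint pattern of the public Gemini REST API.
MODEL = "gemini-2.5-flash"
ENDPOINT = (
    "https://generativelanguage.googleapis.com/v1beta/models/"
    f"{MODEL}:generateContent"
)


def build_request(image_bytes, query, mime_type="image/png"):
    """Build a JSON-serializable request body; nothing is sent over the network."""
    # Illustrative prompt wording, not an official template.
    prompt = (
        f"Give the segmentation masks for {query}. "
        "Output a JSON list where each entry has the keys "
        '"box_2d", "mask" and "label".'
    )
    return {
        "contents": [{
            "parts": [
                {
                    "inline_data": {
                        "mime_type": mime_type,
                        "data": base64.b64encode(image_bytes).decode("ascii"),
                    }
                },
                {"text": prompt},
            ]
        }],
        # A thinking budget of zero, as the article recommends.
        "generationConfig": {"thinkingConfig": {"thinkingBudget": 0}},
    }


payload = build_request(b"placeholder image bytes", "all people who are not sitting")
print(ENDPOINT)
print(json.dumps(payload["generationConfig"]))
```

Sending the payload would additionally require an API key and an HTTP client; both are left out here so the sketch stays self-contained.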

Developers can try the feature in Google AI Studio or in a Python Colab notebook.

Read the full article on The-Decoder.com