Friday, 20 Mar 2026
Subscribe
logo
  • Global
  • AI
  • Cloud Computing
  • Edge Computing
  • Security
  • Investment
  • Sustainability
  • More
    • Colocation
    • Quantum Computing
    • Regulation & Policy
    • Infrastructure
    • Power & Cooling
    • Design
    • Innovations
    • Blog
Font ResizerAa
Data Center NewsData Center News
Search
  • Global
  • AI
  • Cloud Computing
  • Edge Computing
  • Security
  • Investment
  • Sustainability
  • More
    • Colocation
    • Quantum Computing
    • Regulation & Policy
    • Infrastructure
    • Power & Cooling
    • Design
    • Innovations
    • Blog
Have an existing account? Sign In
Follow US
© 2022 Foxiz News Network. Ruby Design Company. All Rights Reserved.
Data Center News > Blog > AI > Apple researchers develop AI that can ‘see’ and understand screen context
AI

Apple researchers develop AI that can ‘see’ and understand screen context

Last updated: April 1, 2024 7:48 pm
Published April 1, 2024
Share
Apple researchers develop AI that can 'see' and understand screen context
SHARE

Be part of us in Atlanta on April tenth and discover the panorama of safety workforce. We are going to discover the imaginative and prescient, advantages, and use circumstances of AI for safety groups. Request an invitation right here.


Apple researchers have developed a brand new synthetic intelligence system that may perceive ambiguous references to on-screen entities in addition to conversational and background context, enabling extra pure interactions with voice assistants, in line with a paper revealed on Friday.

The system, referred to as ReALM (Reference Resolution As Language Modeling), leverages massive language fashions to transform the complicated activity of reference decision — together with understanding references to visible components on a display screen — right into a pure language modeling downside. This enables ReALM to attain substantial efficiency features in comparison with present strategies.

“With the ability to perceive context, together with references, is crucial for a conversational assistant,” wrote the staff of Apple researchers. “Enabling the person to problem queries about what they see on their display screen is an important step in guaranteeing a real hands-free expertise in voice assistants.”

Enhancing conversational assistants

To sort out screen-based references, a key innovation of ReALM is reconstructing the display screen utilizing parsed on-screen entities and their places to generate a textual illustration that captures the visible format. The researchers demonstrated that this method, mixed with fine-tuning language fashions particularly for reference decision, may outperform GPT-4 on the duty.

VB Occasion

The AI Impression Tour – Atlanta

Persevering with our tour, we’re headed to Atlanta for the AI Impression Tour cease on April tenth. This unique, invite-only occasion, in partnership with Microsoft, will function discussions on how generative AI is reworking the safety workforce. Area is proscribed, so request an invitation right this moment.

See also  Telefónica's Wayra backs AI answer engine Perplexity

Request an invitation

Apple’s AI system, ReALM, can perceive references to on-screen entities just like the “260 Pattern Sale” itemizing proven on this mockup, enabling extra pure interactions with voice assistants. (Picture Credit score: arxiv.org)

“We reveal massive enhancements over an present system with comparable performance throughout various kinds of references, with our smallest mannequin acquiring absolute features of over 5% for on-screen references,” the researchers wrote. “Our bigger fashions considerably outperform GPT-4.”

Sensible functions and limitations

The work highlights the potential for targeted language fashions to deal with duties like reference decision in manufacturing programs the place utilizing large end-to-end fashions is infeasible on account of latency or compute constraints. By publishing the analysis, Apple is signaling its persevering with investments in making Siri and different merchandise extra conversant and context-aware.

Nonetheless, the researchers warning that counting on automated parsing of screens has limitations. Dealing with extra complicated visible references, like distinguishing between a number of photographs, would seemingly require incorporating pc imaginative and prescient and multi-modal strategies.

Apple races to shut AI hole as rivals soar

Apple is quietly making vital strides in synthetic intelligence analysis, even because it trails tech rivals within the race to dominate the fast-moving AI panorama.

From multimodal fashions that mix imaginative and prescient and language, to AI-powered animation instruments, to strategies for constructing high-performing specialised AI on a funds, a gentle drumbeat of breakthroughs from the corporate’s analysis labs recommend its AI ambitions are quickly escalating.

However the famously secretive tech big faces stiff competitors from the likes of Google, Microsoft, Amazon and OpenAI, who’ve aggressively productized generative AI in search, workplace software program, cloud providers and extra.

Apple, lengthy a quick follower somewhat than a primary mover, now confronts a market being remodeled at breakneck pace by synthetic intelligence. At its carefully watched Worldwide Developers Conference in June, the corporate is predicted to unveil a brand new massive language mannequin framework, an “Apple GPT” chatbot, and different AI-powered options throughout its ecosystem.

See also  Google’s Jules aims to out-code Codex in battle for the AI developer stack

“We’re excited to share particulars of our ongoing work in AI later this 12 months,” CEO Tim Cook recently hinted on an earnings name. Regardless of its attribute opacity, it’s clear Apple’s AI efforts are sweeping in scope.

But because the battle for AI supremacy heats up, the iPhone maker’s lateness to the get together has put it in an uncharacteristic place of weak spot. Deep coffers, model loyalty, elite engineering and a tightly built-in product portfolio give it a puncher’s probability — however there aren’t any ensures on this excessive stakes contest.

A brand new age of ubiquitous, actually clever computing is on the horizon. Come June, we’ll see if Apple has executed sufficient to make sure it has a hand in shaping it.

Source link

Contents
Enhancing conversational assistantsSensible functions and limitationsApple races to shut AI hole as rivals soar
TAGGED: Apple, context, develop, researchers, screen, Understand
Share This Article
Twitter Email Copy Link Print
Previous Article Nucor to Acquire Southwest Data Products, for $115M Nucor to Acquire Southwest Data Products, for $115M
Next Article construction site barricades Microsoft unveils safety and security tools for generative AI
Leave a comment

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Your Trusted Source for Accurate and Timely Updates!

Our commitment to accuracy, impartiality, and delivering breaking news as it happens has earned us the trust of a vast audience. Stay ahead with real-time updates on the latest events, trends.
FacebookLike
TwitterFollow
InstagramFollow
YoutubeSubscribe
LinkedInFollow
MediumFollow
- Advertisement -
Ad image

Popular Posts

OneChronos Raises $32M in Funding

OneChronos, a NYC-based know-how firm leveraging advances in public sale concept and laptop science to…

November 20, 2024

Nokia, A1, Microsoft to pioneer inaugural 5G edge cloud network slicing solution

Nokia and A1 Austria (A1) have introduced the completion of the first-ever 5G edge cloud…

February 16, 2024

Data Center Mechanical Construction Market size is set to grow by USD 18.12 billion from 2024-2028, Growing investments in data center construction boost the market, Technavio

NEW YORK, July 12, 2024 /PRNewswire/ -- The worldwide knowledge middle mechanical development market measurement…

July 13, 2024

Quantum computing is set to destroy crypto. Could cloud-based quantum-proof encryption be the solution?

Whereas nonetheless in its early phases, Quantum Computing is predicted to revolutionise problem-solving and knowledge…

June 3, 2024

SingleOps Merges With Landscape Management Network; Receives Investment

SingleOps, an Atlanta, GA-based supplier of arbor and panorama enterprise software program, introduced its merger…

November 5, 2024

You Might Also Like

NVIDIA Agent Toolkit Gives Enterprises a Framework to Deploy AI Agents at Scale
AI

NVIDIA Agent Toolkit Gives Enterprises a Framework to Deploy AI Agents at Scale

By saad
Visa prepares payment systems for AI agent-initiated transactions
AI

Visa prepares payment systems for AI agent-initiated transactions

By saad
For effective AI, insurance needs to get its data house in order
AI

For effective AI, insurance needs to get its data house in order

By saad
Mastercard keeps tabs on fraud with new foundation model
AI

Mastercard keeps tabs on fraud with new foundation model

By saad
Data Center News
Facebook Twitter Youtube Instagram Linkedin

About US

Data Center News: Stay informed on the pulse of data centers. Latest updates, tech trends, and industry insights—all in one place. Elevate your data infrastructure knowledge.

Top Categories
  • Global Market
  • Infrastructure
  • Innovations
  • Investments
Usefull Links
  • Home
  • Contact
  • Privacy Policy
  • Terms & Conditions

© 2024 – datacenternews.tech – All rights reserved

Welcome Back!

Sign in to your account

Lost your password?
We use cookies to ensure that we give you the best experience on our website. If you continue to use this site we will assume that you are happy with it.
You can revoke your consent any time using the Revoke consent button.