Saturday, 21 Mar 2026
Subscribe
logo
  • Global
  • AI
  • Cloud Computing
  • Edge Computing
  • Security
  • Investment
  • Sustainability
  • More
    • Colocation
    • Quantum Computing
    • Regulation & Policy
    • Infrastructure
    • Power & Cooling
    • Design
    • Innovations
    • Blog
Font ResizerAa
Data Center NewsData Center News
Search
  • Global
  • AI
  • Cloud Computing
  • Edge Computing
  • Security
  • Investment
  • Sustainability
  • More
    • Colocation
    • Quantum Computing
    • Regulation & Policy
    • Infrastructure
    • Power & Cooling
    • Design
    • Innovations
    • Blog
Have an existing account? Sign In
Follow US
© 2022 Foxiz News Network. Ruby Design Company. All Rights Reserved.
Data Center News > Blog > Innovations > Using AI to turn sound recordings into accurate street images
Innovations

Using AI to turn sound recordings into accurate street images

Last updated: November 27, 2024 6:44 pm
Published November 27, 2024
Share
Using AI to turn sound recordings into accurate street images
SHARE
Credit score: College of Texas at Austin

Utilizing generative synthetic intelligence, a workforce of researchers at The College of Texas at Austin has transformed sounds from audio recordings into street-view photos. The visible accuracy of those generated photos demonstrates that machines can replicate human connection between audio and visible notion of environments.

In a paper published in Computer systems, Surroundings and City Methods, the analysis workforce describes coaching a soundscape-to-image AI mannequin utilizing audio and visible information gathered from quite a lot of city and rural streetscapes after which utilizing that mannequin to generate photos from audio recordings.

“Our examine discovered that acoustic environments comprise sufficient visible cues to generate extremely recognizable streetscape photos that precisely depict totally different locations,” stated Yuhao Kang, assistant professor of geography and the surroundings at UT and co-author of the examine. “This implies we will convert the acoustic environments into vivid visible representations, successfully translating sounds into sights.”

Utilizing YouTube video and audio from cities in North America, Asia and Europe, the workforce created pairs of 10-second audio clips and picture stills from the assorted places and used them to coach an AI mannequin that might produce high-resolution photos from audio enter. They then in contrast AI sound-to-image creations made out of 100 audio clips to their respective real-world images, utilizing each human and laptop evaluations.

Pc evaluations in contrast the relative proportions of greenery, constructing and sky between supply and generated photos, whereas human judges have been requested to appropriately match certainly one of three generated photos to an audio pattern.

Researchers use AI to turn sound recordings into accurate street images
Credit score: College of Texas at Austin

The outcomes confirmed robust correlations within the proportions of sky and greenery between generated and real-world photos and a barely lesser correlation in constructing proportions. And human contributors averaged 80% accuracy in choosing the generated photos that corresponded to supply audio samples.

See also  EU and Japan advance partnership for digital transformation

“Historically, the power to examine a scene from sounds is a uniquely human functionality, reflecting our deep sensory reference to the surroundings. Our use of superior AI strategies supported by massive language fashions (LLMs) demonstrates that machines have the potential to approximate this human sensory expertise,” Kang stated.

“This means that AI can lengthen past mere recognition of bodily environment to probably enrich our understanding of human subjective experiences at totally different locations.”

Along with approximating the proportions of sky, greenery and buildings, the generated photos usually maintained the architectural types and distances between objects of their real-world picture counterparts, in addition to precisely reflecting whether or not soundscapes have been recorded throughout sunny, cloudy or nighttime lighting circumstances.

The authors word that lighting info may come from variations in exercise within the soundscapes. For instance, site visitors sounds or the chirping of nocturnal bugs may reveal time of day. Such observations additional the understanding of how multisensory elements contribute to our expertise of a spot.

“While you shut your eyes and pay attention, the sounds round you paint footage in your thoughts,” Kang stated. “As an illustration, the distant hum of site visitors turns into a bustling cityscape, whereas the light rustle of leaves ushers you right into a serene forest. Every sound weaves a vivid tapestry of scenes, as if by magic, within the theater of your creativeness.”

Kang’s work focuses on utilizing geospatial AI to check the interplay of people with their environments. In one other recent paper revealed in Humanities and Social Sciences Communications, he and his co-authors examined the potential of AI to seize the traits that give cities their distinctive identities.

See also  Can AI turn data centres into green champions?

Extra info:
Yonggai Zhuang et al, From listening to to seeing: Linking auditory and visible place perceptions with soundscape-to-image generative synthetic intelligence, Computer systems, Surroundings and City Methods (2024). DOI: 10.1016/j.compenvurbsys.2024.102122

Kee Moon Jang et al, Place identification: a generative AI’s perspective, Humanities and Social Sciences Communications (2024). DOI: 10.1057/s41599-024-03645-7

Supplied by
College of Texas at Austin


Quotation:
Utilizing AI to show sound recordings into correct road photos (2024, November 27)
retrieved 27 November 2024
from https://techxplore.com/information/2024-11-ai-accurate-street-images.html

This doc is topic to copyright. Aside from any honest dealing for the aim of personal examine or analysis, no
half could also be reproduced with out the written permission. The content material is offered for info functions solely.



Source link

TAGGED: accurate, images, recordings, sound, Street, turn
Share This Article
Twitter Email Copy Link Print
Previous Article business intelligence How the Steel Industry’s Shift Toward Sustainability Affects Market Valuations
Next Article How to handle the top challenges of SaaS management How to handle the top challenges of SaaS management
Leave a comment

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Your Trusted Source for Accurate and Timely Updates!

Our commitment to accuracy, impartiality, and delivering breaking news as it happens has earned us the trust of a vast audience. Stay ahead with real-time updates on the latest events, trends.
FacebookLike
TwitterFollow
InstagramFollow
YoutubeSubscribe
LinkedInFollow
MediumFollow
- Advertisement -
Ad image

Popular Posts

Electricity Demand at Data Centers Could Double in Three Years | DCN

(Bloomberg) -- Global electricity demand from data centers, cryptocurrencies, and artificial intelligence could more than…

January 24, 2024

Serverless edge computing redefines data processing at the network’s edge

Within the ever-evolving world of expertise, buzzwords come and go, usually promising greater than they…

July 19, 2024

Your Guide to Entering the World of Forex

All in favour of foreign currency trading however undecided the place to start out? With…

November 29, 2024

Vercel acquires ModelFusion, launches AI SDK 3.1 for enterprise AI development

Uncover how firms are responsibly integrating AI in manufacturing. This invite-only occasion in SF will…

May 5, 2024

The beginning of the end of the transformer era? Neuro-symbolic AI startup AUI announces new funding at $750M valuation

The buzzed-about however nonetheless stealthy New York Metropolis startup Augmented Intelligence Inc (AUI), which seeks…

November 4, 2025

You Might Also Like

AI could accurately deliver flood warnings in data-scarce regions
Innovations

AI could accurately deliver flood warnings in data-scarce regions

By saad
ARCHER2 supercomputer
Innovations

ARCHER2 supercomputer generates £4.2bn for UK economy

By saad
AI supercomputer
Innovations

UK unveils £45m Sunrise AI supercomputer to accelerate fusion

By saad
The password is dying – biometrics are taking their place
Innovations

The password is dying – biometrics are taking their place

By saad
Data Center News
Facebook Twitter Youtube Instagram Linkedin

About US

Data Center News: Stay informed on the pulse of data centers. Latest updates, tech trends, and industry insights—all in one place. Elevate your data infrastructure knowledge.

Top Categories
  • Global Market
  • Infrastructure
  • Innovations
  • Investments
Usefull Links
  • Home
  • Contact
  • Privacy Policy
  • Terms & Conditions

© 2024 – datacenternews.tech – All rights reserved

Welcome Back!

Sign in to your account

Lost your password?
We use cookies to ensure that we give you the best experience on our website. If you continue to use this site we will assume that you are happy with it.
You can revoke your consent any time using the Revoke consent button.