Monday, 15 Dec 2025
Subscribe
logo
  • Global
  • AI
  • Cloud Computing
  • Edge Computing
  • Security
  • Investment
  • Sustainability
  • More
    • Colocation
    • Quantum Computing
    • Regulation & Policy
    • Infrastructure
    • Power & Cooling
    • Design
    • Innovations
    • Blog
Font ResizerAa
Data Center NewsData Center News
Search
  • Global
  • AI
  • Cloud Computing
  • Edge Computing
  • Security
  • Investment
  • Sustainability
  • More
    • Colocation
    • Quantum Computing
    • Regulation & Policy
    • Infrastructure
    • Power & Cooling
    • Design
    • Innovations
    • Blog
Have an existing account? Sign In
Follow US
© 2022 Foxiz News Network. Ruby Design Company. All Rights Reserved.
Data Center News > Blog > AI > Structify raises $4.1M seed to turn unstructured web data into enterprise-ready datasets
AI

Structify raises $4.1M seed to turn unstructured web data into enterprise-ready datasets

Last updated: May 4, 2025 6:53 am
Published May 4, 2025
Share
Structify raises $4.1M seed to turn unstructured web data into enterprise-ready datasets
SHARE

Be part of our every day and weekly newsletters for the newest updates and unique content material on industry-leading AI protection. Be taught Extra


A Brooklyn-based startup is taking intention at one of the crucial infamous ache factors on this planet of synthetic intelligence and information analytics: the painstaking course of of information preparation.

Structify emerged from stealth mode immediately, asserting its public launch alongside $4.1 million in seed funding led by Bain Capital Ventures, with participation from 8VC, Integral Ventures and strategic angel traders.

The corporate’s platform makes use of a proprietary visible language mannequin known as DoRa to automate the gathering, cleansing, and structuring of information — a course of that sometimes consumes as much as 80% of information scientists’ time, in response to {industry} surveys.

“The quantity of knowledge obtainable immediately has completely exploded,” stated Ronak Gandhi, co-founder of Structify, in an unique interview with VentureBeat. “We’ve hit a significant inflection level in information availability, which is each a blessing and a curse. Whereas now we have unprecedented entry to info, it stays largely inaccessible as a result of it’s so troublesome to transform into the precise format for making significant enterprise selections.”

Structify’s strategy displays a rising industry-wide deal with fixing what information specialists name “the information preparation bottleneck.” Gartner analysis signifies that inadequate data preparation stays one of many main obstacles to profitable AI implementation, with 4 of 5 companies missing the information foundations essential to completely capitalize on generative AI.

How AI-powered information transformation is unlocking hidden enterprise intelligence at scale

At its core, Structify permits customers to create customized datasets by specifying the information schema, choosing sources, and deploying AI brokers to extract that information. The platform can deal with every thing from SEC filings and LinkedIn profiles to information articles and specialised {industry} paperwork.

What units Structify aside, in response to Gandhi, is their in-house mannequin DoRa, which navigates the net like a human would.

“It’s tremendous high-quality. It navigates and interacts with stuff similar to an individual would,” Gandhi defined. “So we’re speaking about human high quality — that’s the before everything heart of the rules behind DoRa. It reads the web the way in which a human would.”

See also  Delska's new 10MW data centre in Latvia achieves Tier III design certification

This strategy permits Structify to assist a free tier, which Gandhi believes will assist democratize entry to structured information.

“The way in which during which you consider information now’s, it’s this actually valuable object,” Gandhi stated. “This actually valuable factor that you just spend a lot time finagling and getting and wrestling round, and when you’ve gotten it, you’re like, ‘Oh, if somebody was to delete it, I’d cry.’”

Structify’s imaginative and prescient is to “commoditize information” — making it one thing that may be simply recreated if misplaced.

From finance to building: How companies are deploying customized datasets to unravel industry-specific challenges

The corporate has already seen adoption throughout a number of sectors. Finance groups use it to extract info from pitch decks, building corporations flip advanced geotechnical paperwork into readable tables, and gross sales groups collect real-time organizational charts for his or her accounts.

Slater Stich, accomplice at Bain Capital Ventures, highlighted this versatility within the funding announcement: “Each firm I’ve ever labored with has a handful of information sources which might be each extraordinarily essential and an enormous ache to work with, whether or not that’s figures buried in PDFs, scattered throughout a whole bunch of internet pages, hidden behind an enterprise SOAP API, and many others.”

The variety of Structify’s early buyer base displays the common nature of information preparation challenges. In accordance with TechTarget research, information preparation sometimes includes a collection of labor-intensive steps: assortment, discovery, profiling, cleaning, structuring, transformation, and validation — all earlier than any precise evaluation can start.

Why human experience stays essential for AI accuracy: Inside Structify’s ‘quadruple verification’ system

A key differentiator for Structify is its “quadruple verification” course of, which mixes AI with human oversight. This strategy addresses a important concern in AI improvement: guaranteeing accuracy.

“At any time when a person sees one thing that’s suspicious, or we determine some information as probably suspicious, we will ship it to an professional in that particular use case,” Gandhi defined. “That professional can act in the identical means as [DoRa], navigate to the precise piece of knowledge, extract it, put it aside, after which confirm if it’s proper.”

See also  qwq-32b-launches-high-efficiency-performance-reinforcement | VentureBeat

This course of not solely corrects the information but additionally creates coaching examples that enhance the mannequin’s efficiency over time, particularly in specialised domains like building or pharmaceutical analysis.

“These issues are so messy,” Gandhi famous. “I by no means thought in my life I’d have a powerful understanding of geology. However there we’re, and that, I feel, is a large power – with the ability to study from these specialists and put it straight into DoRa.”

As information extraction instruments grow to be extra highly effective, privateness considerations inevitably come up. Structify has carried out safeguards to deal with these points.

“We don’t do any authentication, something that required a login, something that requires you to go behind some sense of knowledge – our agent doesn’t do this as a result of that’s a privateness concern,” Gandhi stated.

The corporate additionally prioritizes transparency by offering direct sourcing info. “In the event you’re excited about studying extra a few explicit piece of knowledge, you go on to that content material and see it, versus form of legacy suppliers the place it’s this black field.”

Structify enters a aggressive panorama that features each established gamers and different startups addressing numerous elements of the information preparation problem. Corporations like Alteryx, Informatica, Microsoft, and Tableau all supply information preparation capabilities, whereas a number of specialists have been acquired lately.

What differentiates Structify, in response to CEO Alex Reichenbach, is its mixture of pace and accuracy. A latest LinkedIn publish by Reichenbach claimed they’d sped up their agent “10x whereas reducing value ~16x” via mannequin optimization and infrastructure enhancements.

The corporate’s launch comes amid rising curiosity in AI-powered information automation. In accordance with a TechTarget report, automating information preparation “is steadily cited as one of many main funding areas for information and analytics groups,” with augmented information preparation capabilities turning into more and more essential.

How irritating information preparation experiences impressed two associates to revolutionize the {industry}

For Gandhi, Structify addresses issues he confronted firsthand in earlier roles.

See also  Google Invests in Venture to Build Energy Parks for Data Centers

“The massive factor in regards to the founding story of Structify is it’s each form of a private and an expert factor,” Gandhi recalled. “I used to be telling [Alex] in regards to the time that I used to be working as a knowledge analyst and doing ops and consulting, getting ready these actually area of interest, bespoke information units for shoppers — lists of all of the health influencers and their following metrics, lists of corporations and what jobs they’re posting, museums on the East Coast… I used to be spending a whole lot of time doing manually curating them, scraping, information entry, all these things.”

The shortcoming to rapidly iterate from concept to dataset was significantly irritating. “What acquired me was that you just couldn’t iterate and form of go from concept to information set in a fast style,” Gandhi stated.

His co-founder, Alex Reichenbach, encountered comparable challenges whereas working at an funding financial institution, the place information high quality points hampered efforts to construct fashions on prime of structured datasets.

How Structify plans to make use of its $4.1 million seed funding to rework enterprise information preparation

With the brand new funding, Structify plans to develop its technical crew and set up itself as “the go-to information software throughout industries.” The corporate at present presents each free and paid tiers, with enterprise choices for these needing superior options like on-premise deployment or extremely specialised information extraction.

As extra corporations spend money on AI initiatives, the significance of high-quality, structured information will solely improve. A latest MIT Technology Review Insights report discovered that 4 out of 5 companies aren’t able to capitalize on generative AI due to poor information foundations.

For Gandhi and the Structify crew, fixing this basic problem might unlock vital worth throughout industries.

“The truth that you possibly can even think about a world which creating information units is iterative is form of thoughts boggling for lots of our customers,” Gandhi stated. “On the finish of the day, the pitch is about with the ability to have this management and customizability.”


Source link
TAGGED: 4.1M, data, datasets, Enterpriseready, Raises, Seed, Structify, turn, unstructured, web
Share This Article
Twitter Email Copy Link Print
Previous Article Ara Partners Ara Partners Reaches Final Close for Inaugural Infrastructure Fund
Next Article Connectbase Launches Linkbase for Automated Partner Approvals Connectbase Launches Linkbase for Automated Partner Approvals
Leave a comment

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Your Trusted Source for Accurate and Timely Updates!

Our commitment to accuracy, impartiality, and delivering breaking news as it happens has earned us the trust of a vast audience. Stay ahead with real-time updates on the latest events, trends.
FacebookLike
TwitterFollow
InstagramFollow
YoutubeSubscribe
LinkedInFollow
MediumFollow
- Advertisement -
Ad image

Popular Posts

Landbase Raises $30M in Series A Funding

Landbase, a San Francisco, CA-based agentic AI firm, raised $30M in Collection A funding. The…

June 15, 2025

Unlock the other 99% of your data – now ready for AI

For many years, firms of all sizes have acknowledged that the information accessible to them…

June 20, 2025

Genspark adds Claude-powered financial reports on demand

Be a part of our day by day and weekly newsletters for the most recent…

November 15, 2024

Xen 4.20 Released: Improved Security, Performance, Architecture Support

With the discharge of Xen 4.20, the Linux Basis-hosted open-source hypervisor Xen Challenge has improved…

March 7, 2025

Health data storage has a climate cost. In the future, data may be stored in DNA

Credit score: College of Copenhagen Loads of vitality is required to research and retailer massive…

April 8, 2024

You Might Also Like

Tokenization takes the lead in the fight for data security
AI

Tokenization takes the lead in the fight for data security

By saad
Siemens and nVent: pioneering liquid cooling solutions for AI data centres
Power & Cooling

Siemens and nVent: pioneering liquid cooling solutions for AI data centres

By saad
nVent electric unveils advanced liquid cooling solutions for data centres
Infrastructure

nVent electric unveils advanced liquid cooling solutions for data centres

By saad
Drax outlines 100MW data centre plan at Selby power station
Global Market

Drax outlines 100MW data centre plan at Selby power station

By saad
Data Center News
Facebook Twitter Youtube Instagram Linkedin

About US

Data Center News: Stay informed on the pulse of data centers. Latest updates, tech trends, and industry insights—all in one place. Elevate your data infrastructure knowledge.

Top Categories
  • Global Market
  • Infrastructure
  • Innovations
  • Investments
Usefull Links
  • Home
  • Contact
  • Privacy Policy
  • Terms & Conditions

© 2024 – datacenternews.tech – All rights reserved

Welcome Back!

Sign in to your account

Lost your password?
We use cookies to ensure that we give you the best experience on our website. If you continue to use this site we will assume that you are happy with it.
You can revoke your consent any time using the Revoke consent button.