Press release

Data Preparation Provider Paxata Announces Issuance of U.S. Patent for Automated Join Detection

0
Sponsored by Businesswire

Paxata,
the pioneer in self-service data preparation, today announced that the
United States Patent and Trademark Office has granted the Company patent US20170109402A1
for Automated Join Detection across two diverse data sets. This feature
is a key cornerstone in empowering non-technical business consumers to
visually and interactively prepare, enrich and turn raw data into ready
information at scale. Automatic Join Detection is foundational to the Paxata
Adaptive Information Platform
in that it uses Natural Language
Processing (NLP), search techniques and other algorithms to identify
candidate columns that could form the basis for joining two disparate
data sets.

“The entire objective of self-service data preparation is for
intelligent algorithms to do the heavy lifting. What can take weeks and
months to do is now possible in a matter of seconds. That is the power
of automated join detection,” said Prakash Nanduri, Co-Founder and CEO.
“Finding, preparing and governing data that resides across the
enterprise and multiple clouds is the key challenge of the day. Our
vision has always been about enabling business users and intelligent
systems to consume clean data easily and rapidly. This patent
underscores Paxata’s position as the most powerful enterprise
information platform in the industry. I congratulate the team, led by my
co-founders Dave Brewster and Nenshad Bardoliwalla, who brought this to
life.”

Unlike traditional approaches that rely on similarly named and formatted
columns for join detection, Paxata will look for possible similarities
in one or multiple columns from each of the data sets and propose a list
of joins to the user with a confidence factor based on the completeness
of the join operation. For example, one data set might have separate
columns for First and Last names, while the second data set might have a
column for Last name, First Name.

Beyond the intelligent algorithms used, Paxata’s patent also covers the
ability to perform the processing for the Automatic Join Detection
across distributed compute nodes, enabling it to process massive volumes
of data, and not just on small sample data sets. Paxata also delivers
this in a configurable manner via its intuitive user interface which
allows the user to select various algorithmic options quickly and easily.

About Paxata

At Paxata, we transform data into information on-demand to empower every
person, process, and system in the organization to be more intelligent.
Our Adaptive Information Platform provides business leaders and analysts
with an enterprise-grade, self-service data preparation application to
deliver better customer experiences, improve operational efficiencies,
and comply with regulatory requirements. Built on Apache SparkTM and
optimized to run in hybrid, multi-cloud environments, Paxata leverages
algorithmic intelligence and distributed computing to deliver an
immersive business consumer experience that accelerates and automates
the data-to-insight pipeline. Paxata is headquartered in Redwood City,
California with offices in New York, Ohio, Texas, and Singapore. Visit www.paxata.com or
engage with us on Twitter,
LinkedInFacebook,
or YouTube.