Paxata Announces Winter Release – Builds On Smart End User Capabilities, Comprehensive, Open Platform And Enterprise-grade Governance


New Release Underscores Commitment to Industry Standards and Open Source Innovation
with Additional Machine Learning Capabilities that Enable Continued Ease-of-Use

Redwood City, CA – January 27, 2016 – Paxata, provider of the only Adaptive Data Preparation™ platform for the enterprise, today announced the availability of its Winter ’15 product release. Delivered on a flexible platform, the latest version provides business analysts with comprehensive data preparation capabilities and unparalleled governance and administrative controls. Paxata’s platform serves as the connected information layer within some of the world’s largest, most complex infrastructures, whether used on-premises, in private clouds, or in a company’s proprietary cloud-based data preparation as a service (DPaaS) offering.

Supported by enterprise-grade security and a multi-tenant governance model, the Winter release gives administrators the ability to deploy Paxata in heterogeneous environments including the Hortonworks Data Platform on YARN and with multiple flavors of Apache Spark. The latest version also significantly improves how business analysts can find, access and apply data by delivering additional one-click capabilities powered by machine learning innovations.

Paxata also recently outperformed every vendor in the self-service data preparation category in the Butler Analytics Reviews. According to Martin Butler, founder of Butler Analytics: “Based on our overall rating, Paxata is ahead of the pack when it comes to building a platform that anticipates the business analyst’s need for agility while maintaining the controls necessary to satisfy even the toughest IT standards.”

“Paxata continues to be a strategic partner to customers who want a modern solution capable of transforming raw data into ready information,” said Prakash Nanduri, Co-Founder and CEO of Paxata. “Our innovative approach meets the needs of the most demanding customers including the top three banks in the US, the biggest semiconductor company in the world, the most trusted audit firm, the leading networking company and the largest Government agencies. This release adds essential capabilities into the connected information layer, delivering freedom and flexibility to non-technical business teams while standing up to the toughest IT standards.”

Designed to improve the business analyst experience, the Winter release supports new, easy-to-use visual data transformations. These include auto number, which automatically creates unique identifiers, and fill down, which intelligently detects missing values in a column caused by aggregation and fills them in. The release improves the accuracy and performance of recommendations through sophisticated machine learning and other advanced algorithms. It also includes expanded coverage for multiple dataset use cases, and enhancements to facilitate new business scenarios with complex textual data such as product catalog descriptions.

With enterprise-grade multi-tenant governance capabilities, Paxata eliminates the security and control concerns IT and business users face with traditional desktop applications. Using Paxata, business teams operate with greater security without having to change how they access the system. Administrators can leverage their organizations’ existing investments in authentication and authorization and still retain complete flexibility with per-tenant governance schemes and the ability to delegate application-specific group creation to trusted end-users. More specifically, the Winter release:

  • Integrates with LDAP: administrators can manage application access and user roles with their existing trusted central directory services via Lightweight Directory Access Protocol (LDAP)-compliant integration for both authentication and authorization.
  • Integrates with SAML for SSO: administrators can leverage Paxata’s industry-standard SAML 2.x integration, which allows users to log in to Paxata without entering any credentials via the single sign-on capabilities common among other business-critical applications.
  • Provides dynamic provisioning: administrators have the freedom to provision new users and groups in Paxata selected from their central and trusted LDAP directory. Paxata syncs directly to existing authentication and authorization providers for the most up-to-date status for users joining, switching groups, and leaving the organization. It avoids the common sync challenges found in static systems and helps ensure unified user management.

Paxata’s open, flexible architecture offers critical capabilities for enterprises with mixed Hadoop environments across the business for different data storage and processing scenarios. Paxata’s latest release can be run on Apache Spark deployed on YARN for optimized cluster usage. This allows administrators to maximize their existing hardware investments by co-locating Paxata with other Spark-based technologies, and flexibly adjust resources allocated to Paxata.

The Winter release also provides full support for the Hortonworks Data Platform, which enables complete capabilities for data ingest, outputs and processing. Now, Hortonworks customers can leverage the industry’s leading interactive, visual, business analyst-centric data preparation experience. Finally, the solution provides multiple, simultaneous Hadoop cluster support for data import and data publish. Unlike other Hadoop ecosystem applications, which can connect to only one version of one distribution at a time, Paxata can connect to multiple versions of Cloudera and Hortonworks clusters simultaneously. This allows data to be imported from any supported HDFS source and AnswerSets to be published to any other supported HDFS destination, and it supports data migration scenarios across various Hadoop and non-Hadoop systems.

About Paxata
Paxata is the only Adaptive Data Preparation™ platform for the enterprise. Paxata’s platform provides an interactive, analyst-centric data prep experience powered by a unified set of technologies designed from the ground up for comprehensive data integration, data quality, semantic enrichment, collaboration and governance. Information-driven organizations who want to make data worth analyzing use Paxata to explore, clean, shape, and combine all the data they need into rich AnswerSets™ which power ad hoc, operational, predictive and packaged analytics.

Paxata’s platform, built on Apache Spark and optimized to run in Hadoop environments, leverages distributed computing, machine learning and a dynamically visual workspace that promotes transparent governance and ad hoc collaboration. Paxata data prep, powered by IntelliFusion™, is designed to eliminate the need for coding, scripting and sampling. The solution is available as a service, and can be deployed in AWS virtual private clouds or within Hadoop environments at customer sites.

Paxata is headquartered in Redwood City with offices in New York, Ohio and Washington DC. Follow @Paxata.
