Load your data into or out of Hadoop and data lakes. Prep it so it's ready for reports, visualizations or advanced analytics – all inside the data lakes. And do it all yourself, quickly and easily.
Intuitive user interface
Makes it easy to access, transform and manage data stored in Hadoop or data lakes with a web-based interface that reduces training requirements.
Purpose-built to load data to/from Hadoop
Built from the ground up to manage big data on Hadoop or in data lakes; not repurposed from existing IT-focused tools.
Chaining and scheduling directives
Lets you group multiple directives to run simultaneously or one after the other. Schedule and automate directives using the exposed Public API.
Collaboration and security
Enables you to share and secure directives. Call them from SAS Data Integration Studio, uniting technical and nontechnical user activities.
Big data quality and profiling
Includes built-in directives – casing, gender and pattern analysis, field extraction, match-merge and cluster-survive. Profiling runs in-parallel on the Hadoop cluster for better performance.
In-memory analytics server
Lets you load data in memory to prepare it for high-performance reporting, visualization or analytics.
In-cluster code and data quality execution
Executes analytics and data quality processing within Hadoop for fast, budget-friendly results. Minimizes data movement for increased scalability, governance and performance.
Big data integration
Imports data from CSV, SAS/ACCESS® libraries, cloud sources, relational databases and other delimited files into Hadoop. Runs HiveQL and Impala SQL.
Take control of your data – and free up IT – with self-service data preparation.
Manage data – no special skills needed.
Perform data integration, data quality and data preparation tasks yourself, without having to write code or ask for specialized help. SAS Data Loader for Hadoop bridges the skills gap, giving all users access to their data regardless of technical ability.
Boost scalability and performance.
Business users find it easy to use. Data scientists and SAS coders like its speed, efficiency and agility. A code accelerator harnesses the power of Hadoop, and data quality functions run in memory on Spark for better performance. And by minimizing data movement, you increase your data's security.
Ensure big data quality.
Take control of the data within data lake environments. SAS Data Loader for Hadoop allows you to profile data to understand its overall quality. Then you can standardize, parse, match and perform other core data quality functions, resulting in quality data for business needs inside of the data lakes.
Speed data management processes with Spark.
Data quality functions run in memory in Apache Spark for improved performance. Matching and best record creation enables master data management for big data. In addition, you can read and write to Spark data sets as needed.
Explore More on SAS® Data Loader for Hadoop and Beyond
To browse resources by type, select an option below.
-
- Select Resource Type
- Analyst Report
- Article
- Blog Post
- Book Excerpt
- Case Study
- Customer Story
- E-Book
- Infographic
- Interview
- Research
- Series
- Video
- Webinar
- White Paper
- White Paper
-
White Paper
Improving Data Preparation for Business Analytics
This TDWI Best Practices Report discusses the latest data preparation processes, self-service options and how to effectively integrate data prep with analytics and BI solutions.r More Productive Users
-
White Paper
Data Integration Déjà Vu: Big Data Reinvigorates DI
Discover why the latest evolution of data integration delivers more value from big data.
-
White Paper
Managing the Analytics Life Cycle for Decisions at Scale
Let the SAS Analytics Life Cycle guide you through the iterative process of going from raw data to predictive modeling to automated decisions, faster. This paper tells you how.
-
White Paper
Data Quality Challenges and Priorities
Find out how organizations are addressing their most pressing data quality issues, discover the top 10 priorities for data quality solutions, and learn the best ways to engage and empower business users to improve data quality.
-
White Paper
A Non-Geek’s Big Data Playbook
This paper examines how a non-geek yet technically savvy business professional can understand how to use Hadoop - and how it will impact enterprise data environments for years to come. The paper serves as a playbook that demonstrates six common “plays” that illustrate how Apache Hadoop can support and extend the enterprise data warehouse (EDW) ecosystem.
-
White Paper
Eight Considerations for Utilizing Big Data Analytics with Hadoop
This paper focuses on eight considerations when it comes to applying big data analytics to extract value from Hadoop. Readers will learn about the importance of in-memory analytics, how to optimize the data preparation processes and the skill sets needed to derive benefits from Hadoop.
-
E-Book
An Early Adopter's Guide to Hadoop
Read interviews with two Hadoop customers and two Hadoop implementors, plus Hadoop adoption survey results and tips for big data management - all in one convenient ebook package.
-
White Paper
Best Practices for Hadoop: A Guide From SAS Customers
Ease your transition to Hadoop by following these tried-and-true best practices from SAS customers.
-
White Paper
Bringing the Power of SAS to Hadoop
Want to get even more value from your Hadoop implementation? Learn about the SAS portfolio of solutions that enable you to bring the full power of business analytics to Hadoop.
-
White Paper
Making Sense of Hadoop and its Ecosystem
An Investigation into the Evolution and Deployment of Hadoop
-
White Paper
Self-Service Big Data Preparation in the Age of Hadoop: A Conversation With SAS
Learn how SAS Data Loader for Hadoop enables business users – along with data scientists and IT – to access, profile, transform and cleanse Hadoop data. The best part is, it requires minimal training and no need for coding.
-
White Paper
The Current State of Hadoop in the Enterprise
Written by the International Institute for Analytics, this paper presents a broad view of Hadoop in the marketplace, including how it is being adopted and used by global organizations. It then offers a set of recommendations that organizations can use to succeed with Hadoop.
-
White Paper
SAS and the Hadoop Ecosystem
A vendor profile from The Bloor Group that refers to the full report: Making Sense of Hadoop and its Ecosystem.
- Interview The scoop on HadoopCloudera co-founder Mike Olson talks trends, changes and outlines the formula for success with Hadoop.
-
White Paper
How Hadoop Helps Realize Big Data Value
Leveraging a big-picture approach to Hadoop opens up a world of
data-fueled opportunities. -
White Paper
Understanding Big Data Quality for Maximum Information Usability
Discover why data quality and data governance are so important to large-scale analytics. Learn how to balance governance with usability so you can come up with a strategic plan for managing big data.
-
White Paper
Hadoop for the Enterprise: Making Data Management Massively Scalable, Agile, Feature-Rich, and Cost-Effective
This TDWI report accelerates users’ understanding of new products, technologies and best practices that have emerged around Hadoop. It will also help readers connect available options to use cases, with a focus on mainstream enterprise uses, while respecting proven IT practices and delivering maximum business value.
-
White Paper
SAS® and Apache Hadoop for Government
This paper focuses on the integration of Hadoop-centric big data approaches with user-focused business analytics capabilities. Users see accessible business analytics tools that produce results, without the need for IT staff to craft queries in programming languages.
- Analyst Report Gartner positions SAS as a leader in the Magic Quadrant for Data Integration ToolsSAS is a leader in Gartner's Magic Quadrant for Data Integration Tools. Data integration from SAS helps you break down data silos and boost productivity.
- Analyst Report SAS throws its hat into the self-service data preparation ring with Data Loader for HadoopThe strategy centers on three key themes: 'big data'management and governance involving Hadoop, the streamlining of access to information, and the use of its federation and integration offerings to enable the right data to be available, at the right time.
Related Products
Check out these products related to SAS Data Loader for Hadoop, built on the powerful SAS® Platform.