View of a report in column display on a desktop monitor

SAS® Data Loader for Hadoop

Manage big data on your own terms with self-service data integration and data quality

Get your data right where you want it by loading it into – or out of – Hadoop so it’s ready and available for reports, visualizations or advanced analytics. Sound easy? It is. Because you can do it all yourself. SAS Data Loader for Hadoop empowers you to manage your own data without writing code.


Manage data without specialized skills.

No need to outsource anything. You know what you need from your data, and you can do it yourself. SAS Data Loader for Hadoop makes it easy for business users or data scientists to perform big data integration, data quality and data preparation tasks without writing complex MapReduce code or asking for outside help.

Improve scalability and performance.

While business users appreciate the solution for its ease of use, data scientists and SAS coders like how it improves speed, efficiency and agility. A code accelerator makes it possible to run code in parallel on the Hadoop cluster for faster performance. Plus, you can improve data quality and implement data profiling without moving your data.

Free up IT for more technical tasks.

When the data scientists on your team are weighed down with basic data management duties, their advanced skills go underused – and business takes a hit. SAS Data Loader for Hadoop frees up IT to focus on making your systems better, faster and more powerful.

Derive more value from your big data.

It’s easy to load data from relational data sources or SAS data sets to and from Hadoop – and put your big data to work. There are endless opportunities for advanced analytics and other technologies that have the potential to transform your organization.

SAS data expert Matt Magne explains how SAS Data Loader for Hadoop tackles the Hadoop skills shortage and empowers you to prepare, integrate and cleanse big data.



View of a report in table display on a laptop monitor
  • Intuitive user interface. Easily access, transform and manage data stored in Hadoop with one web-based interface that reduces training requirements.
  • Purpose-built to load data to and from Hadoop. Built from the ground up to manage big data on Hadoop; not repurposed from existing IT-focused tools.
  • Big data quality and profiling. With directives that include casing, gender analysis, pattern analysis and field extraction – plus profiling that runs in-parallel on the Hadoop cluster for better performance – data will be accurate and ready for action.
  • Big data integration. Import data from CSV and other delimited files into Hadoop. Plus, you can run HiveQL commands and delete rows on Hadoop tables.

    • In-memory analytics server. Load data in memory to prepare it for high-performance reporting, visualization or analytics.
      • In-cluster code and data quality execution. Execute analytics and data quality processing within Hadoop for fast, budget-friendly results. Minimize data movement for increased scalability, governance and performance.
      • Improved security. SAS Data Loader for Hadoop supports Active Directory and Lightweight Directory Access Protocol for user authorization.

        Technical Information


        SAS Data Loader



        Try It for Free

        Get a free production trial of SAS Data Loader for Hadoop. The trial includes step-by-step installation and configuration instructions, plus access to video tutorials and other resources, to help you get the most out of your evaluation.

        Try It for Free

        SAS Data Loader

        Get a desktop version of SAS Data Loader for Hadoop free for 90 days. You'll also get step-by-step installation and configuration instructions, plus access to video tutorials and other resources, to help you get the most out of your evaluation.

        Operating System

        • Windows x64 running Windows 7 or later
        • Mac OS X running 10.8 or later


        • Minimum 16GB
          Note: SAS Data Loader for Hadoop requires at least 4GB of available memory. An additional 4GB is recommended for the Hadoop virtual machine.

        Disk Space

        • At least 30GB of free hard drive space


        • Must be enabled for virtualization technology


        • Minimum – 2 cores (4 logical processors)
        • Recommended – 4 cores (8 logical processors) or more

        Web Browsers

        • Windows x64 –
          • Windows Internet Explorer 9 or later
          • Mozilla Firefox 14 or later
          • Google Chrome 21 or later

        • Mac OS X –
          • Apple Safari 7 or later

        Virtualization Environment

        • Windows x64 –
          • VMware Player 6 or later; or
          • VMware Workstation 10 or later
        • Mac OS X –
          • VMware Fusion 6 or later

        Hadoop Virtual Machine

        • Cloudera QuickStart VM CDH 5.3; or
        • Hortonworks HDP 2.2 on Sandbox


        Looking for information on how to buy?

        Need additional information? Get details on solutions,
        licensing, deployment and more.


        Ready to get started? Take the next step toward getting
        more value from your data.

        Back to Top