Products & Solutions / In-Memory Analytics

SAS® High-Performance Analytics

Generate highly accurate and timely insights and solve complex problems using larger volumes of data than ever before

While data volumes and business complexities continue to grow, decision makers need answers faster than ever – in minutes or seconds, instead of hours or days. Unfortunately, many IT infrastructures are not equipped to capture and analyze big data. Analytic professionals may not be able to incorporate the newest data, or they may be restricted to using subsets of data or suboptimal modeling techniques.

SAS High-Performance Analytics is appliance-ready software that uses hardware from our database partners (Teradata or EMC Greenplum) to solve complex problems using in-memory processing resources. It enables organizations to derive highly accurate and timely insights in minutes, not hours, to make better-informed business decisions. 

Benefits

  • Act quickly and confidently to seize new opportunities, manage risks and make the right choices.
  • Don't limit yourself to using simplified analytical approaches for solving complex problems.
  • Derive insights at breakthrough speeds for high-value and time-sensitive decision making.
  • Take full advantage of a highly scalable and reliable infrastructure that is optimized for big data and complex analytics.

Read more

Features

  • High-performance analytics
  • High-performance data exploration
  • High-performance variable reduction
  • High-performance linear regression
  • High-performance logistic regression
  • High-performance nonlinear regression
  • High-performance mixed linear models
  • High-performance neural networks
  • High-performance data mining
  • High-performance data mining DATABASE procedure
  • High-performance random forest decision trees (experimental)
  • High-performance count regression
  • High-performance severity models

Read more

" SAS High-Performance Analytics can turn any data, including big data, assets into quicker, better business decisions and ultimately competitive advantage."

— Dan Vesset

Program Vice President, IDC's Business Analytics research

Read the press release


Screenshot

High-performance neural networks enable modelers to produce more training runs for significantly more lift and incremental predictive power.


Screenshots

How SAS® Is Different

  • Integrated technology from the leaders in business analytics and data warehouse appliances. SAS expertise in analytics and our partners' (EMC Greenplum and Teradata) deep understanding of massively parallel processing scenarios will offer customers highly optimized, in-memory analytics to confidently tackle difficult problems at a much faster pace.
  • SAS High-Performance Analytics qualifies as the only in-memory offering on the market that processes high-end analytics and big data to produce time-sensitive insights. SAS High-Performance Analytics is truly about applying high-end analytical techniques to solve complex business problems – not just about using query, reporting and descriptive statistics within an in-memory environment.
  • SAS High-Performance Analytics addresses the entire model development and deployment life cycle. Unlike other offerings, SAS High-Performance Analytics can perform analyses that range from descriptive statistics and data summarizations to model building and scoring new data at breakthrough speeds. These results enable our customers to extract more value from their data and stay ahead of the competition.
  • SAS provides a market-leading approach with this offering. SAS has reinvented its architecture and software to satisfy the demands of big data, larger problems and more complex scenarios, and to take advantage of new hardware advancements.

Benefits

  • Act quickly and confidently to seize new opportunities, manage risks and make the right choices. Better, faster and more accurate analytical results allow organizations to achieve significant added business value, drive new revenue opportunities and increase bottom-line savings.
  • Don't limit yourself to using simplified analytical approaches for solving complex problems. Use sophisticated analytics against all of your data (not just subsets or aggregates) to improve accuracy and enable more targeted, focused, high-impact decisions. Users can employ the best modeling techniques, perform model iterations more frequently and test new ideas to discover more accurate insights.
  • Derive insights at breakthrough speeds for high-value and time-sensitive decision making. Shrink the time it takes to go from model inception to deployment. SAS High-Performance Analytics delivers blazing fast performance so you can evaluate numerous scenarios and quickly detect, and act on, changing market conditions.
  • Take full advantage of a highly scalable and reliable infrastructure that is optimized for big data and complex analytics. With SAS High-Performance Analytics, analytical professionals can fully utilize the  in-memory architecture to quickly get answers to the most difficult business questions without the constraints imposed by the IT infrastructure.

Features

High-performance analytics
  • Enables high-performance capabilities of select SAS Analytics products (i.e., Base SAS®, SAS/STAT®, SAS/ETS® and SAS® Enterprise Miner™).
  • Uses a platform that supports access to large data stored in nearly any format currently on the market.
  • Reads input data in parallel and writes output data in parallel.
  • Executes SAS Analytics procedures across a distributed compute environment in parallel.
High-performance data exploration
  • Enables large-scale data exploration and summarization through a series of parallelized procedures.
  • Generates descriptive statistics on a large scale, very quickly.
  • Creates mean, min, max, range and measures of spread and centrality along with data for cardinality, summary and levels of variables.
High-performance variable reduction
  • Reduces dimensionality by using the HPREDUCE procedure to select a subset of the original variables (variable selection) to preserve model interpretation.
  • Performs unsupervised variable section by identifying a set of variables that jointly explain the maximum amount of data variance (covariance analysis).
  • Provides distributed computation and output of the CORR, COV or SSCP matrix.
  • Uses the CLASS statement to support categorical inputs.
  • Supports main and interaction effects with the VAR statement.
  • Outputs statistics and matrix information for exploratory data analysis that can also be used as direct input for statistical procedures. This saves time by eliminating redundant matrix aggregations.
High-performance linear regression
  • Supports partitioning of data into training, validation and testing roles.
  • Supports a FREQ statement for grouped analysis and a WEIGHT statement for weighted analysis.
  • Provides multiple effect-selection methods.
High-performance logistic regression
  • Predicts binary, binomial and multinomial outcomes.
  • Provides model-building syntax with the CLASS and effect-based MODEL statements.
  • Provides cumulative link models for ordinal data and generalized logistic modeling for unordered multinomial data enables model building (variable selection) through the SELECTION statement.
  • Provides a WEIGHT statement for weighted analysis and a FREQ statement for grouped analysis.
  • Provides an OUTPUT statement to produce a data set with predicted probabilities and other observation-wise statistics.
High-performance nonlinear regression
  • Computes analytical derivatives of user-provided expressions for more robust parameter estimations, improving the estimations as well as making them faster.
  • Evaluates user-provided expressions and their confidence limits with the ESTIMATE and PREDICT statements.
  • Estimates parameters by using least squares and the maximum likelihood method.
High-performance mixed linear models
  • Fits a variety of mixed linear models to data and enables you to use these fitted models to make statistical inferences about the data.
  • Supports multiple covariance structures.
  • Provides appropriate standard errors for all specified, estimable linear combinations of fixed and random effects, and corresponding t-tests and F-tests.
  • REML and ML estimation methods are implemented with a variety of optimization algorithms.
  • Provides special dense and sparse algorithms that take advantage of distributed and multiple-core computing environments.
High-performance neural networks
  • Provides automatic standardization of input and target variables.
  • Provides automatic selection and use of a validation data subset.
  • Provides automatic termination of training when the validation error stops improving.
  • Provides the ability to weight individual observations.
High-performance data mining
  • Includes the following high-performance-enabled SAS Enterprise Miner nodes:
    • HP Data Source.
    • HP Explore.
    • HP Transform.
    • HP Variable Selection.
    • HP Regression.
    • HP Neural Network.
    • HP Random Forest experimental.
    • HP Impute.
High-performance data mining DATABASE procedure
  • Creates summaries of key input data sources, including:
    • Number of observations.
    • Number of observations that contain a missing value.
    • Minimum observed value.
    • Maximum observed value.
    • Mean of observed values.
    • Standard deviation.
    • Measure of asymmetry.
    • Measure of the "heaviness of the tails."
    • Sum of all non-missing observations.
    • Corrected sum of squares.
    • Sum of squares.
High-performance random forest decision trees (experimental)
  • Creates an ensemble of hundreds of decision trees to predict a single target.
  • Trains hundreds of decision trees in parallel independently on different grid nodes.
  • Randomly selects the input variables considered for splitting a node from all available inputs.
  • Considers only a single variable that is most associated with the target for splitting.
High-performance count regression
  • Fits regression models where the dependent variable represents counts (e.g., the number of events recorded in some period).
  • Supports zero-inflated Poisson and negative binomial models.
  • Estimates parameters by using the maximum likelihood method.
High-performance severity models
  • Fits probability distributions for the severity (magnitude) of random events  (e.g., distributions of losses claimed under insurance policies as well as events with positive impacts such as the intermittent demand for certain products).
  • Fits regression models for the scale of the severity distribution.
  • Provides nine different probability distributions, including the Tweedie distribution.
  • Allows users to add additional probability distributions.
  • Can model data truncation and data censoring.

Screenshots

Screenshot
High-performance neural networks enable modelers to produce more training runs for significantly more lift and incremental predictive power.

View Screenshot

Screenshot
High-performance data mining enables modelers to explore, model and score using complete data – not just a subset – to get accurate and timely insights.

View Screenshot

System Requirements

Server tier
  • Linux x64 (64-bit): Novell SuSE 10 and 11; RHEL 5 and 6
Client tier (SAS® software)
  • Microsoft  Windows x64 (64-bit):  Windows XP Professional for x64, Windows Vista* for x64, Windows 7**  for x64
  • Linux x64 (64-bit): Novell SuSE 10 and 11; RHEL 5 and 6
Required SAS® software
  • Base SAS
  • SAS/ACCESS® to EMC Greenplum or SAS/ACCESS to Teradata
  • And/or SAS/STAT
  • And/or SAS Enterprise Miner
  • And/or SAS/ETS
Required hardware
  • Teradata database appliance along with Teradata 13.10 (or)
  • EMC Greenplum database appliance along with EMC Greenplum 4.2

* NOTE: Windows Vista supported editions are: Enterprise, Ultimate and Business.
** NOTE: Windows 7 supported editions are: Enterprise, Ultimate and Professional.

Ready to learn more?

Call us at 1-800-727-0025 (US and Canada) or request more information.