Skip to main content

Amazon Redshift

Amazon Redshift FAQs

Tens of thousands of customers use Amazon Redshift every day to run SQL analytics in the cloud, processing exabytes of data for business insights. Whether your growing data is stored in operational data stores, data lakes, streaming data services or third-party datasets, Amazon Redshift helps you securely access, combine, and share data with minimal movement or copying. Amazon Redshift is deeply integrated with AWS database, analytics, and machine learning services to employ Zero-ETL approaches or help you access data in place for near real-time analytics, build machine learning models in SQL, and enable Apache Spark analytics using data in Redshift. Amazon Redshift Serverless enables your engineers, developers, data scientists, and analysts to get started easily and scale analytics quickly in a zero-administration environment. With its Massively Parallel Processing (MPP) engine and architecture that separates compute and storage for efficient scaling, and machine learning driven performance innovations (for example: Automated Materialized Views), Amazon Redshift is built for scale and delivers up to 5x better price performance than other cloud data warehouses."},"metadata":{"tags":[{"name":"General","namespaceId":"awt-content-topics#ams#c1","id":"awt-content-topics#ams#c1#general-0"}]}},{"fields":{"id":"awt-content-topics#what-are-the-top-reasons-customers-choose-amazon-redshift-1","itemHeading":"What are the top reasons customers choose Amazon Redshift?","itemLongLoc":"

Thousands of customers choose Amazon Redshift to accelerate their time to insights because it is a powerful analytics system that integrates well with database and machine learning services, is streamlined to use, and can become a central service to deliver on all their analytics needs. Amazon Redshift Serverless automatically provisions and scales data warehouse capacity to deliver high performance for demanding and unpredictable workloads. Amazon Redshift offers leading price performance for diverse analytics workloads, whether it is dashboarding, application development, data sharing, ETL (Extract, Transform, Load) jobs or several others. With tens of thousands of customers running analytics on terabytes to petabytes of data, Amazon Redshift optimizes real-world customer workload performance, based on fleet performance telemetry, and delivers performance that scales linearly to the workload, while keeping costs low. Performance innovations are available to customers at no additional cost. Amazon Redshift lets you get insights from running real-time and predictive analytics on all your data across your operational databases, data lake, data warehouse, streaming data, and third-party datasets. Amazon Redshift supports industry-leading security with built-in identity management and federation for single sign-on (SSO), multi-factor authentication, column-level access control, row-level security, role-based access control, Amazon Virtual Private Cloud (Amazon VPC), and faster cluster resize."},"metadata":{"tags":[{"name":"General","namespaceId":"awt-content-topics#ams#c1","id":"awt-content-topics#ams#c1#general-0"}]}},{"fields":{"id":"awt-content-topics#how-does-amazon-redshift-simplify-data-warehouse-and-analytics-management-2","itemHeading":"How does Amazon Redshift simplify data warehouse and analytics management?","itemLongLoc":"

Amazon Redshift is fully managed by AWS so you no longer need to worry about data warehouse management tasks such as hardware provisioning, software patching, setup, configuration, monitoring nodes and drives to recover from failures, or backups. AWS manages the work needed to set up, operate, and scale a data warehouse on your behalf, freeing you to focus on building your applications. Amazon Redshift Serverless automatically provisions and scales the data warehouse capacity to deliver high performance for demanding and unpredictable workloads, and you pay only for the resources you use. Amazon Redshift also has automatic tuning capabilities, and surfaces recommendations for managing your warehouse in Redshift Advisor. With Redshift Spectrum, Amazon Redshift manages all the computing infrastructure, load balancing, planning, scheduling, and execution of your queries on data stored in Amazon S3. Amazon Redshift enables analytics on all your data with deep integration into database services with features like Amazon Aurora Zero-ETL to Amazon Redshift and federated querying to access data in place from operational databases like Amazon RDS and your Amazon S3 data lake. Redshift enables streamlined data ingestion with no-code, automated data pipelines that ingest streaming data or Amazon S3 files automatically. Redshift is also integrated with AWS Data Exchange enabling users to find, subscribe to, and query third party datasets and combine with their data for comprehensive insights. With native integration into Amazon SageMaker, customers can stay right within their data warehouse and create, train, and build machine learning models in SQL. Amazon Redshift delivers on all your SQL analytics needs with up to 5x better price performance than other cloud data warehouses."},"metadata":{"tags":[{"name":"General","namespaceId":"awt-content-topics#ams#c1","id":"awt-content-topics#ams#c1#general-0"}]}},{"fields":{"id":"awt-content-topics#what-are-the-deployment-options-for-amazon-redshift-3","itemHeading":"What are the deployment options for Amazon Redshift?","itemLongLoc":"

Amazon Redshift is a fully managed service and offers both provisioned and serverless options, making it more efficient for you to run and scale analytics without having to manage your data warehouse. You can spin up a new Amazon Redshift Serverless endpoint to automatically provision the data warehouse in seconds or you can choose the provisioned option for predictable workloads."},"metadata":{"tags":[{"name":"General","namespaceId":"awt-content-topics#ams#c1","id":"awt-content-topics#ams#c1#general-0"}]}},{"fields":{"id":"awt-content-topics#how-do-i-get-started-with-amazon-redshift-4","itemHeading":"How do I get started with Amazon Redshift?","itemLongLoc":"

With just a few steps in the AWS Management Console, you can start querying data. You can take advantage of pre-loaded sample datasets, including benchmark datasets TPC-H, TPC-DS, and other sample queries to kick start analytics immediately. To get started with Amazon Redshift Serverless, choose “Try Amazon Redshift Serverless” and start querying data. Get started here."},"metadata":{"tags":[{"name":"General","namespaceId":"awt-content-topics#ams#c1","id":"awt-content-topics#ams#c1#general-0"}]}},{"fields":{"id":"awt-content-topics#how-does-the-performance-of-amazon-redshift-compare-to-that-of-other-data-warehouses-5","itemHeading":"How does the performance of Amazon Redshift compare to that of other data warehouses?","itemLongLoc":"

TPC-DS benchmark results show that Amazon Redshift provides the best price performance out of the box, even for a comparatively small 3 TB dataset. Amazon Redshift delivers up to 5x better price performance than other cloud data warehouses. This means that you can benefit from Amazon Redshift’s leading price performance from the start without manual tuning. Based on our performance fleet telemetry, we also know that most workloads are short query workloads (workloads that run in less than 1 second). For these workloads, the latest benchmarks demonstrate that Amazon Redshift offers up to 7x better price performance on high concurrency, low latency workloads than other cloud data warehouses. Learn more here."},"metadata":{"tags":[{"name":"General","namespaceId":"awt-content-topics#ams#c1","id":"awt-content-topics#ams#c1#general-0"}]}},{"fields":{"id":"awt-content-topics#can-i-get-help-to-learn-more-about-and-onboard-to-amazon-redshift-6","itemHeading":"Can I get help to learn more about and onboard to Amazon Redshift?","itemLongLoc":"

 Yes, Amazon Redshift specialists are available to answer questions and provide support. Contact us and you’ll hear back from us in one business day to discuss how AWS can help your organization."},"metadata":{"tags":[{"name":"General","namespaceId":"awt-content-topics#ams#c1","id":"awt-content-topics#ams#c1#general-0"}]}},{"fields":{"id":"awt-content-topics#what-is-amazon-redshift-managed-storage-7","itemHeading":"What is Amazon Redshift managed storage?","itemLongLoc":"

Amazon Redshift managed storage is available with serverless and RA3 node types and lets you scale and pay for compute and storage independently so you can size your cluster based only on your compute needs. It automatically uses high-performance SSD-based local storage as tier-1 cache and takes advantage of optimizations such as data block temperature, data block age, and workload patterns to deliver high performance while scaling storage automatically to Amazon S3 when needed without requiring any action."},"metadata":{"tags":[{"name":"General","namespaceId":"awt-content-topics#ams#c1","id":"awt-content-topics#ams#c1#general-0"}]}},{"fields":{"id":"awt-content-topics#how-do-i-use-amazon-redshifts-managed-storage-8","itemHeading":"How do I use Amazon Redshift’s managed storage?","itemLongLoc":"

If you are already using Amazon Redshift Dense Storage or Dense Compute nodes, you can use Elastic Resize to upgrade your existing clusters to the new compute instance RA3. Amazon Redshift Serverless and clusters using the RA3 instance automatically use Redshift-managed storage to store data. No other action outside of using Amazon Redshift Serverless or RA3 instances is required to use this capability."},"metadata":{"tags":[{"name":"General","namespaceId":"awt-content-topics#ams#c1","id":"awt-content-topics#ams#c1#general-0"}]}},{"fields":{"id":"awt-content-topics#how-can-i-run-queries-from-redshift-for-the-data-stored-in-the-aws-data-lake-9","itemHeading":"How can I run queries from Redshift for the data stored in the AWS Data Lake?","itemLongLoc":"

Amazon Redshift Spectrum is a feature of Amazon Redshift that lets you run queries against your data lake in Amazon S3, with no data loading or ETL required. When you issue an SQL query, it goes to the Amazon Redshift endpoint, which generates and optimizes a query plan. Amazon Redshift determines what data is local and what is in Amazon S3, generates a plan to minimize the amount of S3 data that must be read, and requests Amazon Redshift Spectrum workers out of a shared resource pool to read and process data from Amazon S3."},"metadata":{"tags":[{"name":"General","namespaceId":"awt-content-topics#ams#c1","id":"awt-content-topics#ams#c1#general-0"}]}},{"fields":{"id":"awt-content-topics#when-should-i-consider-using-ra3-instances-10","itemHeading":"When should I consider using RA3 instances?","itemLongLoc":"

Consider choosing RA3 node types in these cases: \n