Press Release

<< Back
Amazon Web Services Launches "Public Data Sets on AWS," Enabling Developers and Researchers to Cost-Effectively Create, Share and Consume Massive Data Sets Available Free of Charge

AWS Invites Developers, Researchers, Universities and Businesses to Share Their Data in the Cloud at No Charge; Examples of Data Sets Include Mapping of the Human Genome, Us Census Data, and Economic Statistics

SEATTLE--(BUSINESS WIRE)--Dec. 4, 2008--Amazon Web Services LLC (AWS), a subsidiary of Inc. (NASDAQ: AMZN), today launched "Public Data Sets on AWS," providing access to a centralized repository of public data sets that can be seamlessly integrated into AWS cloud-based applications. AWS is hosting the public data sets at no charge for the community, and like all of AWS services, users pay only for the compute and storage they consume with their own applications. Data sets already available include various U.S. Census databases from the U.S. Census Bureau, 3-D chemical structures provided by Indiana University, and an annotated form of the Human Genome from Ensembl. More data sets will be available soon, including a wide range of economic statistics from the Bureau of Economic Analysis and additional scientific data sets.

Previously, large data sets such as the Human Genome and U.S. Census data required many hours to locate, download and customize. Now, anyone can access these large data sets from their Amazon Elastic Compute Cloud (Amazon EC2) instances and start computing on the data within minutes. By growing the number of people with access to important and useful data, and making it easy to compute on that data with cost-efficient services such as Amazon EC2, AWS hopes to fuel innovation and further accelerate the pace of new discoveries.

"For over five years AWS has been working to lower the barriers to entry, level the playing field, and make it possible for our customers to be successful based on their ideas, not on their resources," said Adam Selipsky, Vice President of Product Management and Developer Relations for Amazon Web Services. "Public Data Sets on AWS is the latest of these efforts, and we can't wait to see the discoveries and innovations that could stem from this ecosystem."

Select public data sets are hosted on Amazon EC2 for free as Amazon Elastic Block Store (Amazon EBS) snapshots. Amazon EC2 customers can access this data by creating their own personal Amazon EBS volumes, using the public data set snapshots as a starting point. They can then access, modify and perform computation on these volumes directly using their Amazon EC2 instances and just pay for the compute and storage resources that they use. If available, researchers can also use pre-configured Amazon Machine Images (AMIs) with tools like Inquiry by BioTeam to perform their analysis.

"Public Data Sets on AWS will enable me and many of my colleagues to collaborate with each other by sharing our commonly used data sets, research environments and tools," said Dr. Peter Tonellato from the Harvard Medical School. "We can set up a controlled environment in minutes, run our computational analysis for a couple of hours, and shut down the environment. Our results are completely repeatable. I only pay for the compute time I use, and more importantly I can spend more time focusing on research, not downloading and setting up computational infrastructure."

"Bioinformatics is a hugely exciting area which is providing much insight into our understanding of biology and, particularly, the genetic basis of many human diseases like cancer and diabetes. The genome is a complex thing, however; it presents us with a potential source of invaluable information but also with great challenges in how to store, analyze and annotate it, and how to make both the raw genomic information and our annotations available to as many people as possible," said Dr. Glenn Proctor, Ensembl Software Coordinator at the EBI. "Ensembl's approach has always been to try and lower the barriers to entry so that a researcher using a desktop PC in a lab or a laptop in an airport departure lounge has access to high-quality, up to the minute genetic information that they can use in their work. Amazon EC2 allows us to go even further and make all our data available in a robust, scalable and flexible form that anyone with an AWS account can use."

For more information about the Public Data Sets on AWS, to get started using a data set, or to submit a data set, please visit

About, Inc. (NASDAQ:AMZN), a Fortune 500 company based in Seattle, opened on the World Wide Web in July 1995 and today offers Earth's Biggest Selection., Inc., seeks to be Earth's most customer-centric company, where customers can find and discover anything they might want to buy online, and endeavors to offer its customers the lowest possible prices. and other sellers offer millions of unique new, refurbished and used items in categories such as books, movies, music & games, digital downloads, electronics & computers, home & garden, toys, kids & baby, grocery, apparel, shoes & jewelry, health & beauty, sports & outdoors, and tools, auto & industrial.

Amazon Web Services provides Amazon's developer customers with access to in-the-cloud infrastructure services based on Amazon's own back-end technology platform, which developers can use to enable virtually any type of business. Examples of the services offered by Amazon Web Services are Amazon Elastic Compute Cloud (Amazon EC2), Amazon Simple Storage Service (Amazon S3), Amazon SimpleDB, Amazon Simple Queue Service (Amazon SQS), Amazon Flexible Payments Service (Amazon FPS), and Amazon Mechanical Turk.

Amazon and its affiliates operate websites, including,,,,,, and the Joyo Amazon websites at and

As used herein, "," "we," "our" and similar terms include, Inc., and its subsidiaries, unless the context indicates otherwise.

Amazon Forward-Looking Statements

This announcement contains forward-looking statements within the meaning of Section 27A of the Securities Act of 1933 and Section 21E of the Securities Exchange Act of 1934. Actual results may differ significantly from management's expectations. These forward-looking statements involve risks and uncertainties that include, among others, risks related to competition, management of growth, new products, services and technologies, potential fluctuations in operating results, international expansion, outcomes of legal proceedings and claims, fulfillment center optimization, seasonality, commercial agreements, acquisitions and strategic transactions, foreign exchange rates, system interruption, significant amount of indebtedness, inventory, government regulation and taxation, payments and fraud. More information about factors that potentially could affect's financial results is included in's filings with the Securities and Exchange Commission, including its Annual Report on Form 10-K for the year ended December 31, 2007, and subsequent filings.

Media Hotline, 206-266-7180

Source:, Inc.