- Description:
Amazon Customer Reviews (a.k.a. Product Reviews) is one of Amazons iconic products. In a period of over two decades since the first review in 1995, millions of Amazon customers have contributed over a hundred million reviews to express opinions and describe their experiences regarding products on the Amazon.com website. This makes Amazon Customer Reviews a rich source of information for academic researchers in the fields of Natural Language Processing (NLP), Information Retrieval (IR), and Machine Learning (ML), amongst others. Accordingly, we are releasing this data to further research in multiple disciplines related to understanding customer product experiences. Specifically, this dataset was constructed to represent a sample of customer evaluations and opinions, variation in the perception of a product across geographical regions, and promotional intent or bias in reviews.
Over 130+ million customer reviews are available to researchers as part of this release. The data is available in TSV files in the amazon-reviews-pds S3 bucket in AWS US East Region. Each line in the data files corresponds to an individual review (tab delimited, with no quote and escape characters).
Each Dataset contains the following columns : marketplace - 2 letter country code of the marketplace where the review was written. customer_id - Random identifier that can be used to aggregate reviews written by a single author. review_id - The unique ID of the review. product_id - The unique Product ID the review pertains to. In the multilingual dataset the reviews for the same product in different countries can be grouped by the same product_id. product_parent - Random identifier that can be used to aggregate reviews for the same product. product_title - Title of the product. product_category - Broad product category that can be used to group reviews (also used to group the dataset into coherent parts). star_rating - The 1-5 star rating of the review. helpful_votes - Number of helpful votes. total_votes - Number of total votes the review received. vine - Review was written as part of the Vine program. verified_purchase - The review is on a verified purchase. review_headline - The title of the review. review_body - The review text. review_date - The date the review was written.
Homepage: https://s3.amazonaws.com/amazon-reviews-pds/readme.html
Source code:
tfds.datasets.amazon_us_reviews.Builder
Versions:
0.1.0
(default): No release notes.
Feature structure:
FeaturesDict({
'data': FeaturesDict({
'customer_id': string,
'helpful_votes': int32,
'marketplace': string,
'product_category': string,
'product_id': string,
'product_parent': string,
'product_title': string,
'review_body': string,
'review_date': string,
'review_headline': string,
'review_id': string,
'star_rating': int32,
'total_votes': int32,
'verified_purchase': ClassLabel(shape=(), dtype=int64, num_classes=2),
'vine': ClassLabel(shape=(), dtype=int64, num_classes=2),
}),
})
- Feature documentation:
Feature | Class | Shape | Dtype | Description |
---|---|---|---|---|
FeaturesDict | ||||
data | FeaturesDict | |||
data/customer_id | Tensor | string | ||
data/helpful_votes | Tensor | int32 | ||
data/marketplace | Tensor | string | ||
data/product_category | Tensor | string | ||
data/product_id | Tensor | string | ||
data/product_parent | Tensor | string | ||
data/product_title | Tensor | string | ||
data/review_body | Tensor | string | ||
data/review_date | Tensor | string | ||
data/review_headline | Tensor | string | ||
data/review_id | Tensor | string | ||
data/star_rating | Tensor | int32 | ||
data/total_votes | Tensor | int32 | ||
data/verified_purchase | ClassLabel | int64 | ||
data/vine | ClassLabel | int64 |
Supervised keys (See
as_supervised
doc):None
Figure (tfds.show_examples): Not supported.
Citation:
amazon_us_reviews/Wireless_v1_00 (default config)
Config description: A dataset consisting of reviews of Amazon Wireless_v1_00 products in US marketplace. Each product has its own version as specified with it.
Download size:
1.59 GiB
Dataset size:
7.21 GiB
Auto-cached (documentation): No
Splits:
Split | Examples |
---|---|
'train' |
9,002,021 |
- Examples (tfds.as_dataframe):
amazon_us_reviews/Watches_v1_00
Config description: A dataset consisting of reviews of Amazon Watches_v1_00 products in US marketplace. Each product has its own version as specified with it.
Download size:
155.42 MiB
Dataset size:
753.08 MiB
Auto-cached (documentation): No
Splits:
Split | Examples |
---|---|
'train' |
960,872 |
- Examples (tfds.as_dataframe):
amazon_us_reviews/Video_Games_v1_00
Config description: A dataset consisting of reviews of Amazon Video_Games_v1_00 products in US marketplace. Each product has its own version as specified with it.
Download size:
453.19 MiB
Dataset size:
1.78 GiB
Auto-cached (documentation): No
Splits:
Split | Examples |
---|---|
'train' |
1,785,997 |
- Examples (tfds.as_dataframe):
amazon_us_reviews/Video_DVD_v1_00
Config description: A dataset consisting of reviews of Amazon Video_DVD_v1_00 products in US marketplace. Each product has its own version as specified with it.
Download size:
1.41 GiB
Dataset size:
5.31 GiB
Auto-cached (documentation): No
Splits:
Split | Examples |
---|---|
'train' |
5,069,140 |
- Examples (tfds.as_dataframe):
amazon_us_reviews/Video_v1_00
Config description: A dataset consisting of reviews of Amazon Video_v1_00 products in US marketplace. Each product has its own version as specified with it.
Download size:
132.49 MiB
Dataset size:
465.08 MiB
Auto-cached (documentation): No
Splits:
Split | Examples |
---|---|
'train' |
380,604 |
- Examples (tfds.as_dataframe):
amazon_us_reviews/Toys_v1_00
Config description: A dataset consisting of reviews of Amazon Toys_v1_00 products in US marketplace. Each product has its own version as specified with it.
Download size:
799.61 MiB
Dataset size:
3.61 GiB
Auto-cached (documentation): No
Splits:
Split | Examples |
---|---|
'train' |
4,864,249 |
- Examples (tfds.as_dataframe):
amazon_us_reviews/Tools_v1_00
Config description: A dataset consisting of reviews of Amazon Tools_v1_00 products in US marketplace. Each product has its own version as specified with it.
Download size:
318.32 MiB
Dataset size:
1.37 GiB
Auto-cached (documentation): No
Splits:
Split | Examples |
---|---|
'train' |
1,741,100 |
- Examples (tfds.as_dataframe):
amazon_us_reviews/Sports_v1_00
Config description: A dataset consisting of reviews of Amazon Sports_v1_00 products in US marketplace. Each product has its own version as specified with it.
Download size:
832.06 MiB
Dataset size:
3.64 GiB
Auto-cached (documentation): No
Splits:
Split | Examples |
---|---|
'train' |
4,850,360 |
- Examples (tfds.as_dataframe):
amazon_us_reviews/Software_v1_00
Config description: A dataset consisting of reviews of Amazon Software_v1_00 products in US marketplace. Each product has its own version as specified with it.
Download size:
89.66 MiB
Dataset size:
366.16 MiB
Auto-cached (documentation): No
Splits:
Split | Examples |
---|---|
'train' |
341,931 |
- Examples (tfds.as_dataframe):
amazon_us_reviews/Shoes_v1_00
Config description: A dataset consisting of reviews of Amazon Shoes_v1_00 products in US marketplace. Each product has its own version as specified with it.
Download size:
612.50 MiB
Dataset size:
3.06 GiB
Auto-cached (documentation): No
Splits:
Split | Examples |
---|---|
'train' |
4,366,916 |
- Examples (tfds.as_dataframe):
amazon_us_reviews/Pet_Products_v1_00
Config description: A dataset consisting of reviews of Amazon Pet_Products_v1_00 products in US marketplace. Each product has its own version as specified with it.
Download size:
491.92 MiB
Dataset size:
2.11 GiB
Auto-cached (documentation): No
Splits:
Split | Examples |
---|---|
'train' |
2,643,619 |
- Examples (tfds.as_dataframe):
amazon_us_reviews/Personal_Care_Appliances_v1_00
Config description: A dataset consisting of reviews of Amazon Personal_Care_Appliances_v1_00 products in US marketplace. Each product has its own version as specified with it.
Download size:
16.82 MiB
Dataset size:
75.03 MiB
Auto-cached (documentation): Yes
Splits:
Split | Examples |
---|---|
'train' |
85,981 |
- Examples (tfds.as_dataframe):
amazon_us_reviews/PC_v1_00
Config description: A dataset consisting of reviews of Amazon PC_v1_00 products in US marketplace. Each product has its own version as specified with it.
Download size:
1.41 GiB
Dataset size:
5.93 GiB
Auto-cached (documentation): No
Splits:
Split | Examples |
---|---|
'train' |
6,908,554 |
- Examples (tfds.as_dataframe):
amazon_us_reviews/Outdoors_v1_00
Config description: A dataset consisting of reviews of Amazon Outdoors_v1_00 products in US marketplace. Each product has its own version as specified with it.
Download size:
428.16 MiB
Dataset size:
1.83 GiB
Auto-cached (documentation): No
Splits:
Split | Examples |
---|---|
'train' |
2,302,401 |
- Examples (tfds.as_dataframe):
amazon_us_reviews/Office_Products_v1_00
Config description: A dataset consisting of reviews of Amazon Office_Products_v1_00 products in US marketplace. Each product has its own version as specified with it.
Download size:
488.59 MiB
Dataset size:
2.12 GiB
Auto-cached (documentation): No
Splits:
Split | Examples |
---|---|
'train' |
2,642,434 |
- Examples (tfds.as_dataframe):
amazon_us_reviews/Musical_Instruments_v1_00
Config description: A dataset consisting of reviews of Amazon Musical_Instruments_v1_00 products in US marketplace. Each product has its own version as specified with it.
Download size:
184.43 MiB
Dataset size:
792.16 MiB
Auto-cached (documentation): No
Splits:
Split | Examples |
---|---|
'train' |
904,765 |
- Examples (tfds.as_dataframe):
amazon_us_reviews/Music_v1_00
Config description: A dataset consisting of reviews of Amazon Music_v1_00 products in US marketplace. Each product has its own version as specified with it.
Download size:
1.42 GiB
Dataset size:
5.16 GiB
Auto-cached (documentation): No
Splits:
Split | Examples |
---|---|
'train' |
4,751,577 |
- Examples (tfds.as_dataframe):
amazon_us_reviews/Mobile_Electronics_v1_00
Config description: A dataset consisting of reviews of Amazon Mobile_Electronics_v1_00 products in US marketplace. Each product has its own version as specified with it.
Download size:
21.81 MiB
Dataset size:
94.97 MiB
Auto-cached (documentation): Yes
Splits:
Split | Examples |
---|---|
'train' |
104,975 |
- Examples (tfds.as_dataframe):
amazon_us_reviews/Mobile_Apps_v1_00
Config description: A dataset consisting of reviews of Amazon Mobile_Apps_v1_00 products in US marketplace. Each product has its own version as specified with it.
Download size:
532.11 MiB
Dataset size:
3.13 GiB
Auto-cached (documentation): No
Splits:
Split | Examples |
---|---|
'train' |
5,033,376 |
- Examples (tfds.as_dataframe):
amazon_us_reviews/Major_Appliances_v1_00
Config description: A dataset consisting of reviews of Amazon Major_Appliances_v1_00 products in US marketplace. Each product has its own version as specified with it.
Download size:
23.23 MiB
Dataset size:
96.36 MiB
Auto-cached (documentation): Yes
Splits:
Split | Examples |
---|---|
'train' |
96,901 |
- Examples (tfds.as_dataframe):
amazon_us_reviews/Luggage_v1_00
Config description: A dataset consisting of reviews of Amazon Luggage_v1_00 products in US marketplace. Each product has its own version as specified with it.
Download size:
57.53 MiB
Dataset size:
274.07 MiB
Auto-cached (documentation): No
Splits:
Split | Examples |
---|---|
'train' |
348,657 |
- Examples (tfds.as_dataframe):
amazon_us_reviews/Lawn_and_Garden_v1_00
Config description: A dataset consisting of reviews of Amazon Lawn_and_Garden_v1_00 products in US marketplace. Each product has its own version as specified with it.
Download size:
464.22 MiB
Dataset size:
2.00 GiB
Auto-cached (documentation): No
Splits:
Split | Examples |
---|---|
'train' |
2,557,288 |
- Examples (tfds.as_dataframe):
amazon_us_reviews/Kitchen_v1_00
Config description: A dataset consisting of reviews of Amazon Kitchen_v1_00 products in US marketplace. Each product has its own version as specified with it.
Download size:
887.63 MiB
Dataset size:
3.85 GiB
Auto-cached (documentation): No
Splits:
Split | Examples |
---|---|
'train' |
4,880,466 |
- Examples (tfds.as_dataframe):
amazon_us_reviews/Jewelry_v1_00
Config description: A dataset consisting of reviews of Amazon Jewelry_v1_00 products in US marketplace. Each product has its own version as specified with it.
Download size:
235.58 MiB
Dataset size:
1.22 GiB
Auto-cached (documentation): No
Splits:
Split | Examples |
---|---|
'train' |
1,767,753 |
- Examples (tfds.as_dataframe):
amazon_us_reviews/Home_Improvement_v1_00
Config description: A dataset consisting of reviews of Amazon Home_Improvement_v1_00 products in US marketplace. Each product has its own version as specified with it.
Download size:
480.02 MiB
Dataset size:
2.08 GiB
Auto-cached (documentation): No
Splits:
Split | Examples |
---|---|
'train' |
2,634,781 |
- Examples (tfds.as_dataframe):
amazon_us_reviews/Home_Entertainment_v1_00
Config description: A dataset consisting of reviews of Amazon Home_Entertainment_v1_00 products in US marketplace. Each product has its own version as specified with it.
Download size:
184.22 MiB
Dataset size:
741.78 MiB
Auto-cached (documentation): No
Splits:
Split | Examples |
---|---|
'train' |
705,889 |
- Examples (tfds.as_dataframe):
amazon_us_reviews/Home_v1_00
Config description: A dataset consisting of reviews of Amazon Home_v1_00 products in US marketplace. Each product has its own version as specified with it.
Download size:
1.01 GiB
Dataset size:
4.60 GiB
Auto-cached (documentation): No
Splits:
Split | Examples |
---|---|
'train' |
6,221,559 |
- Examples (tfds.as_dataframe):
amazon_us_reviews/Health_Personal_Care_v1_00
Config description: A dataset consisting of reviews of Amazon Health_Personal_Care_v1_00 products in US marketplace. Each product has its own version as specified with it.
Download size:
964.34 MiB
Dataset size:
4.21 GiB
Auto-cached (documentation): No
Splits:
Split | Examples |
---|---|
'train' |
5,331,449 |
- Examples (tfds.as_dataframe):
amazon_us_reviews/Grocery_v1_00
Config description: A dataset consisting of reviews of Amazon Grocery_v1_00 products in US marketplace. Each product has its own version as specified with it.
Download size:
382.74 MiB
Dataset size:
1.77 GiB
Auto-cached (documentation): No
Splits:
Split | Examples |
---|---|
'train' |
2,402,458 |
- Examples (tfds.as_dataframe):
amazon_us_reviews/Gift_Card_v1_00
Config description: A dataset consisting of reviews of Amazon Gift_Card_v1_00 products in US marketplace. Each product has its own version as specified with it.
Download size:
11.57 MiB
Dataset size:
93.82 MiB
Auto-cached (documentation): Yes
Splits:
Split | Examples |
---|---|
'train' |
149,086 |
- Examples (tfds.as_dataframe):
amazon_us_reviews/Furniture_v1_00
Config description: A dataset consisting of reviews of Amazon Furniture_v1_00 products in US marketplace. Each product has its own version as specified with it.
Download size:
142.08 MiB
Dataset size:
646.69 MiB
Auto-cached (documentation): No
Splits:
Split | Examples |
---|---|
'train' |
792,113 |
- Examples (tfds.as_dataframe):
amazon_us_reviews/Electronics_v1_00
Config description: A dataset consisting of reviews of Amazon Electronics_v1_00 products in US marketplace. Each product has its own version as specified with it.
Download size:
666.45 MiB
Dataset size:
2.74 GiB
Auto-cached (documentation): No
Splits:
Split | Examples |
---|---|
'train' |
3,093,869 |
- Examples (tfds.as_dataframe):
amazon_us_reviews/Digital_Video_Games_v1_00
Config description: A dataset consisting of reviews of Amazon Digital_Video_Games_v1_00 products in US marketplace. Each product has its own version as specified with it.
Download size:
26.17 MiB
Dataset size:
124.19 MiB
Auto-cached (documentation): Yes
Splits:
Split | Examples |
---|---|
'train' |
145,431 |
- Examples (tfds.as_dataframe):
amazon_us_reviews/Digital_Video_Download_v1_00
Config description: A dataset consisting of reviews of Amazon Digital_Video_Download_v1_00 products in US marketplace. Each product has its own version as specified with it.
Download size:
483.49 MiB
Dataset size:
2.68 GiB
Auto-cached (documentation): No
Splits:
Split | Examples |
---|---|
'train' |
4,057,147 |
- Examples (tfds.as_dataframe):
amazon_us_reviews/Digital_Software_v1_00
Config description: A dataset consisting of reviews of Amazon Digital_Software_v1_00 products in US marketplace. Each product has its own version as specified with it.
Download size:
18.12 MiB
Dataset size:
89.59 MiB
Auto-cached (documentation): Yes
Splits:
Split | Examples |
---|---|
'train' |
102,084 |
- Examples (tfds.as_dataframe):
amazon_us_reviews/Digital_Music_Purchase_v1_00
Config description: A dataset consisting of reviews of Amazon Digital_Music_Purchase_v1_00 products in US marketplace. Each product has its own version as specified with it.
Download size:
241.82 MiB
Dataset size:
1.20 GiB
Auto-cached (documentation): No
Splits:
Split | Examples |
---|---|
'train' |
1,688,884 |
- Examples (tfds.as_dataframe):
amazon_us_reviews/Digital_Ebook_Purchase_v1_00
Config description: A dataset consisting of reviews of Amazon Digital_Ebook_Purchase_v1_00 products in US marketplace. Each product has its own version as specified with it.
Download size:
2.51 GiB
Dataset size:
10.82 GiB
Auto-cached (documentation): No
Splits:
Split | Examples |
---|---|
'train' |
12,520,722 |
- Examples (tfds.as_dataframe):
amazon_us_reviews/Camera_v1_00
Config description: A dataset consisting of reviews of Amazon Camera_v1_00 products in US marketplace. Each product has its own version as specified with it.
Download size:
422.15 MiB
Dataset size:
1.69 GiB
Auto-cached (documentation): No
Splits:
Split | Examples |
---|---|
'train' |
1,801,974 |
- Examples (tfds.as_dataframe):
amazon_us_reviews/Books_v1_00
Config description: A dataset consisting of reviews of Amazon Books_v1_00 products in US marketplace. Each product has its own version as specified with it.
Download size:
2.55 GiB
Dataset size:
10.01 GiB
Auto-cached (documentation): No
Splits:
Split | Examples |
---|---|
'train' |
10,319,090 |
- Examples (tfds.as_dataframe):
amazon_us_reviews/Beauty_v1_00
Config description: A dataset consisting of reviews of Amazon Beauty_v1_00 products in US marketplace. Each product has its own version as specified with it.
Download size:
871.73 MiB
Dataset size:
3.88 GiB
Auto-cached (documentation): No
Splits:
Split | Examples |
---|---|
'train' |
5,115,666 |
- Examples (tfds.as_dataframe):
amazon_us_reviews/Baby_v1_00
Config description: A dataset consisting of reviews of Amazon Baby_v1_00 products in US marketplace. Each product has its own version as specified with it.
Download size:
340.84 MiB
Dataset size:
1.45 GiB
Auto-cached (documentation): No
Splits:
Split | Examples |
---|---|
'train' |
1,752,932 |
- Examples (tfds.as_dataframe):
amazon_us_reviews/Automotive_v1_00
Config description: A dataset consisting of reviews of Amazon Automotive_v1_00 products in US marketplace. Each product has its own version as specified with it.
Download size:
555.18 MiB
Dataset size:
2.54 GiB
Auto-cached (documentation): No
Splits:
Split | Examples |
---|---|
'train' |
3,514,942 |
- Examples (tfds.as_dataframe):
amazon_us_reviews/Apparel_v1_00
Config description: A dataset consisting of reviews of Amazon Apparel_v1_00 products in US marketplace. Each product has its own version as specified with it.
Download size:
618.59 MiB
Dataset size:
3.99 GiB
Auto-cached (documentation): No
Splits:
Split | Examples |
---|---|
'train' |
5,906,333 |
- Examples (tfds.as_dataframe):
amazon_us_reviews/Digital_Ebook_Purchase_v1_01
Config description: A dataset consisting of reviews of Amazon Digital_Ebook_Purchase_v1_01 products in US marketplace. Each product has its own version as specified with it.
Download size:
1.21 GiB
Dataset size:
4.87 GiB
Auto-cached (documentation): No
Splits:
Split | Examples |
---|---|
'train' |
5,101,693 |
- Examples (tfds.as_dataframe):
amazon_us_reviews/Books_v1_01
Config description: A dataset consisting of reviews of Amazon Books_v1_01 products in US marketplace. Each product has its own version as specified with it.
Download size:
2.51 GiB
Dataset size:
8.48 GiB
Auto-cached (documentation): No
Splits:
Split | Examples |
---|---|
'train' |
6,106,719 |
- Examples (tfds.as_dataframe):
amazon_us_reviews/Books_v1_02
Config description: A dataset consisting of reviews of Amazon Books_v1_02 products in US marketplace. Each product has its own version as specified with it.
Download size:
1.24 GiB
Dataset size:
4.15 GiB
Auto-cached (documentation): No
Splits:
Split | Examples |
---|---|
'train' |
3,105,520 |
- Examples (tfds.as_dataframe):