1 | 1 | # Bright Data Python SDK Changelog |
2 | 2 |
| 3 | +## Version 2.2.1 - 100 Datasets API |
| 4 | + |
| 5 | +### ✨ New Features |
| 6 | + |
| 7 | +#### Expanded Datasets Coverage |
| 8 | +Added 92 new dataset integrations, bringing the total to **100 datasets**: |
| 9 | + |
| 10 | +- **Luxury Brands**: Loewe, Berluti, Moynat, Hermes, Delvaux, Prada, Montblanc, YSL, Dior, Balenciaga, Bottega Veneta, Celine, Chanel, Fendi |
| 11 | +- **E-commerce**: Amazon (Reviews, Sellers), Walmart, Shopee, Lazada, Zalando, Sephora, Zara, Mango, Massimo Dutti, Asos, Shein, Ikea, H&M, Lego, Mouser, Digikey |
| 12 | +- **Social Media**: Instagram (Profiles, Posts), TikTok, Pinterest (Posts, Profiles), YouTube (Profiles, Videos, Comments), Facebook Pages Posts |
| 13 | +- **Real Estate**: Zillow, Airbnb, Australia Real Estate, Otodom Poland, Zonaprop Argentina, Metrocuadrado, Infocasas Uruguay, Properati, Toctoc, Inmuebles24 Mexico, Yapo Chile |
| 14 | +- **Business Data**: Glassdoor (Companies, Reviews, Jobs), Indeed (Companies, Jobs), ZoomInfo, PitchBook, G2, Trustpilot, TrustRadius, Owler, Slintel, Manta, VentureRadar, Companies Enriched, Employees Enriched |
| 15 | +- **Other**: World Zipcodes, US Lawyers, Google Maps Reviews, Yelp, Xing Profiles, OLX Brazil, Webmotors Brasil, Chileautos, LinkedIn Jobs |
| 16 | + |
| 17 | +#### SERP Pagination Support |
| 18 | +Added sequential querying to retrieve more than 10 search results from Google: |
| 19 | + |
| 20 | +```python |
| 21 | +async with BrightDataClient() as client:
| 22 | +    # Get up to 50 results with automatic pagination
| 23 | +    results = await client.search.google(
| 24 | +        query="python programming",
| 25 | +        num_results=50  # Fetches multiple pages sequentially
| 26 | +    )
| 27 | +``` |
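
Sequential pagination of this kind can be pictured as a loop over result offsets. The sketch below is illustrative only and is not the SDK's implementation: `page_offsets`, `fetch_sequentially`, `fake_fetch_page`, and the page size of 10 are hypothetical names and assumptions introduced here.

```python
import asyncio
from typing import Awaitable, Callable, List

PAGE_SIZE = 10  # assumption: one results page holds up to 10 results


def page_offsets(num_results: int, page_size: int = PAGE_SIZE) -> List[int]:
    """Return the start offset of each page needed to cover num_results."""
    if num_results <= 0:
        return []
    return list(range(0, num_results, page_size))


async def fetch_sequentially(
    fetch_page: Callable[[int], Awaitable[List[str]]],
    num_results: int,
) -> List[str]:
    """Fetch pages one after another until enough results are collected."""
    results: List[str] = []
    for start in page_offsets(num_results):
        page = await fetch_page(start)
        if not page:  # stop early when no more results are available
            break
        results.extend(page)
    # The last page may overshoot, so trim to the requested count.
    return results[:num_results]


async def fake_fetch_page(start: int) -> List[str]:
    """Hypothetical stand-in for a real search request."""
    return [f"result-{i}" for i in range(start, start + PAGE_SIZE)]


collected = asyncio.run(fetch_sequentially(fake_fetch_page, 25))
```

Fetching pages one at a time keeps results in order and lets the loop stop as soon as a page comes back empty, at the cost of higher latency than concurrent requests.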
| 28 | + |
| 29 | +--- |
| 30 | + |
| 31 | +## Version 2.2.0 - Datasets API |
| 32 | + |
| 33 | +### ✨ New Features |
| 34 | + |
| 35 | +#### Datasets API |
| 36 | +Access Bright Data's pre-collected datasets with filtering and export capabilities. |
| 37 | + |
| 38 | +```python |
| 39 | +async with BrightDataClient() as client:
| 40 | +    # Filter dataset records
| 41 | +    snapshot_id = await client.datasets.amazon_products(
| 42 | +        filter={"name": "rating", "operator": ">=", "value": 4.5},
| 43 | +        records_limit=100
| 44 | +    )
| 45 | +    # Download results
| 46 | +    data = await client.datasets.amazon_products.download(snapshot_id)
| 47 | +``` |
| 48 | + |
| 49 | +**8 Datasets:** LinkedIn Profiles, LinkedIn Companies, Amazon Products, Crunchbase Companies, IMDB Movies, NBA Players Stats, Goodreads Books, World Population |
| 50 | + |
| 51 | +**Export Utilities:** |
| 52 | +```python |
| 53 | +from brightdata.datasets import export_json, export_csv |
| 54 | +export_json(data, "results.json") |
| 55 | +export_csv(data, "results.csv") |
| 56 | +``` |
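
As a rough picture of what export helpers like these do, here is a standard-library sketch. It is not the SDK's code: `write_json` and `write_csv` are hypothetical names, and the sketch assumes the downloaded data is a list of flat dictionaries.

```python
import csv
import json
from typing import Any, Dict, List


def write_json(records: List[Dict[str, Any]], path: str) -> None:
    """Write records to a UTF-8 JSON file."""
    with open(path, "w", encoding="utf-8") as fh:
        json.dump(records, fh, ensure_ascii=False, indent=2)


def write_csv(records: List[Dict[str, Any]], path: str) -> None:
    """Write records to CSV, using the union of all keys as the header."""
    fieldnames: List[str] = []
    for record in records:
        for key in record:
            if key not in fieldnames:  # keep first-seen column order
                fieldnames.append(key)
    with open(path, "w", encoding="utf-8", newline="") as fh:
        writer = csv.DictWriter(fh, fieldnames=fieldnames)
        writer.writeheader()
        writer.writerows(records)  # missing keys become empty cells
```

Collecting the header from every record, not only the first, avoids dropping columns when records have different fields.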
| 57 | + |
| 58 | +### 📓 Notebooks |
| 59 | +- `notebooks/datasets/linkedin/linkedin.ipynb` - LinkedIn datasets (profiles & companies) |
| 60 | +- `notebooks/datasets/amazon/amazon.ipynb` - Amazon products dataset |
| 61 | +- `notebooks/datasets/crunchbase/crunchbase.ipynb` - Crunchbase companies dataset |
| 62 | + |
| 63 | +--- |
| 64 | + |
3 | 65 | ## Version 2.1.2 - Web Scrapers & Notebooks |
4 | 66 |
5 | 67 | ### 🐛 Bug Fixes |