04/02/2021
Tired of working with basic datasets that are not relevant to real-world business problems?
Here are few recommendations for data repositories,
1) Kaggle: Hosts a wide range of publicly available datasets
2) UCI- ML Repository: the University of California Irvine maintains 559 data sets as a service to the ML community
3) Google Dataset Search: Using simple keyword search, users can discover 1000;s of repositories across the web
4) OPen Data on AWS: Exists to help people discover and share datasets that are available via AWS resources
5) Access Twitter data through Twitter API to extract publicly available data through endpoints