Common Data Sources

Sources of Data

  • Relational databases
  • Flat files and XML datasets
  • APIs and web services
  • Web scraping
  • Data streams and feeds

1. Relational databases

  • Examples: business activities, customer transactions, human resource activities, workflows
  • Applications: SQL server, ORACLE, MySQL, IBM DB2

2. Flat files and XML datasets

  • Flat files:
    • store data in plain text format
    • each line, or row, is one record
    • each value is separated by a delimiter
    • all of the data in the a file maps to a single table

3. APIs and web services

  • Examples: twitter and facebook APIs, stock market APIs, data lookup and validation APIs

4. Web scraping

  • Examples: collecting training and testing datasets for machine learning models…
  • Applications: BeautiofulSoup, Scrapy, Pandas, Selenium…

5. Data streams and feeds

  • Examples: Sensor data feeds for monitoring industial or farmoing machinery…

Data repository:

Data lake/ Data warehouse
Big data stores

© 2022 Tiffany's Blog All Rights Reserved. 本站访客数人次 本站总访问量
Theme by hiero