Mastering the Data Science Workflow


拍摄者 Fer Troulik未飞溅

The collection stage involves acquiring the necessary data in order to perform a meaningful analysis based upon accurate information.

Techniques

Data Requirements
Define which data is needed to properly approach the project (e.g. format, variables, time range, granularity)

Data Sources
Find reliable and relevant data sources (e.g. databases, APIs, files, sensor readings)

Authentication
Secure necessary permissions to access the data (e.g. email/password, OAuth, API key, robots.txt)

Collection
Acquire the data using appropriate methods (e.g. SQL queries, API calls, web scraping, manual data entry)

Data Management
Handle the data in accordance with best practices (e.g. data quality, data governance, data security)



来源链接

发表评论

您的电子邮件地址不会被公开。 必需的地方已做标记 *

您可以使用这些HTML标签和属性: <a href="" title=""> <abbr title=""> <acronym title=""> <b> <blockquote cite=""> <cite> <code> <del datetime=""> <em> <i> <q cite=""> <s> <strike> <strong>

zh_CNChinese