The Academies and Workshops Program is aimed at professionals who work with the collection, organization or analysis of data. Participation costs vary depending on the type of entity in which the participant performs their primary work.
Resource: Lanselotte Oliveras Vega, MS. Assistant for Statistical Projects, Statistics Institute of Puerto Rico.
Prerequisites:
· Basic knowledge of statistics
· Advanced knowledge of R and Rstudio
· Access to a computer with an internet connection
· Basic mastery of the use of computers, Internet and online education platforms
Description:
This virtual workshop focuses on teaching techniques for web scraping using R, through the rvest packages. El Web scraping or “web data extraction” consists of an automated process for collecting data and information from websites, speeding up the initial phase of obtaining data. The practical application of these skills in data analysis will be explored, always respecting the ethical and legal considerations of Web scraping.
Objectives:
· Explain ethical and legal aspects of Webscraping.
· Introduce participants to the techniques of Web scraping using the `rvest` library.
· Analyze data extracted from web pages.
Goals:
· Know the ethical and legal implications of Web scraping.
· Develop skills in handling dervest for scraping static web pages.