Professional paper
Data collection automation with the help of Selenium
Denis Jelaš; Visoka škola za menadžment i dizajn Aspira
Petar Olujić (orcid.org/0009-0004-3728-3670); Visoka škola za menadžment i dizajn Aspira
Karlo Leder (orcid.org/0009-0008-3995-0122); Visoka škola za menadžment i dizajn Aspira
Abstract
Data collection automation is a relevant topic because data is widely regarded as one of the most valuable resources today, which makes it worthwhile to demonstrate how data collection from the Internet can be automated. Selenium is one of the most commonly used tools for automating web browsers for the purpose of data collection. It was therefore chosen as the means of narrowing the broad topic of data collection from web pages (web scraping) to a specific area.
The paper provides a general overview of web browser automation for data collection, covering its history, techniques, technologies, and legal aspects. It concludes with a practical example: a program that uses Selenium to collect data from the website of the Croatian Academic and Research Network (CARNet). The program is developed as a console application on the .NET platform using the C# programming language.
Keywords
data collection automation, web scraping, Selenium
Hrčak ID: 311808
Publication date: 21.12.2023.