Skip to the main content

Professional paper

Data collection automation with the help of Selenium

Denis Jelaš ; Visoka škola za menadžment i dizajn Aspira
Petar Olujić orcid id orcid.org/0009-0004-3728-3670 ; Visoka škola za menadžment i dizajn Aspira
Karlo Leder orcid id orcid.org/0009-0008-3995-0122 ; Visoka škola za menadžment i dizajn Aspira


Full text: croatian pdf 555 Kb

page 51-57

downloads: 208

cite


Abstract

Data collection automation is a very relevant topic because data is the most valuable resource today, and it is interesting to demonstrate how data collection from the Internet can be automated. Selenium is one of the most common tools that are used today for automating web browsers for the purpose of data collection. Therefore, it was chosen to specify the topic of data collection from web pages (web scraping) and focus on a specific area.
The paper provides a general overview of web browser automation for data collection, covering its history, techniques, technologies, and legal aspects. Finally, a practical example of using a program with Selenium for data collection from the website of the Croatian Academic and Research Network (CARNet) is presented. The program is developed as a console application in programming tool .NET using the C# programming language.

Keywords

data collection automation, web scraping, Selenium

Hrčak ID:

311808

URI

https://hrcak.srce.hr/311808

Publication date:

21.12.2023.

Article data in other languages: croatian

Visits: 767 *