An industrial perspective on web scraping characteristics and open issues

Elisa Chiapponi, Marc Dacier, Olivier Thonnard, Mohamed Fangar, Mattias Mattsson, Vincent Rigal

Research output: Chapter in Book/Report/Conference proceedingConference contribution

8 Scopus citations

Abstract

An ongoing battle has been running for more than a decade between e-commerce websites owners and web scrapers. Whenever one party finds a new technique to prevail, the other one comes up with a solution to defeat it. Based on our industrial experience, we know this problem is far from being solved. New solutions are needed to address automated threats. In this work, we will describe the actors taking part in the battle, the weapons at their disposal, and their allies on either side. We will present a real-world setup to explain how e-commerce websites operators try to defend themselves and the open problems they seek solutions for.
Original languageEnglish (US)
Title of host publication2022 52nd Annual IEEE/IFIP International Conference on Dependable Systems and Networks - Supplemental Volume (DSN-S)
PublisherIEEE
ISBN (Print)978-1-6654-0261-3
DOIs
StatePublished - Jul 25 2022

Bibliographical note

KAUST Repository Item: Exported on 2022-09-14

Fingerprint

Dive into the research topics of 'An industrial perspective on web scraping characteristics and open issues'. Together they form a unique fingerprint.

Cite this