TY - GEN
T1 - An industrial perspective on web scraping characteristics and open issues
AU - Chiapponi, Elisa
AU - Dacier, Marc
AU - Thonnard, Olivier
AU - Fangar, Mohamed
AU - Mattsson, Mattias
AU - Rigal, Vincent
N1 - KAUST Repository Item: Exported on 2022-09-14
PY - 2022/7/25
Y1 - 2022/7/25
N2 - An ongoing battle has been running for more than a decade between e-commerce websites owners and web scrapers. Whenever one party finds a new technique to prevail, the other one comes up with a solution to defeat it. Based on our industrial experience, we know this problem is far from being solved. New solutions are needed to address automated threats. In this work, we will describe the actors taking part in the battle, the weapons at their disposal, and their allies on either side. We will present a real-world setup to explain how e-commerce websites operators try to defend themselves and the open problems they seek solutions for.
AB - An ongoing battle has been running for more than a decade between e-commerce websites owners and web scrapers. Whenever one party finds a new technique to prevail, the other one comes up with a solution to defeat it. Based on our industrial experience, we know this problem is far from being solved. New solutions are needed to address automated threats. In this work, we will describe the actors taking part in the battle, the weapons at their disposal, and their allies on either side. We will present a real-world setup to explain how e-commerce websites operators try to defend themselves and the open problems they seek solutions for.
UR - http://hdl.handle.net/10754/679931
UR - https://ieeexplore.ieee.org/document/9833640/
U2 - 10.1109/DSN-S54099.2022.00012
DO - 10.1109/DSN-S54099.2022.00012
M3 - Conference contribution
SN - 978-1-6654-0261-3
BT - 2022 52nd Annual IEEE/IFIP International Conference on Dependable Systems and Networks - Supplemental Volume (DSN-S)
PB - IEEE
ER -