Você está na página 1de 11

A Dive into

Web Scraper
World
Introduction
What is web scraping and when would you want to use it?

Web scraping is related to


web indexing, whose task is to
index information on the web
with the help of a bot or web
crawler.

2
IS WEB
SCRAPING
LEGAL?
There are lots of different views of
different people on the legal and illegal
aspects of Scraping the Web. In today’s
world we can see many examples of the
legal use of Web Scraper such as price
comparison websites and reviewing
Websites.
“The key part is what you
want to do with the
scraped data.”

If you use it for your own, personal


use, then it is legal as it falls under
fair use doctrine.

The complications start if you


want to use scraped data for
other, especially commercial,
purposes.
4
Numerous Lawsuits that Tie
Web Scraping

⬡ Case of AT&T’s
“Andrew Auernheimer” was convicted of hacking based on the act of
web scraping. Although the data was unprotected and publically
available via AT&T’s website, the fact that he wrote web scrapers to
harvest that data in mass amounted to “brute force attack”.

This charge is a felony violation that is on par with hacking


or denial of service attacks and carries up to a 15-year
sentence for each charge.
Numerous Lawsuits that Tie
any Web Scraping
⬡ Case of “The Ebay’s Action”
Not much could be done about the practice until in 2000 eBay filed a
preliminary injunction against Bidder’s Edge. In the injunction eBay
claimed that the use of bots on the site, against the will of the company
violated Trespass to Chattels law.

The court granted the injunction because users had to opt in


and agree to the terms of service on the site and that a large
number of bots could be disruptive to eBay’s computer
systems.

6
Designing principles and
methods

Web Scraper

Web Crawler Data Extraction


for crawling from crawled
links. links.

7
Web Crawler
A Web Crawler is an internet bot
which systematically browses
the world wide web , for the
purpose of web indexing.
Data flow diagram

webpages Web structured


scraping data

9
Mission of our project!

❖ We look to bring about a massive change in


the sorting process of companies during
placements from a student's perspective.

❖ This initiative will extensively help students find


companies which suit their domain and need
more effectively reducing a large part of
manual work involved in the current setting.

10
References
❖ https://en.wikipedia.org/wiki/Web_scraping.h
tml

❖ https://www.webharvy.com/articles/what-is-
web-scraping.html

❖ http://resources.distilnetworks.com/h/i/5382
2104-is-webscraping-illegal-depends-on-what-
the-meaning-of-theword-is-is/181642

❖ https://www.quora.com/What-is-the-legality-
of-web-scraping

❖ https://en.wikipedia.org/wiki/Web_crawler

❖ Kolari , P. and Joshi, A. , “Web mining : research


and practice , Computing in Science &
Engineering”, IEEE Transactions on Knowledge
and Data Engineering, vol. 6, no. 2,Vol. 6 , No. 4 ,
11
2004