7 Jul 2013

In this article, I will show you how to extract data from websites quickly and regularly using WebHarvy. But first, let's talk briefly about web scraping.


Web scraping, web harvesting, or web data extraction is data scraping used for extracting data from websites. Web scraping software may access the World Wide Web directly using the Hypertext Transfer Protocol, or through a web browser. While web scraping can be done manually by a software user, the term typically refers to automated processes implemented using a bot or web crawler. It is a form of copying, in which specific data is gathered and copied from the web, typically into a central local database or spreadsheet, for later retrieval or analysis.

An example makes this easier to understand. Let's say you work at a marketing firm and you are asked to collect the price of a product as listed by various companies. Capturing that price list from their websites with software is called web scraping. Many companies use web scraping for tasks like this, and of course you can also use it for your own personal projects.
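
To make the idea concrete, here is a minimal sketch of what a scraping tool does behind the scenes: it reads a page's HTML and pulls out the pieces you care about. The sample markup, the class names `product`, `name` and `price`, and the helper `extract_products` are all made up for illustration. Real websites use their own structure, and this is not how WebHarvy itself is implemented.

```python
from html.parser import HTMLParser

# Hypothetical page fragment; real sites use their own markup and class names.
SAMPLE_HTML = """
<ul>
  <li class="product"><span class="name">Desk Lamp</span><span class="price">$24.99</span></li>
  <li class="product"><span class="name">Office Chair</span><span class="price">$149.00</span></li>
</ul>
"""


class ProductParser(HTMLParser):
    """Collects (name, price) pairs from spans with class 'name' and 'price'."""

    def __init__(self):
        super().__init__()
        self.products = []   # finished (name, price) tuples
        self._field = None   # field the parser is currently inside, if any
        self._current = {}   # fields gathered for the product in progress

    def handle_starttag(self, tag, attrs):
        css_class = dict(attrs).get("class")
        if tag == "span" and css_class in ("name", "price"):
            self._field = css_class

    def handle_data(self, data):
        # Only keep text that sits inside a span we are interested in.
        if self._field:
            self._current[self._field] = data.strip()

    def handle_endtag(self, tag):
        if tag == "span":
            self._field = None
        elif tag == "li" and self._current:
            # A list item has ended, so the product is complete.
            self.products.append(
                (self._current.get("name"), self._current.get("price"))
            )
            self._current = {}


def extract_products(html):
    """Return a list of (name, price) tuples found in the given HTML."""
    parser = ProductParser()
    parser.feed(html)
    return parser.products


if __name__ == "__main__":
    for name, price in extract_products(SAMPLE_HTML):
        print(name, price)
```

A point-and-click tool like WebHarvy saves you from writing this kind of code by hand: you select the elements on the page and it works out the pattern for you.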

First, let's download our software. You can download it from the link below:


After installing and running the software, you will see this:


Now enter the address of the website you want to capture data from.


Click the "Start" button.


Next, we click on the part of the page whose data we want to extract. Let's start with the product name: click on a product name, and a window will appear. Click the "Capture Text" button.


Give the captured field a name:


When we look at the "Capture Data Preview" section, we see that it lists the names of all the products on the page.


Now, let's capture the prices of the products before the discount. Click on a product's pre-discount price, and when the window appears, click the "Capture Text" button.


We see that the regular prices have been added right next to the product names.


After doing the same for the discounted prices, we now have three columns: "product name", "regular price" and "reduced price".
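
If you want to work with these three columns in code later, a rough sketch like the one below shows how they could be lined up into one row per product. The sample values and the helpers `parse_price` and `build_rows` are my own illustration, not anything WebHarvy produces, and I am assuming prices use "." as the decimal separator.

```python
from decimal import Decimal

# Hypothetical captured columns, one list per field, in page order.
names = ["Desk Lamp", "Office Chair"]
regular_prices = ["$29.99", "$199.00"]
reduced_prices = ["$24.99", "$149.00"]


def parse_price(text):
    """Turn a price string such as '$1,299.00' into a Decimal.

    Assumes '.' is the decimal separator; currency symbols and
    thousands separators are dropped.
    """
    cleaned = "".join(ch for ch in text if ch.isdigit() or ch == ".")
    return Decimal(cleaned)


def build_rows(names, regular, reduced):
    """Combine the three columns into one dictionary per product."""
    if not (len(names) == len(regular) == len(reduced)):
        raise ValueError("columns have different lengths")
    return [
        {
            "product name": name,
            "regular price": parse_price(reg),
            "reduced price": parse_price(red),
        }
        for name, reg, red in zip(names, regular, reduced)
    ]


if __name__ == "__main__":
    for row in build_rows(names, regular_prices, reduced_prices):
        print(row)
```

Using `Decimal` instead of `float` avoids rounding surprises when you later compare or subtract prices.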


Now let's click on the "Stop" button. Then we click on the "Start-Mine" button.


We click on the "Start" button again.


As you can see, the extracted data appears in the table. Now we click the Export button.


We can export the data in different formats. I want to export it in .xlsx format. Select the format and the path, then click the Export button.
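
For comparison, here is a small sketch of what an export step looks like if you do it yourself in Python. I use CSV here rather than .xlsx because Python's standard library can write it without extra packages, and Excel can open CSV files. The rows, the column names and the helpers `rows_to_csv` and `export_csv` are illustrative assumptions, not WebHarvy's own export logic.

```python
import csv
import io

FIELDS = ["product name", "regular price", "reduced price"]

# Hypothetical extracted rows, shaped like the table in the tool.
rows = [
    {"product name": "Desk Lamp", "regular price": "29.99", "reduced price": "24.99"},
    {"product name": "Office Chair", "regular price": "199.00", "reduced price": "149.00"},
]


def rows_to_csv(rows, fieldnames=FIELDS):
    """Return the rows as CSV text, with a header line first."""
    buffer = io.StringIO(newline="")
    writer = csv.DictWriter(buffer, fieldnames=fieldnames)
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


def export_csv(rows, path):
    """Write the rows to a CSV file at the given path."""
    # newline="" keeps the csv module's own line endings untouched.
    with open(path, "w", newline="", encoding="utf-8") as handle:
        handle.write(rows_to_csv(rows))


if __name__ == "__main__":
    print(rows_to_csv(rows))
```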


After the export finishes, open the file.


And as you can see, the data I extracted is displayed properly in my Excel file.

Thanks for reading!


