The parser property avito (version 1.0)

Affiliates: 0,14 $how to earn
Pay with:
i agree with "Terms for Customers"
Sold: 0
Uploaded: 21.04.2014
Content: 24,95 kB

Product description

Парсер недвижимости с сайта avito (версия 1.0):
- возможность парсить несколько страниц раздела недвижимости
- PHP код парсера содержит качественные комментарии и будет понятен большинству программистов и администраторов

Additional information

The parser property website avito (version 1.0)

The script is implemented in accordance with the following terms of reference:

1. Problem:
Write a parser for the site «Avito» for «Real Estate - Residential - Purchase":

2. Requirements:
The parser must be written in PHP5.

3. References:
Pars site need after the collection of links and identifying new products.
It should be stored in a reference file avito.txt, which continually add new links found on the site. Identification of new products is done by comparing the found links with existing file avito.txt. New links must be parsed, and has repeated skipped. After parsing the link should be recorded in a file avito.txt, for further comparison.
File format avito.txt:
[Link] [link to the date of detection of the site]

4. The format parsed information:
Parsed ads should be stored in text format in the umbilical cord:
[Source] [Agenstsvo or Private] [phone] [area] [Street] [House] [Number of rooms] [total area] [living space] [kitchen area] [floor] ; [storeys] [material] [price] [contact] [Comment], [link]
avito;Частное;83482743;Ленинский;Ватутина;12;2;54;35;8;2;5;кирпич;2700;Игорь Petrov; renovated; http: //

Note: all ads first field is always «avito».
If information is not provided (for example, no house numbers), you must leave the field blank, ie, leave semicolon.

5. Report of the parser:
Upon completion of the parser is necessary to generate a report about parsed information in the form (report - a text file with one line):
[Number of references at the time of parsing] [amount collected links] [number of new products] [number rasprasennyh novelties]
22437; 22421; 146; 145
Report formirovt better in a separate folder «Reports», and his name should correspond to the date and time of report generation in such a way (for example, the formation of a report in 2014 on March 12 the time - 14 hours 30 minutes 15 seconds): 20140312_143015
[Year] [month] [date] _ [hours] [minutes] [second]


No feedback yet.
In order to counter copyright infringement and property rights, we ask you to immediately inform us at the fact of such violations and to provide us with reliable information confirming your copyrights or rights of ownership. Email must contain your contact information (name, phone number, etc.)