Find Jobs
Hire Freelancers

Crawler that takes products from any site in any page!

$30-250 USD

종료됨
게시됨 7년 이상 전

$30-250 USD

제출할때 지불됩니다
Hi, I need a crawler that knows to eat products from any site/page (the site owner knows about it!): 1. You get the URL of the homepage, parse it with ganon, find all the images of the homepage and their links that should be the SRC of the closest A parent backwards 2. You group the found images by one of these options IN THIS ORDER (the first that is applicable): a) Grouped by [login to view URL] b) Grouped by [login to view URL] + [login to view URL] if there are into the IMG TAG (don't try to reach width and height in PHP at this stage) c) Grouped by number of directory levels (/) into [login to view URL] && number of directory levels (/) into [login to view URL] d) Grouped by margin of similar_text of [login to view URL] && margin of similar_text of [login to view URL] e) Grouped by [login to view URL] + [login to view URL] after reaching width and height of the images via PHP 3. You start with the group of images that have the higher number of images, and you look backwards for the price and the name of the item after checking that the name looks like [login to view URL] (in case of doubt open the item page and take the TITLE but it should be rare!) 4. When you find 3 images of the same image's group that have url, name and price, that's all! You will stop the headaches and start to search in the homepage and then in all the sites ONLY the CSS classes that you have just found now 5. For every product, you find the tags from CONCAT([login to view URL], '', [login to view URL], '', NAME) - a word that appears at least twice is a tag... You release first homepage results ASAP and IN PARALLEL you can run a script that takes other links from the sitemap or from this analyze where you already have a ganon object that will give you all the links of the page easily and quickly Good luck!
프로젝트 ID: 12094258

프로젝트 정보

11 제안서
원격근무 프로젝트
활동 중 7년 전

돈을 좀 벌 생각이십니까?

프리랜서 입찰의 이점

예산 및 기간 설정
작업 결과에 대한 급여 수급
제안의 개요를 자세히 쓰세요
무료로 프로젝트에 신청하고 입찰할 수 있습니다
11 이 프로젝트에 프리랜서들의 평균 입찰은 $286 USD입니다.
사용자 아바타
Hi there! We are a team of qualified and professional Web application developers. Our expertise are PHP and javaScript based CMS and frameworks. We are dedicated to provide high end UI/UX design, reactive and fast running web applications with stable backend logics. All this we achieve through Wordpress, Joomla, Magento, Reaction commerce, Laravel 5.x, Codeigniter 3.1.x, MEAN STACK, Meteorjs, Angularjs 2.0, Reactjs, Backbonejs, Amberjs, Jquery, Ajax, Polymer Material Design, Bootstrap, CSS3, HTML5 and Javascript various libraries. We are curious to chat with you for further discussion. You may ping us any time. Thanks
$222 USD 9일에
4.9 (26 건의 리뷰)
5.5
5.5
사용자 아바타
Hello, I've read your project notes, I got what do you exactly want, I can make the software in php/mysql/JQuery/CSS. Also The application will be have an admin panel to you can be able to view the grabbed contents. Please ask me if there is any question. Thank You.
$140 USD 3일에
4.9 (22 건의 리뷰)
4.9
4.9
사용자 아바타
Hi, Thanks for the opportunity. As per your requirement, i would like to tell you that I have a very strong experience of more than 7 years in this field of design and development. Please spare a moment to discuss this project. Waiting for your message Thanks
$133 USD 4일에
4.6 (49 건의 리뷰)
5.3
5.3
사용자 아바타
Hi We are good with web mining and crawling data from http using java and other language coding. Yes we can classify the mine data and further result can be store and produce the report. Chat more please.
$200 USD 3일에
4.2 (4 건의 리뷰)
1.7
1.7

고객에 대한 정보

국기 (ISRAEL)
Petach Tikva, Israel
4.9
9
결제 수단 확인
9월 25, 2016부터 회원입니다

고객 확인

감사합니다! 무료 크레딧을 신청할 수 있는 링크를 이메일로 보내드렸습니다.
이메일을 보내는 동안 문제가 발생했습니다. 다시 시도해 주세요.
등록 사용자 전체 등록 건수(일자리)
Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)
미리 보기 화면을 준비 중...
위치 정보 관련 접근권이 허용되었습니다.
고객님의 로그인 세션이 만료되어, 자동으로 로그아웃 처리가 되었습니다. 다시 로그인하여 주십시오.