How to scrape TripAdvisor content using YQL
I am quite new to web scraping, so this question might be a little simple, but it bothers me a lot. I want to scrape content from TripAdvisor, but when I run the following command in YQL, it returns nothing.
select * from html where url="http://www.tripadvisor.com/search?q=sunny+relax&geo=191#&ssrc=a&o=0.html"
Can anyone tell me why? Is there something wrong with my command?
Thank you in advance for your kind help.
It is because the "/search" page is disallowed in http://www.tripadvisor.com/robots.txt, and YQL checks robots.txt before fetching a page.
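If you want to check this kind of rule yourself, here is a minimal Python sketch using the standard library's urllib.robotparser. The SAMPLE_ROBOTS text is a made-up excerpt that only assumes "/search" is disallowed, as described above; the real robots.txt may contain different rules. The is_allowed helper is a hypothetical name used for illustration.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical excerpt for illustration only; the site's real robots.txt
# may differ. It assumes "/search" is disallowed for every user agent.
SAMPLE_ROBOTS = """\
User-agent: *
Disallow: /search
"""


def is_allowed(robots_text, url, user_agent="*"):
    """Return True if robots_text permits user_agent to fetch url."""
    parser = RobotFileParser()
    # parse() takes an iterable of lines, so no network access is needed.
    parser.parse(robots_text.splitlines())
    return parser.can_fetch(user_agent, url)


search_ok = is_allowed(
    SAMPLE_ROBOTS,
    "http://www.tripadvisor.com/search?q=sunny+relax&geo=191",
)
hotels_ok = is_allowed(
    SAMPLE_ROBOTS,
    "http://www.tripadvisor.com/hotels-g45963-las_vegas_nevada-hotels.html",
)
print(search_ok, hotels_ok)
```

Under that assumed rule, the search URL is rejected and the hotel listing URL is permitted, which would match the empty result you are seeing.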
You can try another page and use XPath to select nodes, e.g.:
select * from html where xpath = '//div[@class="listing_title"]/a' and url = 'http://www.tripadvisor.com/hotels-g45963-las_vegas_nevada-hotels.html'
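To see what that XPath picks out, here is a rough Python sketch that applies the same expression to a small, made-up document with the standard library's xml.etree.ElementTree. The SAMPLE_HTML markup, hotel names, and links are invented for illustration and are not taken from the real page. ElementTree only supports a subset of XPath and needs well-formed input, so a real page would probably need a more forgiving HTML parser.

```python
import xml.etree.ElementTree as ET

# Invented, well-formed markup that mimics the structure the XPath expects.
# The real page's markup is an assumption and is likely more complex.
SAMPLE_HTML = """\
<html><body>
  <div class="listing_title"><a href="/example-1.html">Example Hotel One</a></div>
  <div class="other"><a href="/ignored.html">Not a listing</a></div>
  <div class="listing_title"><a href="/example-2.html">Example Hotel Two</a></div>
</body></html>
"""

root = ET.fromstring(SAMPLE_HTML)

# Same selection as //div[@class="listing_title"]/a, written relative to
# the root element because ElementTree paths must not start with "//".
links = root.findall(".//div[@class='listing_title']/a")

titles = [a.text for a in links]
hrefs = [a.get("href") for a in links]
print(titles)
print(hrefs)
```

Only the anchors directly inside a div with class "listing_title" are returned; the link in the "other" div is skipped.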