Poor Man’s Google Scrape Technique

How to scrape for links on Google using notepad..

This requires cygwin or some sorta shell interpreter with grep/sed capability.

1) Search for whatever you wish.  Make sure you get 100 results per page (under “advanced search”)

2) Copy paste the entire screen

  • Ctrl+A  - this selects all text
  • Ctrl+C – copies text onto buffer
  • Open Notepad (or your fav. text editor), press Ctrl + V – pastes text onto editor
  • Save (i.e. urls.txt).

3) Run this command on the file

 grep “Cached” urls.txt | sed -e ‘s/ – .*//g’ -e ‘s/\?.*//g’ -e ‘s/^      //g’ | sort | unq

 

Voila

Related posts:

  1. How I built RedMol.com

Leave a Reply

Your email address will not be published. Required fields are marked *

*


You may use these HTML tags and attributes: <a href="" title=""> <abbr title=""> <acronym title=""> <b> <blockquote cite=""> <cite> <code> <del datetime=""> <em> <i> <q cite=""> <strike> <strong>