Extracting Google-Indexed Web Site Pages Using MS Excel - Understanding the Google Search Result
(Page 2 of 4 )
When you visit http://www.google.com to find the indexed pages of your website, it is not as easy as typing in your domain name and let Google provide you a complete list of indexed pages. You have to use proper syntax in order for Google to give the complete list of URLs.
The indexed pages of your domain can be determined by entering this syntax in the Google search box and then pressing "Google Search" site:domainname.com
For example, if your website is http://www.thisisyourwebsite.com , you have to enter it as: site:thisisyourwebsite.com (not including the www).
We do not include the www in the search query so we can capture all URLs, including the other sub-domains in your website (not only those that start with "www").
Below is the screen shot of the Google search result (circled in red are the indexed URLs which you will need to extract; the other results are irrelevant to your site):
Also note that the Google search result by default shows only the first ten results. Extracting the indexed URLs of your site from the Google search result to your Excel spreadsheet involves the copy and paste method. So we need to change this to the maximum (which is 100), which will speed up the process. Follow this simple procedure:
Step 1: Enter site:thisisyourwebsite.com in the Google search box.
Step 2: Beside the Google search box, click "preferences."
Step 3: In the section for number of results, change it from 10 to 100.
Step 4: Click "Save Preferences."
Next: The Process >>
More BrainDump Articles
More By Codex-M