Break Google captcha

Here I’m going to describe how I managed to break the Google captcha, or rather “automatically bypass” it, so that one of my online tools (Google Parser) can run a lot of requests without my intervention.

You probably know the Google “Sorry” error page (HTTP 503). Below I explain how to solve and bypass it.

First: What’s the problem to solve?

I have an online tool that sends requests to Google and collects the search results. When it sends too many requests, Google bans it, and I have to type the letters of the Google captcha before it can continue making requests.

[Image: Google Error]


Second: How does the Google captcha ban work?

In a few words: when Google receives a lot of requests from the same IP (many other variables are involved too), it assumes that the requests are being made by an automated script or spyware. Google then bans that IP until you type the letters of the captcha. If you type the correct letters, Google returns a cookie that means “I’m a human, give me the search results”, and then you can continue making Google requests.
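
Seen from the tool’s side, the ban shows up as a recognizable response. Here is a minimal sketch in Python of how a script could classify it. The 503 status is the one mentioned in this post; the function name `looks_blocked` and the `/sorry/` path marker are assumptions made for illustration, and a 503 can of course also mean an ordinary server error, so treat this as a heuristic.

```python
from urllib.parse import urlparse

# Assumed marker of the block page; this post only confirms the 503 status.
BLOCK_PATH_MARKER = "/sorry/"


def looks_blocked(status, final_url, location=None):
    """Return True if a response looks like the captcha/ban page.

    status:    HTTP status code of the response
    final_url: URL the response was served from
    location:  value of the Location header, if the response was a redirect
    """
    # The "Sorry" error page is served with status 503.
    if status == 503:
        return True
    # The response itself was served from the (assumed) block path.
    if BLOCK_PATH_MARKER in urlparse(final_url).path:
        return True
    # A redirect that points at the (assumed) block path.
    if status in (301, 302, 303, 307) and location:
        return BLOCK_PATH_MARKER in urlparse(location).path
    return False
```

A normal `200` response from a search URL is reported as not blocked, while a `503`, or a redirect whose target path contains the marker, is reported as blocked.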


Third: Programming the solution…

The solution to “break Google captcha” is neither difficult nor brilliant: show the captcha to the person using the tool, let them type the letters, send the answer to Google, and save the cookie to continue with the requests.
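
The flow just described can be sketched as a small loop. This is only an outline under stated assumptions, not the code of the tool: the function `fetch_with_human_captcha` and its four callables (`fetch`, `is_blocked`, `ask_user`, `submit_answer`) are hypothetical names, and the HTTP details are deliberately left to whatever those callables do.

```python
def fetch_with_human_captcha(fetch, is_blocked, ask_user, submit_answer,
                             max_attempts=3):
    """Fetch a page; when it is blocked, let the tool's user solve the captcha.

    fetch():                      performs the request, returns a response
    is_blocked(resp):             True if resp is the captcha page
    ask_user(resp):               shows the captcha to the user and returns
                                  the letters they typed
    submit_answer(resp, letters): sends the letters back; the HTTP session
                                  behind it keeps any cookie it receives
    """
    resp = fetch()
    attempts = 0
    while is_blocked(resp):
        if attempts >= max_attempts:
            raise RuntimeError(
                "captcha not solved after %d attempts" % max_attempts
            )
        letters = ask_user(resp)      # a human reads and types the letters
        submit_answer(resp, letters)  # the answer goes back to the server
        attempts += 1
        resp = fetch()                # retry with the cookie now saved
    return resp
```

Passing the steps in as callables keeps the loop independent of the HTTP library and of how the captcha is displayed, so the same loop works whether the user interface is a web page or a console prompt.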

[Image: Google captcha defeated]

This is the final solution, and it is very simple, but getting there wasn’t. To make it work I had to be very careful with the details of the HTTP requests and beat some Google tricks.


Fourth: The results…

Now the script is running. It can handle any number of requests, with no time or count limit, and the Google captcha is no longer a problem. :D


The phrase “break Google captcha” isn’t the most accurate for this, but I used it because this post is part of my SEO research too…

    • Kamil Przeorski
    • February 26th, 2008

    Hello,

    In the case you described, Google sometimes blocks your search without the possibility of entering a captcha to unblock it. Do you have any idea how to solve this problem? Does Google have many datacenters, each with different data about blocked IPs?

    Best Regards,
    Kamil

  1. Hi Kamil, I found some Google error responses without the possibility of entering a captcha, but I realized that this happens only for some combinations of country and language; with other combinations it always returns the error page with the captcha.

    As for the other question, Google has many datacenters but they are synchronized, although the synchronization can take more or less time… In the case of the blocked IPs, I think Google keeps them all centralized. They could be distributed across several servers or kept on just one, but I don’t think it is a large enough amount of data to need many datacenters. This is just my opinion, I don’t know for sure, I have never been inside Google’s internal network… he…

    • Kamil Przeorski
    • February 28th, 2008

    Did you try using the Google Search API? Are there any limits?

    Something interesting about this topic can be found here: http://www.fiftyfoureleven.com/weblog/web-development/programming-and-scripts/apis/google-search-api

    Yahoo’s : http://api.search.yahoo.com/WebSearchService/V1/webSearch?appid=YahooDemo&query=53x%20all%20day&results=
    :P

    Kamil

  2. Kamil, I tried the Google AJAX search API, it has too many limitations.
    I didn’t use the previous SOAP API because by the time I wanted to use it Google was no longer giving out new API keys.

    Thanks for the links, I will check them.

    BTW: I modified your second link a little to prevent problems with my advertisers…

    • Kamil Przeorski
    • February 29th, 2008

    “I tried the Google AJAX search API, it has too many limitations”

    What limitations do you mean?

    Kamil

  3. Basically, a limit on the number of results and no pagination over the rest of the results.
    A year ago when I used this API, you could only get the first 8 results for a specific search and you couldn’t paginate over the rest.

    So if you don’t need the rest of the search results this API could be good for you.

    I just read that the Google AJAX search API now gives you the first 32 results, with pagination only over them. The same kind of limit, just with a few more results.

    • Milan
    • July 7th, 2008

    How do you ensure that the saved cookie is for the same session? Actually, what do you have to save in the cookies in order to continue searching? I guess you use the curl library?

    Regards,
    Milan

  4. Milan, you are right.
    I use the Curl library for the HTTP requests; with it I can use the same cookie for several GETs or POSTs.

    To represent a session I just keep using the same cookie; when I want to start a new session I delete the old cookie, and that’s it.
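
    The same idea can be sketched with Python’s standard library in place of Curl (a swapped-in technique, not the tool’s actual code). The class name `CookieSession` and its methods are hypothetical: one cookie jar is shared by every request, and clearing the jar is what starts a new session.

```python
import http.cookiejar
import urllib.request


class CookieSession:
    """One cookie jar shared by every request made through this object."""

    def __init__(self):
        self.jar = http.cookiejar.CookieJar()
        self.opener = urllib.request.build_opener(
            urllib.request.HTTPCookieProcessor(self.jar)
        )

    def get(self, url, timeout=10):
        # Cookies stored from earlier responses are sent automatically,
        # and any new Set-Cookie headers are saved into the jar.
        # Note: urllib raises urllib.error.HTTPError for 4xx/5xx responses.
        with self.opener.open(url, timeout=timeout) as resp:
            return resp.status, resp.read()

    def reset(self):
        # Deleting the stored cookies is what starts a "new session".
        self.jar.clear()
```

    Every call to `get` on the same object reuses the cookies received so far; calling `reset` drops them, so the next request starts without any cookie.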

    Cheers

      • Roy
      • January 27th, 2011

      When you delete the old cookie, would that not prevent the 302 moved page (captcha page) altogether?

    • Milan
    • July 8th, 2008

    Thanks for the quick response!

    Is this tool (Google Parser) going to be available for download? I see a download section but no source code yet :( I think it would be a great PHP/curl, and maybe regex, learning lesson.

    I would like to develop something similar. It should run multiple queries in batch mode. It doesn’t have to be fast but it should be stable. In your experience, if the queries run with a timeout of 2 or 3 seconds between them, would I still face the captcha? I would like to develop a similar way for the user to fill in the captcha input and continue with the queries. The problem is that the processing script should run in the background, not in the same script that generates the user interface.

    Regards

    • Milan
    • July 8th, 2008

    One more question ;)

    In order not to get the captcha so often, is it better to keep the same session (cookie) or to open a new one as often as possible?

    Thanks again

  5. To avoid the Google Sorry error page (503) it is better to keep the same cookie, but not for too long; don’t use a new one for every HTTP request either.

    • nicky
    • April 21st, 2010

    Can you give us the gooparser code?

    Is it PHP?

    • nicky
    • April 21st, 2010

    To do this I had to be very carefully in the details of the HTTP requests and beat some Google tricks.

    <–

    Can you tell me how?

    I always get Bad Request when I send requests to Google :(

  6. Hi everyone,
    I want to bypass the captcha on the following page:
    http://www.google.com/mobile/youtube/
    When we open this page and click on the SEND TO PHONE option, Google can send an SMS to the number we provide 5 to 10 times.
    After sending 10 messages it shows us a captcha.
    I want to bypass this captcha using VB6.
    If possible, let me know.
    Thanks

    • enigma1
    • May 3rd, 2011

    I remember this problem, but I also saw it recently with some search queries. It comes up right away, I guess because of the particular phrase I entered.

    I was able to bypass it using the advanced search, which also does a GET, but I think some other parameters are responsible for skipping the captcha.
