Scraping and indexing 1.2B emails for under $200

  • Is nobody going to mention how this is a bad thing intended for sending spam? Guess I'll have to be that person then.

  • Took me a while to understand they were scraping email addresses and not actual emails.

  • Wow, the author really calls out his competition! There are also parts 2 and 3 to this article that discusses using Rust and Postgres for their solution.

  • Help rid the world of spam! Project Honey Pot is our friend. https://www.projecthoneypot.org/

  • I must be missing something because 6.5 days * $21/day = $136.5

    """

    The entire process now took 6.5 days and cost $21/day. Our total cost all said and done was $115!

    """

  • Nice writeup.