I am working on a simple static website that gives visitors basic information about myself and the work I do. I want to use it as a way to introduce myself to potential clients, collaborators, etc., rather than rely solely on LinkedIn as my visiting card.

This may sound rather oxymoronic, given that I am literally going to be placing (some relevant) details about myself and my work on the internet, but I want to limit access to the website by bots, web scrapers, and content collection for LLMs.

Is this a realistic expectation?

Also, any suggestions on privacy-respecting yet inexpensive domains that I can purchase in Europe would be a great help.

  • refalo@programming.dev
    5 months ago

    Blocking non-Mozilla user agents has eliminated 99% of scraping in my experience. I’ve seen a few larger sites do it as well but not many.
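
For anyone wanting to try the approach from the comment above, here is a minimal sketch of a User-Agent filter written as Python WSGI middleware. It assumes that "non-Mozilla" means a `User-Agent` header that does not start with `Mozilla/`, and that the site is served through a WSGI app. For a purely static site, the same check would more likely live in the web server or CDN configuration. The names `is_browser_like` and `BlockNonMozilla` are made up for illustration.

```python
def is_browser_like(user_agent):
    """Return True if the User-Agent header starts with 'Mozilla/'.

    Assumption: this prefix check is what "non-Mozilla" means in the comment.
    Mainstream browsers send a User-Agent that begins with 'Mozilla/'.
    """
    return bool(user_agent) and user_agent.startswith("Mozilla/")


class BlockNonMozilla:
    """Hypothetical WSGI middleware: answer 403 unless the UA looks like a browser."""

    def __init__(self, app):
        self.app = app

    def __call__(self, environ, start_response):
        # WSGI exposes the User-Agent request header as HTTP_USER_AGENT.
        user_agent = environ.get("HTTP_USER_AGENT", "")
        if not is_browser_like(user_agent):
            start_response(
                "403 Forbidden",
                [("Content-Type", "text/plain; charset=utf-8")],
            )
            return [b"Forbidden\n"]
        # Browser-like request: hand it to the wrapped application unchanged.
        return self.app(environ, start_response)
```

Wrapping an existing WSGI app would look like `app = BlockNonMozilla(app)`. The header is fully controlled by the client, so a scraper that sends a browser-like User-Agent will pass this check. It only filters out clients that identify themselves honestly or use a default library User-Agent.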