
The AI uBlock Origin Blacklist is a personal filter list for uBlock Origin that blocks AI content farms. Contributions are encouraged. There are two ways to install it: subscribe through the provided link (which requires uBlock Origin to be installed), or import the list's URL as a third-party list. The list targets websites that consist mainly of AI-generated text, offer little useful information, and are flooded with ads and referral links.
The core idea is simple: when searching online, human-written content is preferred over AI-generated content because it tends to offer more depth, insight, creativity, and reliability. AI-generated content carries risks, especially on content farms where nothing is checked before publication, so these sites may inadvertently publish harmful advice or misinformation. Entries are curated manually rather than by automated methods, so that each listed page has actually been verified as AI-generated.
Some may argue that the list is limited in scope, but it has proven effective: the sites it blocks rely on aggressive SEO tactics and would otherwise appear prominently in search results. The entries are currently biased towards certain regions, which is why contributions from a more diverse set of users are needed to broaden the blacklist’s coverage.
There are two ways to add websites to the list. Non-technical users can report a suspicious site by creating an issue. Users familiar with GitHub can follow the provided guidelines to identify offending domains or specific pages and add them to the list file directly. Recognisable patterns of content farms are outlined to help identify candidates for the blacklist.
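As an illustration of turning a reported URL into a list entry, here is a minimal Python sketch. It assumes entries are written as standard uBlock Origin network filters of the form `||host^$document`; the summary above does not state the list's actual rule format, so check the list file and its guidelines before copying this. The function names `domain_rule` and `page_rule` are hypothetical helpers, not part of the project.

```python
from urllib.parse import urlsplit


def _split(url: str):
    """Parse a URL, tolerating input without a scheme (e.g. 'example.com/x')."""
    return urlsplit(url if "://" in url else "//" + url)


def _host(url: str) -> str:
    """Return the lowercased hostname without a leading 'www.'."""
    host = _split(url).hostname
    if not host:
        raise ValueError(f"no hostname in {url!r}")
    return host[4:] if host.startswith("www.") else host


def domain_rule(url: str) -> str:
    """Filter for a whole domain (assumed format: ||host^$document)."""
    return f"||{_host(url)}^$document"


def page_rule(url: str) -> str:
    """Filter for one page only; query string and trailing slash are dropped."""
    path = _split(url).path.rstrip("/")
    return f"||{_host(url)}{path}^$document"


if __name__ == "__main__":
    print(domain_rule("https://www.Example.com/blog/post?id=1"))
    print(page_rule("https://example.com/blog/ai-post/"))
```

A whole-domain rule suits sites that are entirely AI-generated, while a page rule is the more cautious choice when only part of a site is affected.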
Google Dorks and SEO backlink spreadsheets are also suggested as efficient ways to find AI-generated pages. Sites found through these methods should be checked carefully before they are added, to avoid false positives. Similar projects that block AI-related results are mentioned, and readers are invited to contribute any completely AI-generated websites they come across.
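A Google Dork here is simply a search query that uses operators such as `site:` and quoted exact phrases. The sketch below builds such a search URL in Python. The telltale phrase and the helper name `dork_url` are assumptions chosen for illustration; the source does not specify which queries to use.

```python
from urllib.parse import urlencode


def dork_url(phrase: str, site: str = "") -> str:
    """Build a Google search URL for an exact phrase, optionally limited to one site."""
    query = f'"{phrase}"'  # quotes request an exact-phrase match
    if site:
        query = f"site:{site} {query}"  # restrict results to a single domain
    return "https://www.google.com/search?" + urlencode({"q": query})


if __name__ == "__main__":
    # Hypothetical phrase that sometimes survives in unedited AI output.
    print(dork_url("as an AI language model", site="example.com"))
```

A match on such a phrase is only a hint: a page may be quoting or discussing AI output, so each result still needs a manual look before it goes on the list.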