Infopost | 2024.08.09
|
Vincent Schmalbach | Google is no longer trying to index the entire web. In fact, it's become extremely selective, refusing to index most content. This isn't about content creators failing to meet some arbitrary standard of quality. Rather, it's a fundamental change in how Google approaches its role as a search engine. |
crazygringo |
This post seems to be based entirely on personal anecdotal experience. There isn't a shred of hard data to support the headline claim that Google now "defaults to not indexing content". Google never indexed everything, removing duplicates, blogspam, useless pages, etc. Maybe they've changed their thresholds or maybe not. But this post provides zero evidence of anything. It's pure speculation without any facts at all. |
Vincent Schmalbach |
You're right, this post is based on personal anecdotal experience. I have access to Google Search Console data for over 100 websites, and most have many pages in the "Discovered - currently not indexed" and "Crawled - currently not indexed" categories, despite ranking well for some keywords and getting traffic. This wasn't the case 10 years ago. Regarding "Google never indexed everything" - I'd say it came close. They did manual de-indexing for heavy spam sites and would even send an email when they did this. Apart from that, nearly everything was in the index, including duplicates. De-duplication happened at the ranking stage, not the indexing stage. |
aiauthoritydev |
I think the problem is not Google specific rather the internet has grown far too large with too much of crap floating around. Google, in my opinion has done the best job of getting relevant information followed by Reddit. While OpenAI etc. is pretty good (so does Google Gemini) what is OpenAI like interfaces prevent me from doing is to segue from a focused topic to related areas to discover knowledge on the periphery, which is the most important aspect of learning in my opinion which chatbots today are not able to do that well. |
◄ |
2024.08.05
Second lapRemnant II and BG3 are just as good the second time around. |
2024.08.18
CoastsCatalina and then some gaming in NoVa. |
► |
2024.05.06
WanderingPurposeful and aimless walks into the internet. |
2023.11.20
SubsurfaceLinks pages, webrings, and search. |
2023.07.23
A walk in the dark forestAll of the internet in one short stroll. |
www.vincentschmalbach.com
Google Now Defaults to Not Indexing Your Content - Vincent SchmalbachPicture this: It's ten years ago, and you've just launched a new WordPress blog. Within hours, sometimes even minutes, your content is indexed by Google. |
gehrcke.de
Google changes: recently I see more of "discovered - currently not indexed" - Jan-Philip Gehrcke, PhDHas Google become more conservative with indexing content of personal websites? I think we might see less and less low-traffic quality contents in Google search results. I have carefully done basic... |
www.theverge.com
How Google perfected the webGoogle has dominated the search market for decades, leading to a web filled with SEO-driven content. With generative AI on the horizon, this could all come crashing down. |