I would disagree with that idea that the vast amount of irrelevant and distracting content is somehow the problem.
Believe it or not it's an incredibly challenging problem. Google is burned at the stake everyday for their biased response on searches when they're parsing disgusting amounts of data and attempting to minimize noise. It's a damned if you do and damned if you don't honestly, because even if you can effectively minimize returns for a given query (regardless of topic), there is then information loss to some degree and therefore arguably a bias.
Believe it or not it's an incredibly challenging problem. Google is burned at the stake everyday for their biased response on searches when they're parsing disgusting amounts of data and attempting to minimize noise. It's a damned if you do and damned if you don't honestly, because even if you can effectively minimize returns for a given query (regardless of topic), there is then information loss to some degree and therefore arguably a bias.