Replik auf Library Hi Tech paper

Michael K. Bergmans Replik “The murky depths of the ‘deep web’” auf unser Invisible Web paper.

As noted, I generally agree with these criticisms. For example, since the time of original publication, we have seen the power distribution nature of most things on the Internet, including popularity and traffic. Exponential distributions will always result in overestimates using calculations based on means rather than medians. I also think that meaningful content types were both overused (more database-like records) and underused (PDF content that is now routinely indexed) in my original analysis.

However, the authors’ third criticism is patently wrong, since three different methods were used to estimate internal database record counts and the average sizes of each record they contained. …

0 Responses to “Replik auf Library Hi Tech paper”


  1. No Comments

Leave a Reply

You must login to post a comment.