Re: now ~CT-infested -- Re: Q: For Sy Liebergot (Yeah, it's new safe CT-free topic fodder!)



On 2005-05-05, Stuf4 <tdadamemd-spamblock-@xxxxxxxxxx> wrote:
>>>From dps:
>> So how did I find 230 references to "oxygen tank"?
>
> In my original statement that "I have not been able to find a single
> source of searchable text of that report", what I meant was that:
>
> The report has not been crawled and indexed by popular search engines,
> so it is lacking significant accessibility.

A common cry in the library world these days is that people expect
everything to be available on Google. I hadn't realised quite how much
some took this for granted, though...

[A 2003 study estimated the "deep web" - the dynamic stuff, often
ignored by search engines or hidden in databases - to be about 500 times
the size of the "surface web". Even web-accessible data is very often
not searchable by a major engine...]

That post I wrote yesterday? I skimmed 3-500 pages of pdf'ed text, and
read about a hundred in detail. It's not difficult.

--
-Andrew Gray
andrew.gray@xxxxxxxxxxxxx
.



Relevant Pages

  • Re: Reporting Stolen Content
    ... >> Sometimes the better place to report stolen content and copyright ... Isn't Blogspot part of Google now? ... >> Search engines will probably just drop the reported site from their ... help lower search engine dependence. ...
    (alt.internet.search-engines)
  • Re: Strange Google
    ... WILL look at the others) then you start grovelling to Google. ... I'd suggest you fix things PDQ before the other search engines ban you. ... In case a report has been filed to several search engines already (the ban is ...
    (alt.internet.search-engines)
  • Re: Site Reporting
    ... engine report, and their further activity is reported as originating ... The situation is identical with pages and hits originating from links. ... is it safe to assume that the real figure is 60% search engines, ...
    (alt.internet.search-engines)
  • Re: Site Reporting
    ... engine report, and their further activity is reported as originating ... The situation is identical with pages and hits originating from links. ... is it safe to assume that the real figure is 60% search engines, ...
    (alt.internet.search-engines)
  • Site Reporting
    ... engine report, and their further activity is reported as originating ... The situation is identical with pages and hits originating from links. ... if I have 15% orginating from Search engines and 10% ...
    (alt.internet.search-engines)

Loading