Re: now ~CT-infested -- Re: Q: For Sy Liebergot (Yeah, it's new safe CT-free topic fodder!)
- From: "Stuf4" <tdadamemd-spamblock-@xxxxxxxxxx>
- Date: 8 May 2005 08:56:19 -0700
>>From Andrew Gray:
> On 2005-05-05, Stuf4 <tdadamemd-spamblock-@xxxxxxxxxx> wrote:
> >>From dps:
> >> So how did I find 230 references to "oxygen tank"?
> >
> > In my original statement that "I have not been able to find a
single
> > source of searchable text of that report", what I meant was that:
> >
> > The report has not been crawled and indexed by popular search
engines,
> > so it is lacking significant accessibility.
>
> A common cry in the library world these days is that people expect
> everything to be available on Google. I hadn't realised quite how
much
> some took this for granted, though...
>
> [A 2003 study estimated the "deep web" - the dynamic stuff, often
> ignored by search engines or hidden in databases - to be about 500
times
> the size of the "surface web". Even web-accessible data is very often
> not searchable by a major engine...]
Of course Google does not come close to canvasing the entire web. This
is the battle cry of all their competitors. And yes, we could make
those stats look even more pitiful if we were to take into account
blogs and such...
But what we are talking about here is not some dynamic document that
was created yesterday. We are talking about an official report that
was published in 1970. A report that was to investigate the wasting of
millions of US taxpayer dollars (assuming that those tax dollars were
spent on actually landing). We're talking about a report on an event
that was made into a megabuck Hollywood movie.
....yet no one, as far as I'm aware, has made the contents of this
report available through simple popular search.
> That post I wrote yesterday? I skimmed 3-500 pages of pdf'ed text,
and
> read about a hundred in detail. It's not difficult.
Perhaps you'd like to demonstrate how a person can do a search on
certain keywords and get a hit from the contents of the Apollo 13
report.
I've found a few pages of front matter that have been made crawlable.
But that's far from the majority of the report. And that's far from
the meat of the report.
The goal I was suggesting was to have the entire report crawled for
anyone doing a simple search. As it stands today, those people have to
do extra work to get to it. (And that's assuming that they know that
it's there in the first place.)
~ CT
.
- Follow-Ups:
- References:
- Re: now ~CT-infested -- Re: Q: For Sy Liebergot (Yeah, it's new safe CT-free topic fodder!)
- From: Stuf4
- Re: now ~CT-infested -- Re: Q: For Sy Liebergot (Yeah, it's new safe CT-free topic fodder!)
- From: Stuf4
- Re: now ~CT-infested -- Re: Q: For Sy Liebergot (Yeah, it's new safe CT-free topic fodder!)
- From: Rusty
- Re: now ~CT-infested -- Re: Q: For Sy Liebergot (Yeah, it's new safe CT-free topic fodder!)
- From: Stuf4
- Re: now ~CT-infested -- Re: Q: For Sy Liebergot (Yeah, it's new safe CT-free topic fodder!)
- From: snidely
- Re: now ~CT-infested -- Re: Q: For Sy Liebergot (Yeah, it's new safe CT-free topic fodder!)
- From: Scott Hedrick
- Re: now ~CT-infested -- Re: Q: For Sy Liebergot (Yeah, it's new safe CT-free topic fodder!)
- From: Pat Flannery
- Re: now ~CT-infested -- Re: Q: For Sy Liebergot (Yeah, it's new safe CT-free topic fodder!)
- From: Jorge R. Frank
- Re: now ~CT-infested -- Re: Q: For Sy Liebergot (Yeah, it's new safe CT-free topic fodder!)
- From: snidely
- Re: now ~CT-infested -- Re: Q: For Sy Liebergot (Yeah, it's new safe CT-free topic fodder!)
- From: Stuf4
- Re: now ~CT-infested -- Re: Q: For Sy Liebergot (Yeah, it's new safe CT-free topic fodder!)
- From: Andrew Gray
- Re: now ~CT-infested -- Re: Q: For Sy Liebergot (Yeah, it's new safe CT-free topic fodder!)
- Prev by Date: Re: 3/4 OT: Hitchhiker's Guide to the Galaxy?
- Next by Date: Re: now ~CT-infested -- Re: Q: For Sy Liebergot (Yeah, it's new safe CT-free topic fodder!)
- Previous by thread: Re: now ~CT-infested -- Re: Q: For Sy Liebergot (Yeah, it's new safe CT-free topic fodder!)
- Next by thread: Re: now ~CT-infested -- Re: Q: For Sy Liebergot (Yeah, it's new safe CT-free topic fodder!)
- Index(es):
Loading