|
|
A UFOdexSM Search works just like
a typical web search. When we find a book, video, image, periodical or other category
of UFO information whose content contains a match for your search terms, we will
link to it in your search results. Clicking on a result, you will be able to see
everything from a few short excerpts to a limited number of book pages, depending
on a few different factors. For the first several years the goal of UFOdexSM
will be to continue to gather and integrate more and more UFO knowledge online,
there is no expectation of a profit. Contact: Ryan Wood at
rswood@igc.org
A UFOdexSM Search works just like
web search. When we find a book, video, image, periodical or other class of UFO
information whose content contains a match for your search terms, we will link to
it in your search results. Clicking on a result, you will be able to see everything
from a few short excerpts to a limited number of book pages, depending on a few
different factors.
Although UFOdexSM does connect you
to the current web like conventional search engines, it provides much, much more
for the serious searcher. UFOdexSM also connects
you to the Deep-Web (see "What is the Deep-Web below) where there resides a much
larger repository of information relevant to UFOs. UFOdexSM
captures material from the National Archives, Library of Congress, FBI files, Presidential
Libraries and related archival resources (Archives Search). In the Periodical Search
all the UFO journals, magazines, newspapers are searched to find the requested content.
With the Image Search feature, all the images that match the criteria are displayed
including known UFO content advertising and comics. In the Audio Search the audio
content has been transcribed and indexed so that the relevant two minute audio section
is displayed along with associated key text. The Video Search works in much the
same way displaying both the text and associated two-minute video snapshot.
Of special significance, UFOdexSM accesses a
collection of unique, often out-of-print books, periodicals and rare UFO materials.
UFOdexSM will provide you search results within
these materials, then assist you in obtaining the full text of these books and periodicals
where they may be still available in hard-to-find sources.
Searching on the Internet today can be compared to dragging a net across
the surface of the ocean. While a great deal may be caught in the net, there is
still a wealth of information that is deep, and therefore, missed. The reason is
simple: Most of the Web's information is buried far down on dynamically generated
sites, and standard search engines never find it.
Searching on the Internet today can be compared to dragging a net across
the surface of the ocean. While a great deal may be caught in the net, there is
still a wealth of information that is deep, and therefore, missed. The reason is
simple: Most of the Web's information is buried far down on dynamically generated
sites and standard search engines never find it.
Traditional search engines create their indices by spidering or crawling surface
Web pages. To be discovered, the page must be static and linked to other pages.
Traditional search engines can not "see" or retrieve content in the Deep-Web -those
pages do not exist until they are created dynamically as the result of a specific
search. Because traditional search engine crawlers can not probe beneath the surface,
the Deep-Web has heretofore been hidden.
The Deep-Web is qualitatively different from the surface Web. Deep-Web sources store
their content in searchable databases that only produce results dynamically in response
to a direct request. But a direct query is a "one at a time" laborious way to search.
With this in mind, search engine experts have quantified the size and
relevancy of the Deep-Web in a study based on data collected between March 13 and
30, 2000. Our key findings include:
- Public information on the deep Web is currently 400 to 550 times larger than
the commonly defined World Wide Web.
- The deep Web contains 7,500 terabytes of information compared to nineteen terabytes
of information in the surface Web.
- The deep Web contains nearly 550 billion individual documents compared to
the one billion of the surface Web.
- More than 200,000 deep Web sites presently exist.
- On average, deep Web sites receive fifty per cent greater monthly traffic than surface
sites and are more highly linked to than surface sites; however, the typical (median)
deep Web site is not well known to the Internet-searching public.
- The deep Web is the largest growing category of new information on the Internet.
deep Web sites tend to be narrower, with deeper content, than conventional surface
sites.
- Total quality content of the deep Web is 1,000 to 2,000 times greater than
that of the surface Web.
- Deep Web content is highly relevant to every information need, market, and domain.
- More than half of the deep Web content resides in topic-specific databases.
- A full ninety-five per cent of the deep Web is publicly accessible information —
not subject to fees or subscriptions.
To put these findings in perspective, a study at the NEC Research Institute,
published in Nature Journal estimated that the search engines with the largest number
of Web pages indexed (such as Google or Northern Light) each index no more than
16% of the surface Web. Since they are missing the deep Web when they use such search
engines, Internet searchers are therefore searching only 0.03% - or one in 3,000
- of the pages available to them today. Clearly, simultaneous searching of multiple
surface and deep Web sources is necessary when comprehensive information retrieval
is needed.
What is the ultimate scope and completeness of UFOdexSM.net?
The scope and completeness will vary based on the rate of money that
is raised for the accusation and integration of UFO data and knowledge. The mining
of the Deep-Web and surface web will be quick, relentless and effective after the
purchase of a software tool called the Deep Query Manager (DQM).
The scanning and indexing of books is relatively straightforward; they need to be
purchased or photocopied then scanned and optically character recognized (OCR'ed).
Next the book will have metatags, keywords and relevancy attributes appended to
it for optimal searching and presentation to the customer.
In the case of images, they will need to be labeled, sourced and in some cases have
small abstracts about them created. This process will occur in part from the scanning
of books and mining the deep web.
In the case of DVD's, videos and audio programs they will need to be transcribed
then put into chapters so that when a search is made the relevant visual scene and
dialogue context is properly served up to match the request.
For the Demonstration Project -- The scope and completeness will be about 90,000
pages of documents and demonstrate functionality in all the major categories of
UFOdexSM (Books, Deep-Web, Video, Audio, Periodicals,
Archives, and Images). It will be available online to a limited number of users
and researchers that are willing to provide feedback about usability and effectiveness.
For the Pilot Project -- The scope and completeness will be greater than 250,000
pages and include a credible response to the 100 most common queries that are posed
to UFOdexSM across multiple search categories.
For the Full Release Implementation -- The scope and completeness will be a minimum
of a million pages and provide credible responses to 1000 of the most common search
queries.
For Ongoing Maintenance -- The DQM will be updating surface web and Deep-Web content
daily, along with scanning and integrating new books, videos and related information.
Google searches the surface web and according to the search statistics
only crawls and caches about 16% of the available surface web. The other search
engines, Yahoo, MSN, etc. are in the same range or lower. No Deep-Web penetration,
no access to databases behind web search boxes there are 200,000 of them. Although
Google Books and Google Video are available, they offer again a shallow swath of
information. UFOdexSM is focused on just the
UFO information, all of it and nothing else.
There really is no comparison, and in many respects there is no competition.
UFOdexSM is a tool that will give you even better
access to the information in other UFO sites and will provide you with the rich
context from other sources that will make the conventional UFO sites even more meaningful.
The operators of such sites should find that their own results improve, due to referrals
from in-depth searches provided by UFOdexSM.
For a very good website like www.ufoevidence.org or the superb news site like www.earthfiles.com
--both of which reside on the "surface web"-all the content will be completely indexed
by UFOdexSM then categorized, integrated and
presented as part of the full search results provided by UFOdexSM.
Sites like www.mufon.com, www.fbi.gov, www.blackvault.com or www.bluebookarchive.org
will be targeted by the Deep Query Manager and hundreds, or perhaps thousands, of
searches will be performed to harvest all the knowledge contained in their databases.
In short, there is no comparison in terms of the capability to do full research
on a UFO topic. Conventional UFO sites will have their content even better accessible
to users. Plus, everything that is on the web, the UFOdexSM
proprietary database (such as full-text books), and ultimately in the Library of
Congress will be at your fingertips.
Sophisticated search technology automates the process of making dozens
of direct queries simultaneously using multiple-thread technology. This is the only
search technology, so far, that is capable of identifying, retrieving, qualifying,
classifying, and organizing both "deep" and "surface" web content.
If the most coveted commodity of the Information Age is indeed information, then
the value of Deep-Web content is immeasurable. UFOdexSM
researchers will be able to employ the same level of rigor and thoroughness as do
many of the best corporate, intelligence and defense researchers online.
|