Ilmu Komputer    
   
Daftar Isi
(Sebelumnya) Google ScholarGoogle Shell (Berikutnya)

Google Search Appliance

Google appliance as shown at RSA Conference 2008

The Google Search Appliance is a rack-mounted device providing document indexing functionality that can be integrated into an intranet, document management system or web site using a Google search-like interface for end-user retrieval of results. The operating system is based on CentOS. The software is produced by Google and the hardware is manufactured by Dell Computers and is based on Dell's PowerEdge R710.[1] Sales operate on a licensing scheme which starts as a two-year contract for maintenance, support, and software updates.

Contents

Features

The Google Search Appliance contains Google search technologies and a means of configuring and customizing the appliance. The appliance also comes with a T-shirt.[2]

Other features include

  • it supports Google Analytics and Google Sitemaps functionality
  • its search capabilities include searching web content, other file types (e.g. html, pdf, office documents), databases (Oracle, MySQL, Microsoft SQL Server, IBM DB2, Sybase) and content management systems (EMC Documentum, FileNet, Open Text LiveLink, Microsoft SharePoint)
  • indexing (crawling) of search-able content can be configured by specifying URLs to crawl. Search patterns can also be included to limit the information that is being searched and searching can be customized by using the OneBox API
  • the result set will be displayed with a Google-like appearance. The default behavior can be customized by using XSL Transformations
  • keywords that returns specific result when specific keywords are used. Example: Associate Cell Phone with http://SampleCellProvider.com so whenever someone searches for cell phone your link will appear at the top of the search no matter where it would normally appear in the result set
  • synonyms will give alternate terms for your search. E.g. when user types “cell phone”. Search will add suggestions e.g. “mobile phone” to the result set
  • cached results each result item will include a "cached" link next to each result item. By clicking on the user will be able to view an HTML version of the page / document which means that the actual document does not need to be opened
  • The result set also contains number of results returned, duration of search, document title, url of document, date modified.
  • search terms are highlighted to show search hits and allows you to see words in context without having to open documents.
  • groups similar results to hide duplicates.
  • shows document types
  • result sets can be sorted by date or relevance

Scalability

  • Multiple appliances can be linked together to scale to billions of documents.
  • Physical hardware can be distributed across multiple locations.

Administration

Minimal support infrastructure and sysadmin staff is needed as quoted on their web site “…doesn’t need a tech support baby-sitter. You simply plug it in, configure it, and let it run…”. The device does come with a web based administrative console that can be used to make configuration changes where needed. Additional customisation is possible through a Representational State Transfer (REST) API that allows for automation of tasks. There are also existing modules that can be used for customization.

Newest software

Software version 6.0 was released in June, 2009. This software runs on some hardware versions of the GB-1001 model (all units with an "S5" prefix in their "Appliance ID"), and all GB-7007 and GB-9009 models. New features available in this software include:

  • Customized and enhanced relevancy tuning to bias certain nodes’ and collections’ results.
  • Administration APIs for .net and Java programmers to automate tasks.[3]
  • Early binding to increase serving performance.
  • Customization in SAML authentication and Authorization.
  • Added user results to search results.
  • Search-as-you-Type functionality.
  • Query translation to 40 different languages.
  • Replication of search results.
  • Clustering multiple GSAs by using a new technology called (GSA)n makes it possible to index up to 1 billion documents.[citation needed]

Models

The Google Search Appliance can be purchased in two separate versions based on the number of documents being indexed. Model GB-7007, a 2U appliance, can index up to 10,000,000 documents. The GB-9009 5U appliance[4] can index up to 30,000,000 documents.

Discontinued versions

Older appliances

Google used to sell a 1U appliance (GB-1001) capable of indexing up to 5,000,000 documents, a half-rack cluster (GB-5005) of five 2U nodes capable of indexing up to 10,000,000 documents, and a full-rack cluster (GB-8008) of eight and later twelve nodes capable of indexing up to 30,000,000 documents.[4] Some models were based on Dell PowerEdge 2950 2U rackmount servers.

Google Mini

The Google Mini was a smaller and lower-cost solution for small and medium-sized businesses to set up a search engine that allowed them to index and search up to 300,000 documents.[5] As part of Google's spring cleaning 2012 the Google Mini was discontinued beginning July 31, 2012.[6]

Google Search Appliance virtual edition for developers

For a brief period in 2008 Google offered a virtual version of the Google Search Appliance aimed at developers. The virtual edition could be downloaded free of charge and index up to 50,000 documents.[7] It was soon discontinued for unknown reasons.

Product availability

The Google Search Appliance is available in the United States, Canada, Europe, Japan, parts of Asia, the Middle East, North Africa and South America. If a person is interested in using the Google Search Appliance in another region, they can deploy the Google Search Appliance at a location or data center in the US, Canada, or Europe.

Criticism

Even though Google search and Google Search Appliances have proven to have many advantages for organizations implementing them, some business analysts have suggested[8] that Google Search Appliances may introduce two risks: for breaching the privacy acts, and for exposing the organization to commercial security risks.

References

External links

(Sebelumnya) Google ScholarGoogle Shell (Berikutnya)