CSUMB Presidential Transition

After years of using htDig, an open-source search tool built by our colleagues at San Diego State University, we have moved to Google Mini for searching official CSUMB.EDU pages. The new search engine is in the same location as the old one, just above the Quicklinks pull-down menu on any page that uses the wave template.

In spring 2005, WebPAT recommended purchasing Google Mini as our new search engine, both to better search pages created within our Charlotte content management system and to take advantage of advanced Google features such as KeyMatch Synonyms, which give us control over search results.

We crawl every six hours (6 a.m., noon, 6 p.m. and midnight), following links from pages within CSUMB.EDU. A crawl covers over 44,000 URLs on average and takes about two hours to complete. The total size of the stored documents is over 2,500 MB.

To keep under the Google Mini limit of 100,000 pages, we crawl only the CSUMB.EDU root domain plus department domains (e.g. hcom.csumb.edu, af.csumb.edu). We do not crawl data websites like Resource25, Planner, and the Phone Directory. We also do not crawl the Home and Classes servers, so the search results will not generate links to student, staff, faculty, and class-specific pages.
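
As a rough illustration of the scoping rule described above, the following Python sketch checks whether a URL falls inside the crawl. It is not the Google Mini configuration itself, which is set through the appliance's own administration interface. The function name `in_crawl_scope` and the host names in `EXCLUDED_HOSTS` are assumptions made for this example; the actual host names of the excluded servers are not given here.

```python
from urllib.parse import urlparse

ROOT_DOMAIN = "csumb.edu"

# Hypothetical host names standing in for the servers we do not crawl
# (Home, Classes, and the data websites). The real names may differ.
EXCLUDED_HOSTS = {
    "home.csumb.edu",
    "classes.csumb.edu",
}


def in_crawl_scope(url: str) -> bool:
    """Return True if the URL is on the root domain or a department
    subdomain, and is not on one of the excluded servers."""
    host = (urlparse(url).hostname or "").lower()
    if host in EXCLUDED_HOSTS:
        return False
    # Accept csumb.edu itself and any subdomain such as hcom.csumb.edu.
    return host == ROOT_DOMAIN or host.endswith("." + ROOT_DOMAIN)


if __name__ == "__main__":
    for candidate in (
        "http://hcom.csumb.edu/about",
        "http://home.csumb.edu/somepage",
        "http://example.com/",
    ):
        print(candidate, in_crawl_scope(candidate))
```

Matching on the host name rather than on a substring of the URL keeps look-alike domains out of scope while still admitting every department subdomain.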

Tips For CSUMB Publishers