Lenat's Bootstrap Hypothesis: once Cyc reaches a certain scale, it can help in its own development and start using natural language to augment its knowledge base.
Splunk sucks up every type of log you care to feed it, indexes them, and then makes them easily searchable via a nifty AJAX-enabled web interface.
hooch
... you can feed it all sorts of logs, including Apache, Microsoft IIS, JBoss, Windows Event Logs, Sendmail/Postfix/Qmail, OpenLDAP, Active Directory, etc.
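
To make the "index, then search" idea concrete, here is a minimal sketch of an inverted index over log lines. It is not how Splunk is implemented; the function names (`tokenize`, `build_index`, `search`) and the sample log lines are made up purely for illustration.

```python
import re
from collections import defaultdict


def tokenize(line):
    """Lowercase a log line and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", line.lower())


def build_index(lines):
    """Map each token to the set of line numbers that contain it."""
    index = defaultdict(set)
    for lineno, line in enumerate(lines):
        for token in tokenize(line):
            index[token].add(lineno)
    return index


def search(index, lines, query):
    """Return the lines that contain every token in the query (AND search)."""
    tokens = tokenize(query)
    if not tokens:
        return []
    hits = set.intersection(*(index.get(t, set()) for t in tokens))
    return [lines[i] for i in sorted(hits)]


# Hypothetical sample lines in a few different formats (not real log data).
SAMPLE_LOGS = [
    '127.0.0.1 - - [10/Oct/2007:13:55:36] "GET /index.html HTTP/1.1" 200',
    'postfix/smtpd[2211]: connect from unknown[10.0.0.5]',
    '127.0.0.1 - - [10/Oct/2007:13:56:01] "GET /missing HTTP/1.1" 404',
]
INDEX = build_index(SAMPLE_LOGS)
```

Calling `search(INDEX, SAMPLE_LOGS, "GET 404")` returns only the lines that contain both tokens, regardless of which kind of log they came from. A real log indexer would also have to handle timestamps, field extraction, and data far too large to hold in memory.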