Home > Dashboard > GeoShell R4 > ... > GeoExtract > GeoExtract Search-Expression Library
GeoShell R4 Log In | Sign Up   View a printable version of the current page.
GeoExtract Search-Expression Library
Added by geKow, last edited by geKow on Jun 17, 2004
Labels: 
(None)

Under Construction

Here is a collection of the settings used to make GeoExtract extract various bits of information from the Internet. Feel free to contribute your own settings to help build this library, so other users can benefit from your work.

See also GeoExtractAsLogTailer for a clever use of GeoExtract.

Caution!
When copying and pasting from this page, bear in mind that line breaks in the search expressions are probably not actually present--they are likely only there due to the need to wrap the text to fit in the table cell. _Also watch out for spaces which are filled in by the wiki engine before and after the cell content. (You need to delete them to make the expression work!)_


!!!News

Top three headlines from slashdot

Host slashdot.org
Absolute Path /
Search Expression
/slc.gif%!>%*</B>%" - %"%!/slc.gif%!>%*</B>%" - %"%!/slc.gif%!>%*</B>%" - %"

Three most recent headlines from the www.heise.de newsticker ''(german IT news)''

Host www.heise.de
Absolute Path /newsticker/
Search Expression
 <A HREF="/newsticker/data/%!">%*</A>%!%" -- %"%!<A HREF="/newsticker/data/%!">%*</A>%!%" -- %"%!<A HREF="/newsticker/data/%!">%*</A>%" -- %"%!

_Alternative www.heise.de, Newsheadlines_

Host www.heise.de
Absolute Path /
SE
<!-- MITTE (LOGO + DATUM) -->%!<tt>%*</td>%!SIZE="+1">%" - %"%!/">%*</A>%"   +   %"%!SIZE="+1">%" - %"%!/">%*</A>%"   +   %"%!SIZE="+1">%" - %"%!/">%*</A>%"   +   %"

_Top three headlines from Financial Times Deutschland_ ''(german buisiness news)''

Host www.ftd.de
Absolute Path /
Search Expression
 %"   +++ FTD:   %"XL-bold%!<B>%*</B>%" ---- %"%!XL-bold%!<B>%*</B>%" ---- %"%!XL-bold%!<B>%*</B>%"

_Top three headlines at Die Welt online_ ''(german world news)''

Host www.welt.de
Absolute Path /service/newsticker/meldungen.htx
Search Expression
%"   +++   %"selected_meldung" ID="">%*</A>%"  %"%!TARGET="selected_meldung">%*</A>%"   +++   %"%!selected_meldung" ID="">%*</A>%"  %"%!TARGET="selected_meldung">%*</A>%"   +++   %"%!selected_meldung" ID="">%*</A>%"  %"%!TARGET="selected_meldung">%*</A>

_Top 3 Headlines from Google News_

Host news.google.com
Absolute Path /
Search Expression
 %" Google News: %"%!</noscript>%!<a href=%!title="%*">%" - %"%!<td width=80 align=center valign=top>%!<a href=%!title="%*">%" - %"%!<td width=80 align=center valign=top>%!<a href=%!title="%*">

_Top 4 Kuro5hin.org headlines_

Host www.kuro5hin.org
Absolute Path /backend.rdf
Search Expression
%!<link>http://www.kuro5hin.org/</link>%!<link>http://www.kuro5hin.org/</link>%!<title>%*</title>%"  --  %"%!<title>%*</title>%"  --  %"%!<title>%*</title>%"  --  %"%!<title>%*</title>%"          %"

_Top 3 Headlines from Sci-Fi Wire_

Host www.scifi.com
Absolute Path /scifiwire/
Search Expression
 %"!SciFi Wire: %"%!<!-- HEADLINE: %*-->%"- %"%!<!-- HEADLINE: %*-->%"- %"%!<!-- HEADLINE: %*-->

!!!!GeoShell

_Be alerted to the RecentEdits @ the !GeoWiki_

Host geoshell.sourceforge.net
Absolute Path /!GeoWiki/!RecentEdits
Search Expression
<li class="rc-%!>%*</li>

_The 5 newest posting headlines at geoshellx.com_

Host geoshellx.com
Absolute Path /
Search Expression
%"   +++ GSx board:  %"Latest Posts..%!">%*</a>%!by:%!">%" - %"%*</a>%" + %"%!">%*</a>%!by:%!">%" - %"%*</a>%" + %"%!">%*</a>%!by:%!">%" - %"%*</a>%" + %"%!">%*</

!!!Weather

_Current temperature in New York City_

Host www.theweathernetwork.com
Absolute Path /cities/us/new_york_NY.htm
Search Expression
Guardia Airport%!"stlt">%*<%!Temperature%!"stlt">%*<

!!!Sports

_Next hockey game_

Host www2.sportsnet.ca
Absolute Path /nhl/TOR/
Search Expression
Next game:%"Next Leaf game:%"%*<
''In the Absolute Path, change "TOR" to your team's three-letter abbreviation.''

!!!Other

_Formatted Ten most recent files added from fileforum_

Host fileforum.betanews.com
Absolute Path /
Search Expression
LATEST RELEASES STARTS HERE%"( File Forum Recent Files )  ::   %"%!size="3"%!<b>%*</b>%!</a>%"  :  %"%n%!size="3"%!<b>%*</b>%!</a>%"  :  %"%n%!size="3"%!<b>%*</b>%!</a>%"  :  %"%n%!size="3"%!<b>%*</b>%!</a>%"  :  %"%n%!size="3"%!<b>%*</b>%!</a>%"  :  %"%n%!size="3"%!<b>%*</b>%!</a>%"  :  %"%n%!size="3"%!<b>%*</b>%!</a>%"  :  %"%n%!size="3"%!<b>%*</b>%!</a>%"  :  %"%n%!size="3"%!<b>%*</b>%!</a>%"  :  %"%n%!size="3"%!<b>%*</b>%!</a>%"      %"

%%% Should work fine by cutting and pasting now, thanks GeKow

_Three most recently reported vulnerabities from !SecurityFocus Online_

Host online.securityfocus.com
Absolute Path /
Search Expression
 lities.gif%!bulletlink%!>%*</a>%" - %"%!-->%!bulletlink%!>%*</a>%" - %"%!-->%!bulletlink%!>%*</a>%" - %"

_Three most recently reported advisories from !SecurityFocus Online_

Host online.securityfocus.com
Absolute Path /
Search Expression
advisories.gif%!bulletlink%!>%*</a>%" - %"%!-->%!bulletlink%!>%*</a>%" - %"%!-->%!bulletlink%!>%*</a>%" - %"

_The first somewhat 1300 digits of pi_

Host www.joyofpi.com
Absolute Path /pi.htm
Search Expression
 ="6">%*<%!>%* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %* %*

_The last played song at 1.live_ ''(german radio station aka wdr1)''

Host www.einslive.de
Absolute Path /diemusik/dieplaylists/die_letzten_12/
Search Expression
<TR><TD valign="top" class="cont">%*</TD>%!cont">%*</TD>%!contbold">%": %"%*</TD>%" - %"

_Follow an Auction at ebay.com_

Host cgi.ebay.com
Absolute Path /ws/eBayISAPI.dll?ViewItem&item=1385063494
Search Expression
%"   +++   %"<title>%*</title>%!Currently%"   Currently:  %"%!<b>%*width%!Time%!<b>%" Time left: %"%*width
''You need to adjust the absolute path to the page you are looking for!''

_Auktion auf ebay.de_

Host cgi.ebay.de
Absolute Path /ws/eBayISAPI.dll?ViewItem&item=1385749415
Search Expression
%"   +++   %"<title>%*</title>%!Gebot%"  Aktuelles Gebot: %"%!<b>%*width%!Zeit%!%" Verbleibende Zeit: %"<b>%*Stunden,%"std.%"%*Minuten%!+%"min. %"
''absolute path muß angepasst werden!''%%% There is a strange single "e" at the end of the line, wich i didn't catch till know... I will look for it later

_Internet Traffic Report_

Host www.internettrafficreport.com
Absolute Path /history/169.htm
Search Expression
Packet Loss%!<b>%*</b>%" --- Quality: %"%!<b>%*</b>%"/100   Response time: %"%!<b>%*</b>%" ms   Packet loss: %"%!<b>%*</b>%"%%"
''Go to http://www.internettrafficreport.com and select the router closest to you.  Use that HTML page as your Absolute Path (the one given here is for Ontario).''

Site powered by a free Open Source Project / Non-profit License (more) of Confluence - the Enterprise wiki.
Learn more or evaluate Confluence for your organisation.
Powered by Atlassian Confluence, the Enterprise Wiki. (Version: 2.3 Build:#641 Jan 13, 2007) - Bug/feature request - Contact Administrators