l i n u x - u s e r s - g r o u p - o f - d a v i s
Next Meeting:
July 7: Social gathering
Next Installfest:
Latest News:
Jun. 14: June LUGOD meeting cancelled
Page last updated:
2003 Jun 11 19:17

The following is an archive of a post made to our 'vox-tech mailing list' by one of its subscribers.

Report this post as spam:

(Enter your email address)
Re: [vox-tech] Parsing Html
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [vox-tech] Parsing Html

On Wednesday 11 June 2003 02:54 pm, Mike Simons wrote:
> On Wed, Jun 11, 2003 at 04:00:06PM -0500, Jay Strauss wrote:
> > Found  HTML::TableContentParser which does some of the heavy lifting for
> > me playing with it now
> >
> > > http://quote.cboe.com/QuoteTable.asp?TICKER=qqq&ALL=2
> > >
> > > It seems like there would be a cpan thing to read in a string (html),
> > > then would let me navigate.  That is, give me the third table, give me
> > > the first row, give me the first table data
>   save the html into a file with wget, then feed that as an argument to
> the perl below... if you want the calls and puts broken into separate
> arrays or into hashes it should be easy from here.

Silly question, why not open the wget in the open call ?  Ie:

open FOOF, "wget -q -O - <url> |"; 
while (<FOOF>) { 
$file .= $_;
$_ = $file; 

I realize using something fun like LWP would be better, but this would be the 
poor mans way of doing it I would think...


Mike Wenk
vox-tech mailing list

LUGOD Group on LinkedIn
Sign up for LUGOD event announcements
Your email address:
LUGOD Group on Facebook
'Like' LUGOD on Facebook:

Hosting provided by:
Sunset Systems
Sunset Systems offers preconfigured Linux systems, remote system administration and custom software development.

LUGOD: Linux Users' Group of Davis
PO Box 2082, Davis, CA 95617
Contact Us

LUGOD is a 501(c)7 non-profit organization
based in Davis, California
and serving the Sacramento area.
"Linux" is a trademark of Linus Torvalds.

Sponsored in part by:
Sunset Systems
Who graciously hosts our website & mailing lists!