> is there anyone who knows how to text-edit on html page using tools
> such as sed, awk, or perl?
> for example, in finance.yahoo.com, i find out financial statement,
> then, i want to substract only numbers that i want to put into my
> based upon your expertise, is there any way to do such things by using
> text-editing tools with regular expression? it would be great to have
> respone on how to do that, but i would be stil good to learn just
There are many ways of dealing with HTML pages; without knowing
more about what you want, it's hard to say how you should proceed.
You can retrieve the page by
lynx -source $URL > FILE.html".
or, to get a text rendering of the page:
lynx -dump -nolist $URL > FILE.html".
Then you can retrieve the lines you want with grep, pipe them to
awk or sed or cut to extract the information you want.
Chris F.A. Johnson http://cfaj.freeshell.org
My code (if any) in this post is copyright 2003, Chris F.A. Johnson
and may be copied under the terms of the GNU General Public License