|
#1
| ||||
| ||||
| Stripping HTML tags from text files? I was wondering if there was any simple utility, either graphical or Darwin, to strip HTML tags from an HTML (i.e., plaintext) file? A perl utility would be best, since I do a lot of work remotely on the terminal. TIA ![]()
__________________ //Gwailo// iMac TFT 700MHz G4, 786 RAM, 40GB Internal DVD-ROM/CD-RW 12x8x32 USB 64MB Flash Drive Wacom Graphire2 Tablet Epson 777i Colour Printer Canon PowerShot S30 Digital Camera JVC GR-DVF21 NTSC MiniDV Camera Canon EOS Elan II (35mm) "Like a beautiful flower full of colour and also fragrant, even so, fruitful are the fair words of one who practices them." --54th Surtra, The Dhammapada |
|
#2
| |||
| |||
| I'm sure there is a HTML parser module for perl. Try CPAN. You should be able to find one if not some. |
|
#3
| ||||
| ||||
| if all you want to do is strip the tags, you could do something like $html =~ s/<.+>//g; |
|
#4
| ||||
| ||||
| Quote:
![]()
__________________ //Gwailo// iMac TFT 700MHz G4, 786 RAM, 40GB Internal DVD-ROM/CD-RW 12x8x32 USB 64MB Flash Drive Wacom Graphire2 Tablet Epson 777i Colour Printer Canon PowerShot S30 Digital Camera JVC GR-DVF21 NTSC MiniDV Camera Canon EOS Elan II (35mm) "Like a beautiful flower full of colour and also fragrant, even so, fruitful are the fair words of one who practices them." --54th Surtra, The Dhammapada |
|
#5
| ||||
| ||||
| There's a utility call html2text on a NetBSD system I have an account on. I'm sure you could find it for Darwin. |
|
#6
| ||||
| ||||
| While I'm still going to look for a darwin alternative, I figured out that I could probably just open my web page in IE and select Save as Plain Text That'll have to suffice for now, but thanks for all the hints guys! ![]()
__________________ //Gwailo// iMac TFT 700MHz G4, 786 RAM, 40GB Internal DVD-ROM/CD-RW 12x8x32 USB 64MB Flash Drive Wacom Graphire2 Tablet Epson 777i Colour Printer Canon PowerShot S30 Digital Camera JVC GR-DVF21 NTSC MiniDV Camera Canon EOS Elan II (35mm) "Like a beautiful flower full of colour and also fragrant, even so, fruitful are the fair words of one who practices them." --54th Surtra, The Dhammapada |
|
#7
| ||||
| ||||
| Here: http://www.google.com/search?hl=en&i...-8&q=html2text . Looks like plenty of options. |
|
#8
| ||||
| ||||
| Perfect thanks Hazmat
__________________ //Gwailo// iMac TFT 700MHz G4, 786 RAM, 40GB Internal DVD-ROM/CD-RW 12x8x32 USB 64MB Flash Drive Wacom Graphire2 Tablet Epson 777i Colour Printer Canon PowerShot S30 Digital Camera JVC GR-DVF21 NTSC MiniDV Camera Canon EOS Elan II (35mm) "Like a beautiful flower full of colour and also fragrant, even so, fruitful are the fair words of one who practices them." --54th Surtra, The Dhammapada |
![]() |
| Thread Tools | |
|
|
Similar Threads | ||||
| Thread | Thread Starter | Forum | Replies | Last Post |
| iTunes adds ID3 tags to AIF files? | phatcactus | Mac OS X System & Mac Software | 3 | March 17th, 2003 11:45 PM |
| Stripping html code out of a source file | Darkshadow | Software Programming & Web Scripting | 8 | March 9th, 2003 10:58 PM |
| Converting text files from windows to unix (OSX) and back | paulsomm | Mac OS X System & Mac Software | 1 | December 4th, 2001 03:23 PM |
| Text Edit and HTML | cutman1000 | Apple News, Rumors & Discussion | 1 | October 6th, 2001 06:46 PM |
| Editing plain old text files | Allan Crowson | Mac OS X System & Mac Software | 6 | December 4th, 2000 11:19 AM |