FAQ Search Today's Posts Mark Forums Read
» Video Reviews

» Linux Archive

Linux-archive is a website aiming to archive linux email lists and to make them easily accessible for linux users/developers.


» Sponsor

» Partners

» Sponsor

Go Back   Linux Archive > Debian > Debian Development

 
 
LinkBack Thread Tools
 
Old 02-10-2008, 01:08 AM
Daniel Leidert
 
Default Bug#464957: ITP: html-xml-utils -- manipulate HTML and XML files

Package: wnpp
Severity: wishlist
Owner: Daniel Leidert <daniel.leidert@wgdd.de>

-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

* Package name : html-xml-utils
Version : 4.4
Upstream Author : Bert Bos <bert@w3.org>
* URL : http://www.w3.org/Tools/HTML-XML-utils/
* License : W3C Software License (v20021231)
Programming Lang: C
Description : manipulate HTML and XML files

A number of simple utilities for manipulating HTML and XML files.
.
cexport - create headerfile of exported declarations from a C file
addid - add ID's to selected elements
cite - replace bibliographic references by hyperlinks
cite-mkbib - expand references and create bibliography
count - count elements and attributes in HTML or XML files files
extract - extract selected elements
htmlclean - apply heuristics to correct an HTML file
htmlprune - remove marked elements from an HTML file
incl - expand included HTML or XML files
index - create an alphabetically sorted index
mkbib - create bibliography from a template
multitoc - create a table of contents for a set of HTML files
name2id - move some ID= or NAME= from A elements to their parents
normalize - pretty-print an HTML file
num - number section headings in an HTML file
pipe - convert XML to a format easier to parse with Perl or AWK
printlinks - number links & add table of URLs at end of an HTML file
toc - insert a table of contents in an HTML file
uncdata - replace CDATA sections by character entities
unent - replace HTML predefined character entities to UTF-8
unpipe - convert output of pipe back to XML format
unxmlns - replace "global names" by XML Namespace prefixes
wls - list links in an HTML file
xmlns - replace XML Namespace prefixes by "global names"
xmlrecode - convert between UTF-8 and &#nnn; entities
asc2xml - convert from UTF-8 to &#nnn; entities
xml2asc - convert from &#nnn; entities to UTF-8
xref - generate cross-references
xselect - extract elements that match a(CSS) selector


- -- System Information:
Debian Release: lenny/sid
APT prefers unstable
APT policy: (850, 'unstable'), (700, 'testing'), (550, 'stable'), (110, 'experimental')
Architecture: i386 (i686)

Kernel: Linux 2.6.22-3-k7 (SMP w/1 CPU core)
Locale: LANG=de_DE, LC_CTYPE=de_DE (charmap=ISO-8859-1)
Shell: /bin/sh linked to /bin/bash

-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.6 (GNU/Linux)

iD8DBQFHrlyzm0bx+wiPa4wRAoo+AKC1WL9uZlXXMTLDNd9Eoi JSRnZtdQCgxTxs
XgVm+ms2G7NuxPZcKNc/Hgs=
=HuPr
-----END PGP SIGNATURE-----



--
To UNSUBSCRIBE, email to debian-devel-REQUEST@lists.debian.org
with a subject of "unsubscribe". Trouble? Contact listmaster@lists.debian.org
 
Old 02-10-2008, 02:41 AM
Frans Pop
 
Default Bug#464957: ITP: html-xml-utils -- manipulate HTML and XML files

Daniel Leidert wrote:
> A number of simple utilities for manipulating HTML and XML files.

Are all these intended to be command line utilities?
If so, some of them seem to have names that are extremely generic and
could therefore easily lead to confusion.

The following I find most problematic in that regard:
> cite - replace bibliographic references by hyperlinks
> count - count elements and attributes in HTML or XML files files
> extract - extract selected elements
> incl - expand included HTML or XML files
> index - create an alphabetically sorted index
> normalize - pretty-print an HTML file
> num - number section headings in an HTML file
> pipe - convert XML to a format easier to parse with Perl or AWK
> toc - insert a table of contents in an HTML file
> unpipe - convert output of pipe back to XML format
> wls - list links in an HTML file
> xref - generate cross-references
> xselect - extract elements that match a(CSS) selector

I would suggest renaming them by adding a suitable prefix.

Cheers,
FJP


--
To UNSUBSCRIBE, email to debian-devel-REQUEST@lists.debian.org
with a subject of "unsubscribe". Trouble? Contact listmaster@lists.debian.org
 

Thread Tools




All times are GMT. The time now is 09:24 PM.

VBulletin, Copyright ©2000 - 2014, Jelsoft Enterprises Ltd.
Content Relevant URLs by vBSEO ©2007, Crawlability, Inc.
Copyright 2007 - 2008, www.linux-archive.org