# unidesc/cfg - "Build" settings for package #--------------------------------------------------------------------- [technotes] 1. The source tarball used by this package has been renamed. #--------------------------------------------------------------------- [buildtimes] 00.00 hours (or 000.17 minutes) - HP EliteBook 8560w 32GB RAM 00.00 hours (or 000.17 minutes) - ThinkPad E540 i7 4x2 16GB RAM 00.00 hours (or 000.22 minutes) - Dell Inspiron 6400 2.0 GHz Intel Duo 7200 2GB RAM 00.01 hours (or 000.63 minutes) - Compaq 1.7 GHz Intel Pentium 4 512MB RAM #--------------------------------------------------------------------- [settings] build = default exepack = yes license = See "license*.txt" in installed tree licfile = COPYING #--------------------------------------------------------------------- [depends] actools ascii2bin gawk grep #--------------------------------------------------------------------- [configure] bash ./configure \ --prefix=$PKGDIR_PROD \ --disable-dependency-tracking #--------------------------------------------------------------------- # Original URLs. These URLs were valid at one point, but may have died # since then. If you download newer versions of tarballs [etc.], don't # delete the original versions, as you may not be able to replace # them. [urls] url_debian = n/a url_home = http://billposer.org/Software/unidesc.html url_lfs = n/a url_tarball = http://billposer.org/Software/Downloads/\ unidesc-2.23.tbz #--------------------------------------------------------------------- [about] This package provides five Unicode-related utilities: uniname - By default, "uniname" prints the character offset of each character, its byte offset, its hex-code value, its encoding, the associated glyph, and its name. Command line options allow undesired information to be suppressed and the Unicode range to be added. Other options permit a specified number of bytes or characters to be skip- ped. "unidesc" reports the character ranges to which different portions of the text belong. It can also be used to identify Unicode encodings (e.g., UTF-16be) flagged by magic numbers. "unihist" generates a histogram of the characters in its input, which must be encoded in UTF-8 Unicode. By default, for each character, it prints the frequency of the character as a percentage of the total, the absolute number of tokens in the input, the UTF-32 code in hexa- decimal, and, if the character is displayable, the glyph itself as UTF-8 Unicode. Command-line flags allow unwanted information to be suppressed. In particular, note that by suppressing the percentages and counts, it is possible to generate a list of the unique characters in the input. ExplicateUTF8 is intended for debugging or for learning about Unicode. It determines and explains the validity of a sequence of bytes as a UTF8 encoding. "utf8lookup" is a shell script which invokes "uniname" to provide an easy way to look up the character name corresponding to a codepoint from the command line. Note: The "unidesc" package requires the external utility "ascii2- binary".