# unidesc/cfg - "Build" settings for package

#---------------------------------------------------------------------

[technotes]

1. The source tarball used by this package has been renamed.

#---------------------------------------------------------------------

[buildtimes]

00.00 hours (or 000.17 minutes) - HP EliteBook 8560w 32GB RAM
00.00 hours (or 000.17 minutes) - ThinkPad E540 i7 4x2 16GB RAM
00.00 hours (or 000.22 minutes) - Dell Inspiron 6400 2.0 GHz Intel Duo
                                  7200 2GB RAM
00.01 hours (or 000.63 minutes) - Compaq 1.7 GHz Intel Pentium 4 512MB
                                  RAM

#---------------------------------------------------------------------

[settings]
build   = default
exepack = no
license = See "license*.txt" in installed tree
licfile = COPYING

#---------------------------------------------------------------------

[depends]
actools
ascii2bin
gawk
grep

#---------------------------------------------------------------------

[configure]
bash ./configure \
    --prefix=$PKGDIR_PROD  \
    --disable-dependency-tracking

#---------------------------------------------------------------------

# Original URLs. These URLs were valid at one point, but may have died
# since then. If you download newer versions of tarballs [etc.], don't
# delete  the  original versions,  as you may not  be able  to replace
# them.

[urls]
url_debian  = n/a
url_home    = http://billposer.org/Software/unidesc.html
url_lfs     = n/a

url_tarball = http://billposer.org/Software/Downloads/\
unidesc-2.23.tbz

#---------------------------------------------------------------------

[about]
This package provides five Unicode-related utilities:

uniname -  By default, "uniname" prints  the  character offset of each
character,  its byte offset,  its  hex-code value,  its encoding,  the
associated glyph, and its name.  Command line options allow  undesired
information to be suppressed and the Unicode range to be added.  Other
options permit a specified number of bytes or  characters to be  skip-
ped.

"unidesc" reports the character ranges to which  different portions of
the  text belong.  It can also be used to  identify  Unicode encodings
(e.g., UTF-16be) flagged by magic numbers.

"unihist" generates a histogram of the characters in its input,  which
must be encoded in UTF-8 Unicode.  By default, for each character,  it
prints the  frequency of the character as a  percentage of  the total,
the absolute number of tokens in the input,  the UTF-32 code  in hexa-
decimal,  and,  if the character is displayable,  the glyph  itself as
UTF-8 Unicode.  Command-line flags  allow  unwanted information  to be
suppressed.  In particular,  note that by suppressing the  percentages
and counts, it is possible to generate a list of the unique characters
in the input.

ExplicateUTF8 is intended for debugging or for learning about Unicode.
It determines and  explains the  validity of a  sequence of bytes as a
UTF8 encoding.

"utf8lookup"  is a  shell script which invokes "uniname" to provide an
easy way to look up the  character name  corresponding to  a codepoint
from the command line.

Note:  The "unidesc" package  requires the  external utility  "ascii2-
binary".