4.10 The Case Table
You can customize case conversion by installing a special case table. A case table specifies the mapping between upper case and lower case letters. It affects both the case conversion functions for Lisp objects (see the previous section) and those that apply to text in the buffer (see Case Changes). Each buffer has a case table; there is also a standard case table which is used to initialize the case table of new buffers.
A case table is a char-table (see Char-Tables) whose subtype is case-table
. This char-table maps each character into the corresponding lower case character. It has three extra slots, which hold related tables:
upcase
β
The upcase table maps each character into the corresponding upper case character.
canonicalize
β
The canonicalize table maps all of a set of case-related characters into a particular member of that set.
equivalences
β
The equivalences table maps each one of a set of case-related characters into the next character in that set.
In simple cases, all you need to specify is the mapping to lower-case; the three related tables will be calculated automatically from that one.
For some languages, upper and lower case letters are not in one-to-one correspondence. There may be two different lower case letters with the same upper case equivalent. In these cases, you need to specify the maps for both lower case and upper case.
The extra table canonicalize
maps each character to a canonical equivalent; any two characters that are related by case-conversion have the same canonical equivalent character. For example, since βa
β and βA
β are related by case-conversion, they should have the same canonical equivalent character (which should be either βa
β for both of them, or βA
β for both of them).
The extra table equivalences
is a map that cyclically permutes each equivalence class (of characters with the same canonical equivalent). (For ordinary ASCII, this would map βa
β into βA
β and βA
β into βa
β, and likewise for each set of equivalent characters.)
When constructing a case table, you can provide nil
for canonicalize
; then Emacs fills in this slot from the lower case and upper case mappings. You can also provide nil
for equivalences
; then Emacs fills in this slot from canonicalize
. In a case table that is actually in use, those components are non-nil
. Do not try to specify equivalences
without also specifying canonicalize
.
Here are the functions for working with case tables:
function
case-table-p objectβ
This predicate returns non-nil
if object
is a valid case table.
function
set-standard-case-table tableβ
This function makes table
the standard case table, so that it will be used in any buffers created subsequently.
function
standard-case-tableβ
This returns the standard case table.
function
current-case-tableβ
This function returns the current bufferβs case table.
function
set-case-table tableβ
This sets the current bufferβs case table to table
.
macro
with-case-table table bodyβ¦β
The with-case-table
macro saves the current case table, makes table
the current case table, evaluates the body
forms, and finally restores the case table. The return value is the value of the last form in body
. The case table is restored even in case of an abnormal exit via throw
or error (see Nonlocal Exits).
Some language environments modify the case conversions of ASCII characters; for example, in the Turkish language environment, the ASCII capital I is downcased into a Turkish dotless i (βΔ±
β). This can interfere with code that requires ordinary ASCII case conversion, such as implementations of ASCII-based network protocols. In that case, use the with-case-table
macro with the variable ascii-case-table
, which stores the unmodified case table for the ASCII character set.
variable
ascii-case-tableβ
The case table for the ASCII character set. This should not be modified by any language environment settings.
The following three functions are convenient subroutines for packages that define non-ASCII character sets. They modify the specified case table case-table
; they also modify the standard syntax table. See Syntax Tables. Normally you would use these functions to change the standard case table.
function
set-case-syntax-pair uc lc case-tableβ
This function specifies a pair of corresponding letters, one upper case and one lower case.
function
set-case-syntax-delims l r case-tableβ
This function makes characters l
and r
a matching pair of case-invariant delimiters.
function
set-case-syntax char syntax case-tableβ
This function makes char
case-invariant, with syntax syntax
.
command
describe-buffer-case-tableβ
This command displays a description of the contents of the current bufferβs case table.