Skip to main content

15.10.2 Regexp Replacement

The M-x replace-string command replaces exact matches for a single string. The similar command M-x replace-regexp replaces any match for a specified regular expression pattern (see Regexps).

M-x replace-regexp RET regexp RET newstring RET​

Replace every match for regexp with newstring.

In replace-regexp, the newstring need not be constant: it can refer to all or part of what is matched by the regexp. β€˜\&’ in newstring stands for the entire match being replaced. β€˜\d’ in newstring, where d is a digit starting from 1, stands for whatever matched the dth parenthesized grouping in regexp. (This is called a β€œback reference".) β€˜\#’ refers to the count of replacements already made in this command, as a decimal number. In the first replacement, β€˜\#’ stands for β€˜0’; in the second, for β€˜1’; and so on. For example,

M-x replace-regexp RET c[ad]+r RET \&-safe RET

replaces (for example) β€˜cadr’ with β€˜cadr-safe’ and β€˜cddr’ with β€˜cddr-safe’.

M-x replace-regexp RET \(c[ad]+r\)-safe RET \1 RET

performs the inverse transformation. To include a β€˜\’ in the text to replace with, you must enter β€˜\\’.

If you want to enter part of the replacement string by hand each time, use β€˜\?’ in the replacement string. Each replacement will ask you to edit the replacement string in the minibuffer, putting point where the β€˜\?’ was.

The remainder of this subsection is intended for specialized tasks and requires knowledge of Lisp. Most readers can skip it.

You can use Lisp expressions to calculate parts of the replacement string. To do this, write β€˜\,’ followed by the expression in the replacement string. Each replacement calculates the value of the expression and converts it to text without quoting (if it’s a string, this means using the string’s contents), and uses it in the replacement string in place of the expression itself. If the expression is a symbol, one space in the replacement string after the symbol name goes with the symbol name, so the value replaces them both.

Inside such an expression, you can use some special sequences. β€˜\&’ and β€˜\d’ refer here, as usual, to the entire match as a string, and to a submatch as a string. d may be multiple digits, and the value of β€˜\d’ is nil if the d’th parenthesized grouping did not match. You can also use β€˜\#&’ and β€˜\#d’ to refer to those matches as numbers (this is valid when the match or submatch has the form of a numeral). β€˜\#’ here too stands for the number of already-completed replacements.

For example, we can exchange β€˜x’ and β€˜y’ this way:

M-x replace-regexp RET \(x\)\|y RET
\,(if \1 "y" "x") RET

For computing replacement strings for β€˜\,’, the format function is often useful (see Formatting Strings in The Emacs Lisp Reference Manual). For example, to add consecutively numbered strings like β€˜ABC00042’ to columns 73 toΒ 80 (unless they are already occupied), you can use

M-x replace-regexp RET ^.\{0,72\}$ RET
\,(format "%-72sABC%05d" \& \#) RET