The code snippet below removes every character from a string that falls outside the ASCII range \x20 to \x7E. This strips control characters as well as all non-ASCII characters, such as accented letters. The regex makes two exceptions: it keeps the linefeed character \n (\x0A) and the carriage return character \r (\x0D).
package org.kodejava.regex;

public class ReplaceNonAscii {
    public static void main(String[] args) {
        String str = "Thè quïck brøwn føx jumps over the lãzy dôg.";
        System.out.println("str = " + str);

        // Replace all non ascii chars in the string.
        str = str.replaceAll("[^\\x0A\\x0D\\x20-\\x7E]", "");
        System.out.println("str = " + str);
    }
}
Snippet output:
str = Thè quïck brøwn føx jumps over the lãzy dôg.
str = Th quck brwn fx jumps over the lzy dg.
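
The sample string above contains no line breaks, so the output does not show that \n and \r survive the replacement. The following is a minimal sketch of that behavior, assuming the same character class as the snippet above. The class name KeepLineBreaksDemo and the helper stripNonPrintableAscii are hypothetical names introduced only for illustration, and compiling the pattern once with java.util.regex.Pattern is a design choice of this sketch, not part of the original snippet.

```java
import java.util.regex.Pattern;

class KeepLineBreaksDemo {
    // Same character class as in the snippet above, compiled once for reuse.
    private static final Pattern NON_PRINTABLE_ASCII =
            Pattern.compile("[^\\x0A\\x0D\\x20-\\x7E]");

    // Hypothetical helper: removes everything except \n, \r and \x20-\x7E.
    static String stripNonPrintableAscii(String input) {
        return NON_PRINTABLE_ASCII.matcher(input).replaceAll("");
    }

    public static void main(String[] args) {
        // Input contains an e-acute, a tab, a CR LF pair and an i-diaeresis.
        String input = "caf\u00E9\tau lait\r\nna\u00EFve";
        String cleaned = stripNonPrintableAscii(input);

        // The accented letters and the tab are removed; the CR LF pair is kept.
        System.out.println(cleaned.equals("cafau lait\r\nnave")); // prints "true"
    }
}
```

Note that the tab character (\x09) is removed along with the other control characters. If tabs should be preserved too, \\x09 could be added to the character class.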