Welcome to collectivesolver - Programming & Software Q&A with code examples. A website with trusted programming answers. All programs are tested and work.

Contact: aviboots(AT)netvision.net.il

How to remove Unicode characters of a specific range from a string in Java

1 Answer

import java.nio.charset.StandardCharsets;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

public class MyClass {
    public static void main(String[] args) {
        // Sample input; the escape U+00A9 (copyright sign) stands in for any non-ASCII character.
        byte[] utf8Bytes = "java c c++ \u00A9 php".getBytes(StandardCharsets.UTF_8);

        String utf8 = new String(utf8Bytes, StandardCharsets.UTF_8);

        // Match every character outside the ASCII range U+0000 to U+007F.
        Pattern unicode = Pattern.compile("[^\\x00-\\x7F]");
        Matcher matcher = unicode.matcher(utf8);

        // Replace each matched character with a single space.
        utf8 = matcher.replaceAll(" ");

        System.out.println(utf8);
    }
}

/*
run:

java c c++   php

*/
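The answer above strips everything outside ASCII. For an arbitrary range, one possible sketch is to build the character class from the two code point bounds. The class name RemoveRangeDemo, the helper removeRange, and the choice of the Greek and Coptic block (U+0370 to U+03FF) are illustrative assumptions, not part of the original answer.

```java
import java.util.regex.Pattern;

class RemoveRangeDemo {
    // Hypothetical helper: removes every code point from first to last (inclusive).
    // Uses the \x{h...h} escape of java.util.regex, which also covers code points above U+FFFF.
    static String removeRange(String text, int first, int last) {
        String regex = "[\\x{" + Integer.toHexString(first) + "}-\\x{"
                     + Integer.toHexString(last) + "}]";
        return Pattern.compile(regex).matcher(text).replaceAll("");
    }

    public static void main(String[] args) {
        // \u03B1 and \u03B2 are Greek alpha and beta, both inside U+0370..U+03FF.
        String s = "java \u03B1\u03B2 c++ php";

        // prints "java  c++ php" (two spaces remain where the Greek letters were)
        System.out.println(removeRange(s, 0x0370, 0x03FF));
    }
}
```

Passing " " instead of "" to replaceAll would mirror the original answer, which substitutes a space for each removed character.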

answered Jul 27, 2020 by avibootz