Using String.startsWith() may not consistently work when the string includes unicode characters

Question

I am building a proof-of-concept conjugation practice application in Vue.js. When you type an answer for a conjugation, the app compares the input against the expected text using String.startsWith(). The problem is Unicode: in most cases the characters I type do not match the ones stored in the database. It shows up even in a simple Node CLI example, where the "ț" I type is a different character from the "ţ" in the database.

Below are the typed input and the stored comparison value, each with its Unicode escape:

input: anunț // anun\u021B
comparison: anunţ // anun\u0163

I have tried .normalize(), but neither the input string nor the comparison string is changed by it.

> var input = 'anunț'
> var comparison = 'anunţ'
> input === comparison
false
> input.normalize() === comparison
false
> input.normalize() === comparison.normalize()
false
> input === comparison.normalize()
false
/// etc etc with NFC, NFD, NFKC, NFKD forms
> input.normalize()
'anunț'
> comparison.normalize()
'anunţ'

// I've also tried .normalize() with the string decoded into Unicode escapes

I also tried converting to Unicode escapes and manually replacing one set of characters, but that approach has limits: for example, it is hard to get a positive match until the entire string has been entered.

Regex comparison was my next idea, although I suspect it leads down another complicated path.

Stripped of all those attempts, this is the core logic I want:

if (this.conjugation.startsWith(this.input)) {
    this.status = "correct";
} else {
    this.status = "incorrect";
}

if (conjugation === val) {
    // okay, we are done
}

Any suggestions on how to get past this? For now I am testing only with Romanian verbs, so the characters should fall within the following Unicode ranges:

\u0000-\u007F, \u0180-\u024F, \u0100-\u017F

javascript vue.js unicode

Answer №1

You can use Intl.Collator to compare strings while ignoring certain differences. With sensitivity: "base", only strings that differ in their base letters compare as unequal:

var word1 = "anunț"; // anun\u021B
var word2 = "anunţ"; // anun\u0163

var collator = new Intl.Collator("ro", { sensitivity: "base" });

console.log(word1 === word2); // false: the words are not identical
console.log(collator.compare(word1, word2) === 0); // ... but they are considered "equal enough"
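
The question is about prefix matching rather than whole-word equality, so here is a rough sketch of how the same collator might stand in for startsWith(). The helper name startsWithLoose is made up for this example, and the sketch assumes both strings can be put into NFC form so that slicing by length lines the characters up one to one.

```javascript
// Sketch only: startsWithLoose is a hypothetical helper, not a built-in.
const collator = new Intl.Collator("ro", { sensitivity: "base" });

function startsWithLoose(full, typed) {
  // NFC keeps each precomposed letter as a single code unit,
  // so a slice of equal length covers the same characters.
  const a = full.normalize("NFC");
  const b = typed.normalize("NFC");
  if (b.length > a.length) return false;
  // Compare only the part of the expected text typed so far.
  return collator.compare(a.slice(0, b.length), b) === 0;
}

const conjugation = "anun\u0163"; // database value, t with cedilla

console.log(startsWithLoose(conjugation, "anun\u021B")); // typed t with comma below
console.log(startsWithLoose(conjugation, "an"));         // partial input
console.log(startsWithLoose(conjugation, "anunx"));      // wrong letter
```

Keep in mind that "base" sensitivity also ignores case, and which accented letters count as separate base letters depends on the locale data available at runtime, so this may be looser than a conjugation drill wants.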

Answer №2

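The two characters look alike, but they are different characters. In one (ț, U+021B) the comma sits below the t with a small gap; in the other (ţ, U+0163) the mark is a cedilla that almost blends into the letter. Because they are distinct code points with different decompositions, .normalize() never turns one into the other.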
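
Since the two characters are simply different code points, one possible workaround is to fold the cedilla letters onto their comma-below counterparts before comparing. The sketch below is only an illustration: foldRomanian and CEDILLA_TO_COMMA are names invented here, and the mapping covers just the four Romanian s/t letters.

```javascript
// Hypothetical helper: map the cedilla letters to the comma-below letters.
const CEDILLA_TO_COMMA = {
  "\u015E": "\u0218", // Ş -> Ș
  "\u015F": "\u0219", // ş -> ș
  "\u0162": "\u021A", // Ţ -> Ț
  "\u0163": "\u021B", // ţ -> ț
};

function foldRomanian(text) {
  return text
    .normalize("NFC") // recombine any decomposed letters first
    .replace(/[\u015E\u015F\u0162\u0163]/g, (ch) => CEDILLA_TO_COMMA[ch]);
}

const input = "anun\u021B";      // what the keyboard produces
const comparison = "anun\u0163"; // what the database holds

console.log(input.codePointAt(4).toString(16));      // "21b"
console.log(comparison.codePointAt(4).toString(16)); // "163"

// After folding both sides, the ordinary startsWith() check works per keystroke.
console.log(foldRomanian(comparison).startsWith(foldRomanian(input))); // true
```

Folding the database values once when they are loaded (or stored) would leave the existing startsWith() logic unchanged apart from folding the typed input.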
