Pattern to identify a JSON string with Regular Expressions

Question

Pattern to identify a JSON string with Regular Expressions

Currently, I am working on developing a JSON validator from the ground up and have hit a roadblock when it comes to the string component. My original plan was to create a regex pattern that aligns with the sequence specified on JSON.org:

https://i.sstatic.net/JKW9V.gif

Here is the regex I have come up with so far:

/^\"((?=\\)\\(\"|\/|\\|b|f|n|r|t|u[0-9a-f]{4}))*\"$/

This regex successfully matches instances where there is a backslash followed by a character and an empty string. However, my dilemma lies in incorporating UNICODE characters.

Is there a regex pattern that can identify any UNICODE character excluding " or \ or control characters? Will it also detect a newline or horizontal tab?

I noticed that while the regex matches the string "\t", it does not recognize " " (four spaces meant to signify a tab). While extending the regex is an option, my hunch is that the horizontal tab is a UNICODE character.

Credit goes to Jaeger Kor for updating my regex to the following:

/^\"((?=\\)\\(\"|\/|\\|b|f|n|r|t|u[0-9a-f]{4})|[^\\"]*)*\"$/

This revised regex seems accurate, but should I be checking for control characters separately? Or is this unnecessary considering they fall under non-printable characters as per regular-expressions.info? The input being validated always originates from a textarea.

As an update, here is the finalized regex for reference:

/^("(((?=\\)\\(["\\\/bfnrt]|u[0-9a-fA-F]{4}))|[^"\\\0-\x1F\x7F]+)*")$/

javascript json regex unicode

Answer 1

Answer №1

If you have a specific question, you can create a character class for it:

# Use this to match any character except \ or "
/[^\\"]/

You can then add * at the end for zero or unlimited occurrences, or use + for one or more:

/[^\\"]*/

Another option is provided below, found on https://regex101.com/ under the library tab when searching for json:

/(?(DEFINE)
# This regex defines atomic patterns for JSON without backtracking
(?<json>(?>\s*(?&object)\s*|\s*(?&array)\s*))
(?<object>(?>\{\s*(?>(?&pair)(?>\s*,\s*(?&pair))*)?\s*\}))
(?<pair>(?>(?&STRING)\s*:\s*(?&value)))
(?<array>(?>>\[\s*(?>(?&value)(?>\s*,\s*(?&value))*)?\s*\]))
(?<value>(?>>true|false|null|(?&STRING)|(?&NUMBER)|(?&object)|(?&array)))
(?<STRING>(?>>"(?>\\(?>["\\\/bfnrt]|u[a-fA-F0-9]{4})|[^"\\\0-\x1F\x7F]+)*"))
(?<NUMBER>(?>>-?(?>0|[1-9][0-9]*)(?>.\.[0-9]+)?(?>[eE][+-]?[0-9]+)?))
)
\A(?&json)\z/x

This regex matches any valid JSON format and can be tested on the website mentioned above.

EDIT:

Link to the regex

Answer 2

If you have a specific question, you can create a character class for it:

# Use this to match any character except \ or "
/[^\\"]/

You can then add * at the end for zero or unlimited occurrences, or use + for one or more:

/[^\\"]*/

Another option is provided below, found on https://regex101.com/ under the library tab when searching for json:

/(?(DEFINE)
# This regex defines atomic patterns for JSON without backtracking
(?<json>(?>\s*(?&object)\s*|\s*(?&array)\s*))
(?<object>(?>\{\s*(?>(?&pair)(?>\s*,\s*(?&pair))*)?\s*\}))
(?<pair>(?>(?&STRING)\s*:\s*(?&value)))
(?<array>(?>>\[\s*(?>(?&value)(?>\s*,\s*(?&value))*)?\s*\]))
(?<value>(?>>true|false|null|(?&STRING)|(?&NUMBER)|(?&object)|(?&array)))
(?<STRING>(?>>"(?>\\(?>["\\\/bfnrt]|u[a-fA-F0-9]{4})|[^"\\\0-\x1F\x7F]+)*"))
(?<NUMBER>(?>>-?(?>0|[1-9][0-9]*)(?>.\.[0-9]+)?(?>[eE][+-]?[0-9]+)?))
)
\A(?&json)\z/x

This regex matches any valid JSON format and can be tested on the website mentioned above.

EDIT:

Link to the regex

Answer 3

Answer №2

Try using this regular expression, which can also handle arrays of JSON objects:

((\[[^\}]{3,})?\{s*[^\}\{]{3,}?:.*\}([^\{]+\])?)

Check out the demo here: https://regex101.com/r/aHAnJL/1

Answer 4

Try using this regular expression, which can also handle arrays of JSON objects:

((\[[^\}]{3,})?\{s*[^\}\{]{3,}?:.*\}([^\{]+\])?)

Check out the demo here: https://regex101.com/r/aHAnJL/1

Pattern to identify a JSON string with Regular Expressions

Answer №1

Answer №2

Similar questions

Incorporating a File Attachment within a JSON Structure

Secure access to an API using a certificate within a Vue.js application running on localhost

How do webpack imports behave within a ReactJS project?

Showing the date in AngularJSAngularJS can be used to

Why does tsc produce a compiled file that throws an exception when executed, while ts-node successfully runs the TypeScript file without any issues?

Node.js application experiences a delay when calling Mongoose Model.save()

Guide on enabling users to input slide number and URL address to load content using Ajax

Simplify your code with promises in JavaScript

What is the connection between tsconfig.json and typings.json files?

What is the jQuery syntax for targeting a specific element within an object?

Tips on avoiding duplicate selection of checkboxes with Vue.js

Is it possible for me to include a variable with the xmlhttp response text?

Why am I unable to "append a list" or "update a dictionary" while parsing through the lines of a text file?

Is it possible to retrieve a local variable from a JavaScript function and utilize it outside of its

Top method for identifying "abandoned words" within text that has been wrapped

Step-by-step guide on redirecting a page using AJAX while also sending data to a controller

What is the best way to make this eerie javascript script communicate with my webpage or another jquery file?

Adding npm packages to your Vue.js application

Ways to verify if a variable holds a JSON object or a string

Utilize data from a dynamically loaded component within a parent component in Vue.js