Parse a string containing a selection of markdown by using regular expressions for tokenization

Question

Parse a string containing a selection of markdown by using regular expressions for tokenization

I need to tokenize a string using a regular expression that consists of markdown formatting. Specifically, bold text is denoted by **text** and italic text is denoted by _text_.

To tokenize the string "a _b_ c **d** _e", it should be split into ['a ', 'b', ' c', 'd', '_e'] (Note: each match needs to be stored in its own group).

I have successfully captured bold and italic groups using the regex /_(.+?)_|\*\*(.+?)\*\*/g, but I am trying to expand this regex to include all other text as well. Essentially, I want to capture everything inside **, everything inside _, and the rest of the text.

I attempted to add another case with /_(.+?)_|\*\*(.+?)\*\*|(.*)/g, but this ends up capturing the previous cases as well.

(A quick way to test this in the browser console: Array.from('a _b_ c **d** _e'.matchAll(/_(.+?)_|\*\*(.+?)\*\*/g)))

javascript regex markdown

Answer 1

Answer №1

Utilize the unique unicode marker

/(?:\*{2})?_(\p{L}+)_(?:\*{2})?|(?:_)?\*{2}(\p{L}+)\*{2}(?:_)?|([*_\p{L}]+)/gu

:

displayResult(Array.from('x _y_ z **e** _f'.matchAll(/(?:\*{2})?_(\p{L}+)_(?:\*{2})?|(?:_)?\*{2}(\p{L}+)\*{2}(?:_)?|([*_\p{L}]+)/gu)))

Answer 2

Utilize the unique unicode marker

/(?:\*{2})?_(\p{L}+)_(?:\*{2})?|(?:_)?\*{2}(\p{L}+)\*{2}(?:_)?|([*_\p{L}]+)/gu

:

displayResult(Array.from('x _y_ z **e** _f'.matchAll(/(?:\*{2})?_(\p{L}+)_(?:\*{2})?|(?:_)?\*{2}(\p{L}+)\*{2}(?:_)?|([*_\p{L}]+)/gu)))

Parse a string containing a selection of markdown by using regular expressions for tokenization

Answer №1

Similar questions

"Regarding compatibility with different browsers - IE8, Firefox3.6, and Chrome: An inquiry on

Selecting a particular item in a list depending on time using JavaScript, jQuery, or Angular

Show only half of the Google Charts

How can one use PHP to locate the strings that fall between the specified strings?

Make the download window appear automatically when downloading a file

Invoking a Jquery function through a GridView link

Efficient ways to temporarily store form data in React JS

Hiding both clear buttons in the MUI Autocomplete component on Chromium: A simple guide

Is searching for duplicate entries in an array using a specific key?

Utilize React's Context Provider to centrally manage all state while incorporating async calls

Verify optional chaining support in Angular app for browsers

The positioning of CSS arrows using the "top" attribute is not relative to the top of the page when using absolute values

What is the best way to retrieve an error message within a sentence?

What is the proper way to incorporate an if statement within a return statement in React components?

Increasing values in Mongoose using $inc can be done by following these steps

Using AngularJS and Web API to generate a dropdown menu from a one-to-many relationship

After cloning the variable from props, the Vue 3 Composition API variable becomes undefined

Verifying picture quality prior to transferring to a remote server

What is the definition of a type that has the potential to encompass any subtree within an object through recursive processes?

The Autocomplete feature from the @react-google-maps/api component seems to be malfunctioning as it returns