Retrieving hashtags from a text

Question

Retrieving hashtags from a text

If I had a string like this

var feedback =  "Yum! #yummy #delicious at #CZ"

Is there an efficient way to extract all the hashtags from the string variable?

I attempted using JavaScript's split() method, but it seems cumbersome as I have to repeatedly split each new string generated from the initial one. Are there any simpler alternatives for accomplishing this task?

javascript hashtag

Answer 1

Answer №1

To locate instances of a hashtag followed by any non-space characters, you can employ a simple regular expression.

"Exploring #nature and #wildlife in the #jungle".match(/#\w+/g)
// Result: ["#nature", "#wildlife", "#jungle"]

Answer 2

To locate instances of a hashtag followed by any non-space characters, you can employ a simple regular expression.

"Exploring #nature and #wildlife in the #jungle".match(/#\w+/g)
// Result: ["#nature", "#wildlife", "#jungle"]

Answer 3

Answer №2

To find alphabetic characters in a string, use the following regex code. Feel free to customize it for other characters:

const result = myString.match(/#[a-z]+/gi);

Answer 4

To find alphabetic characters in a string, use the following regex code. Feel free to customize it for other characters:

const result = myString.match(/#[a-z]+/gi);

Answer 5

Answer №3

Are you interested in Unicode or hashtags in languages other than English?

"Mmmm #yummy #donut at #CZ #中文 #.dou #。#？#♥️ #にほ".match(/#[\p{L}]+/ugi)
=> (5) ["#yummy", "#donut", "#CZ", "#中文", "#にほ"]

This concept is further explained in the following answer:

\p{L} matches unicode characters

u the PCRE_UTF8 modifier, this modifier turns on additional functionality of PCRE that is incompatible with Perl.

Answer 6

Are you interested in Unicode or hashtags in languages other than English?

"Mmmm #yummy #donut at #CZ #中文 #.dou #。#？#♥️ #にほ".match(/#[\p{L}]+/ugi)
=> (5) ["#yummy", "#donut", "#CZ", "#中文", "#にほ"]

This concept is further explained in the following answer:

\p{L} matches unicode characters

u the PCRE_UTF8 modifier, this modifier turns on additional functionality of PCRE that is incompatible with Perl.

Answer 7

Answer №4

For those who value easy reading:

Extract hashtags from your text using: yourText.split(' ').filter(v=> v.startsWith('#'))

This code will output: ["#awesome", "#coffee", "#NYC"]

Answer 8

For those who value easy reading:

Extract hashtags from your text using: yourText.split(' ').filter(v=> v.startsWith('#'))

This code will output: ["#awesome", "#coffee", "#NYC"]

Answer 9

Answer №5

Here is a simple regular expression that allows emojis and numbers in hashtags without any white space:

"Mmmm #yummy #donut at #CZ#efrefg #:) #cool😎#r234#FEGERGR#fegergr".match(/#[^\s#]*/gmi);
// => ["#yummy", "#donut", "#CZ", "#efrefg", "#:)", "#cool😎", "#r234", "#FEGERGR", "#fegergr"]

One downside is that this regex may include punctuation at the end of hashtags:

"Mmmm #yummy.#donut#cool😎#r234#FEGERGR;#fegergr".match(/#[^\s#]*/gmi);
// => ["#yummy.", "#donut", "#cool😎", "#r234", "#FEGERGR;", "#fegergr"]

However, you can customize the regex to exclude certain characters, such as punctuation marks:

"Mmmm #yummy.#donut#cool😎#r234#FEGERGR;#fegergr".match(/#[^\s#\.\;]*/gmi);
// => ["#yummy", "#donut", "#cool😎", "#r234", "#FEGERGR", "#fegergr"]

Answer 10

Here is a simple regular expression that allows emojis and numbers in hashtags without any white space:

"Mmmm #yummy #donut at #CZ#efrefg #:) #cool😎#r234#FEGERGR#fegergr".match(/#[^\s#]*/gmi);
// => ["#yummy", "#donut", "#CZ", "#efrefg", "#:)", "#cool😎", "#r234", "#FEGERGR", "#fegergr"]

One downside is that this regex may include punctuation at the end of hashtags:

"Mmmm #yummy.#donut#cool😎#r234#FEGERGR;#fegergr".match(/#[^\s#]*/gmi);
// => ["#yummy.", "#donut", "#cool😎", "#r234", "#FEGERGR;", "#fegergr"]

However, you can customize the regex to exclude certain characters, such as punctuation marks:

"Mmmm #yummy.#donut#cool😎#r234#FEGERGR;#fegergr".match(/#[^\s#\.\;]*/gmi);
// => ["#yummy", "#donut", "#cool😎", "#r234", "#FEGERGR", "#fegergr"]

Answer 11

Answer №6

If you're looking to include characters from any alphabet in hashtags, consider using this method:

let text = "улетные #выходные // #holiday in the countryside";
const hashtags = []
if (text.length) {
    let preHashtags = text.split('#')
    let i = 0;
    if (text[0] !== '#') i++ 

    for (null; i < preHashtags.length; i++) {
        let item = preHashtags[i]
        hashtags.push(item.split(' ')[0]) 
        // String.prototype.split() is necessary to filter out non-hashtag related strings
    }
}


console.log(hashtags) // outputs [ 'выходные', 'holiday' ]

We use if (text[0] !== '#') i++ to check if the first letter in the "text" string is not a '#'. If it's not, we skip iterating through the first element in the preHashtags Array. Otherwise, if our text string starts with a hashtag, we need to process it.

Remember to validate the resulting hashtags array. The use of null in the for loop is purely for readability purposes; you could also use

for (;i < preHashtags.length; i++)

This method ensures that all symbols, including those from non-Latin alphabets, are included, making it beginner-friendly and easy to understand. In terms of performance, it excels in Chrome (and similar browsers like node.js), but performs slightly less efficiently in Firefox and Safari, as shown in this test: .

Consider your platform - whether running code in node.js or a browser, especially if targeting MobileSafari users, when deciding on this approach.

Answer 12

If you're looking to include characters from any alphabet in hashtags, consider using this method:

let text = "улетные #выходные // #holiday in the countryside";
const hashtags = []
if (text.length) {
    let preHashtags = text.split('#')
    let i = 0;
    if (text[0] !== '#') i++ 

    for (null; i < preHashtags.length; i++) {
        let item = preHashtags[i]
        hashtags.push(item.split(' ')[0]) 
        // String.prototype.split() is necessary to filter out non-hashtag related strings
    }
}


console.log(hashtags) // outputs [ 'выходные', 'holiday' ]

We use if (text[0] !== '#') i++ to check if the first letter in the "text" string is not a '#'. If it's not, we skip iterating through the first element in the preHashtags Array. Otherwise, if our text string starts with a hashtag, we need to process it.

Remember to validate the resulting hashtags array. The use of null in the for loop is purely for readability purposes; you could also use

for (;i < preHashtags.length; i++)

This method ensures that all symbols, including those from non-Latin alphabets, are included, making it beginner-friendly and easy to understand. In terms of performance, it excels in Chrome (and similar browsers like node.js), but performs slightly less efficiently in Firefox and Safari, as shown in this test: .

Consider your platform - whether running code in node.js or a browser, especially if targeting MobileSafari users, when deciding on this approach.

Answer 13

Answer №7

Parse the content and filter out any tags that start with a hashtag.

Answer 14

Parse the content and filter out any tags that start with a hashtag.

Retrieving hashtags from a text

Answer №1

Answer №2

Answer №3

Answer №4

Answer №5

Answer №6

Answer №7

Similar questions

Discover how to access all of the response headers from an HTTP request in Angular

Choose Selectize.js to auto-populate inputs

Having trouble with ng-click not correctly updating values within ng-repeat

How can I prevent a hyperlinked element from being clicked again after it has been clicked using JavaScript or jQuery in PHP

Extracting Object Properties in JavaScript with React

Assistance requested with Javascript for an HTML dropdown menu

The tension settings in Chart.JS appear unusual

Tips for choosing the injected HTML's List Element (li) while ensuring it remains concealed

How to arrange three buttons in a row using CSS styling

The callback function in AngularJS filters

Is it possible to deactivate an anchor tag based on the result of a conditional statement that returns a string?

Setting up KCFinder integration in TinyMCE

Anticipate await and fulfill promises

React blank state - State remains undefined after calling setState

What is the method for loading a subcategory based on the category by invoking a jQuery function within the <td> element of a JavaScript function that adds rows dynamically?

Invalid PDF File - Unable to Complete Download via $http

Django will not be replacing the outdated image

Encountering Typescript errors when trying to destructure a forEach loop from the output of

PHP/MySQL clarification regarding time slots

Tips for avoiding the persistence of an old array on the screen after refreshing and showing the new, updated array