RegEx in JavaScript to identify and match the innerHTML property of all elements

Question

RegEx in JavaScript to identify and match the innerHTML property of all elements

I am currently in the process of developing a Chrome extension that needs to identify specific pages within a website, including the Log In / Sign In page, the Sign Up / Register page, the About page, and the Contact Us page.

My approach involves obtaining a list of elements on the page, which I have already accomplished. Now, I need to examine the innerHTML of each element to ensure it is a leaf node in the DOM and contains a portion of the keyword. I am attempting to accomplish this using a regex. Although I have successfully created a regex that extracts content between start or end tags of an element, it does not capture the innerHTML. Below is my progress so far, focusing on the About page:

var list = document.body.getElementsByTagName("*");
var aboutElement = /^[^<.+>].*About.*[^(<.+>]$/i;

for (var i = 0; i <= list.length; i++) {
    if ((aboutElement.test(list[i].innerHTML)) || (aboutElement.test(list[i].alt))) {
        list[i].click();
    }
}

I am seeking guidance on how to modify the regex pattern to only match leaf nodes (nodes without child nodes) and avoid capturing content within start or end tags. I suspect that the current regex pattern may match everything in the innerHTML due to the .* section, so adjustments might be necessary. Any assistance or suggestions would be highly appreciated!

javascript regex google-chrome-extension

Answer 1

Answer №1

Big shoutout to the helpful suggestions in the comments that steered me towards solving the issue. By utilizing the .textContent method and making adjustments to the regex as detailed below, I was able to successfully resolve the problem at hand.

var elements = document.body.getElementsByTagName("*");
var targetElement = /^(.*?\s*(\bTarget\b)[^$]*)$/i;

for (var j = 0; j <= elements.length; j++) {
    if ((targetElement.test(elements[j].textContent)) || (targetElement.test(elements[j].alt))) {
        elements[j].click();
    }
}

Answer 2

Big shoutout to the helpful suggestions in the comments that steered me towards solving the issue. By utilizing the .textContent method and making adjustments to the regex as detailed below, I was able to successfully resolve the problem at hand.

var elements = document.body.getElementsByTagName("*");
var targetElement = /^(.*?\s*(\bTarget\b)[^$]*)$/i;

for (var j = 0; j <= elements.length; j++) {
    if ((targetElement.test(elements[j].textContent)) || (targetElement.test(elements[j].alt))) {
        elements[j].click();
    }
}

RegEx in JavaScript to identify and match the innerHTML property of all elements

Answer №1

Similar questions

Node.js powered file uploading on the Heroku platform

How can I retrieve the parent scope in an Angular UI nested named view?

Eliminating the Skewed Appearance of Text in Input Fields Using CSS

Is it possible to enhance an external class with a non-static method using prototypes?

I created a customized version of jQuery tabs, but I am looking for an external link to display the tab content and to style the original navigation

One required positional argument is needed in `update_one()`: 'update' is missing

Using the hover event in a jQuery plugin: A step-by-step guide

Aligning images with absolute CSS to text that is wrapping positions the images perfectly alongside the text content

The FormData object appears to be blank, even though it was supposed to be populated when attempting to send a PDF file using a multipart FormData POST request in Cypress

Adding click functionality to dynamically generated list items in jQuery and HTML

Jquery Query: Is it possible to incorporate variables into CSS properties?

JQuery isn't functioning properly on dynamically generated divs from JavaScript

Utilizing Firebase's real-time database for executing specific conditional queries

What is the correct syntax for comparing -1, 0, and 1 in JavaScript?

Initiate Ant Design select reset

Adding Firebase Authentication to my AngularJS website

Wildcard for keys in JavaScript objects and JSON

What is the best way to retrieve all the listed TV and film ratings in descending order? Using Django

JQuery class for swapping elements upon scrolling

Load information into a different entity