JavaScript text parsing in real-time changes

Question

I need to extract data from a webpage for scientific research. The text I want is inside a <span> tag, but conventional HTML parsing won't work because the content updates constantly, sometimes up to 10 times per second. I know it can be done, because a scientific paper I came across reports doing so.

The webpage I need to gather this data from is: . Each time a paper is downloaded, a marker appears on the map at the download's location. My goal is to collect, in real time, the city/location associated with each marker as it appears; this text is displayed beneath the map on the left side.

Questions:
1) How can I parse this constantly changing text in real time, given that it is generated dynamically by JavaScript code? I have some experience with webpage parsing, but handling fast live text updates is new territory for me.

2) Since speed matters for both parsing and writing this data, which programming language would be best suited to the project? I intend to store the extracted data in an SQL database. I would prefer Python, provided robust libraries are available for this purpose.

Thank you in advance for any guidance or recommendations you may have.

javascript sql parsing real-time data-mining

Answer 1

The page appears to load its map data through a JSON request. If you have the necessary authorization (check the site's copyright notice), you can request the raw data directly from that URL instead of extracting it from the rendered map.

$.getJSON('/ip2location/lookupMulti.php', { "rand": Math.random() }, function(data) {
    // Each entry in the response describes one marker.
    for (var i = 0; i < data.length; i++) {
        var lat = data[i].lat;
        var lng = data[i].lng;
        var name = data[i].name;
    }
    // Additional code...
});

Keep in mind that many companies restrict frequent requests to their servers, whether it's through loading the main page or accessing lookupMulti.php. Without proper authorization, your IP address may be banned swiftly.
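
As a rough illustration of calling the endpoint yourself, here is a minimal sketch that polls at a fixed interval and flattens each response into rows ready for a database insert. Only the field names (lat, lng, name) come from the snippet above. The function names, the injected fetchJson callback, and the polling interval are hypothetical, and whether each response contains only new markers or repeats earlier ones is not known from the page, so deduplication is left out.

```javascript
// Sketch only: lat, lng and name are taken from the snippet above;
// toRows, startPolling and fetchJson are hypothetical names.

// Convert one JSON response into flat rows, skipping malformed entries.
function toRows(data, receivedAt) {
  if (!Array.isArray(data)) return [];
  return data
    .filter((d) => d && typeof d.name === 'string')
    .map((d) => ({
      name: d.name,
      lat: Number(d.lat),
      lng: Number(d.lng),
      receivedAt,
    }));
}

// Poll at a fixed interval and hand each batch of rows to onRows.
// fetchJson is injected so the loop can be exercised without a network.
// Returns a function that stops the polling loop.
function startPolling(fetchJson, onRows, intervalMs) {
  let stopped = false;

  async function tick() {
    if (stopped) return;
    try {
      const data = await fetchJson();
      onRows(toRows(data, new Date().toISOString()));
    } catch (err) {
      // Keep polling after a failed request instead of crashing.
      console.error('poll failed:', err.message);
    }
    if (!stopped) setTimeout(tick, intervalMs);
  }

  tick();
  return () => { stopped = true; };
}
```

In a browser or a recent Node.js, fetchJson could be something like `() => fetch(url).then((r) => r.json())`, with url pointing at lookupMulti.php on the site in question. Choose an interval the site owner has agreed to, and write each row to the SQL database with a parameterized INSERT rather than string concatenation.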
