When attempting to access a webpage using a GET request, a string is returned but unfortunately I

Question

As part of a project to build a custom Google Chrome extension, I ran into an interesting problem. When I perform a GET request on the following URL:

https://www.rightmove.co.uk/property-for-sale/find.html?locationIdentifier=REGION%5E27675&maxBedrooms=2&minBedrooms=2&sortType=6&propertyTypes=&mustHave=&dontShow=&furnishTypes=&keywords=

I receive the page's HTML as a string, as expected. The website I am accessing does not provide an API, and web scraping is not an option due to certain constraints.

The issue arises when I try to split the returned string on a specific marker, namely bis_skin_checked. Despite this term being present in the string, split() returns an array with only one element, which indicates that no match was found. I have tried removing spaces and line breaks, but nothing has worked so far. Below is the code for my GET request:

function getNewPage(url) {
    let returnedValue = fetch(url, {
        method: 'GET',
        headers: {
            'Content-Type': 'text/html',
        },
    })
    .then(response => response.text())
    .then(text => {
        return text
    })

    return returnedValue
}

I then resolve the promise returned by getNewPage():

let newURL = getNewPage(currentUrl) // Promise that resolves to the new page content

newURL.then(function(value) { // Resolve the promise and perform desired actions
    console.log(value.split('bis_skin_checked').length)
})

I then proceed to manipulate the retrieved string, which resembles the data shown in the image accessed via the following link (as direct text extraction is not feasible):

Image Link To API Request result

javascript google-chrome-extension

Answer 1

If you want to retrieve home data for specific search criteria, there is a more efficient approach than scraping raw HTML. By inspecting the site's network requests, you can see that the search results are available from a JSON endpoint, and you can call that endpoint directly with your own parameters.

The getHomes() function below accepts an object of parameters. Any value you pass overrides the matching default, so you can start from the defaults and customize the request for different searches.
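
As a rough illustration of that override behaviour, the sketch below isolates the merging step in a standalone helper. `mergeParams` is a hypothetical name and is not part of the extension; it assumes, as the loop in background.js does, that only keys already present in the defaults can be overridden and that empty or missing override values are ignored.

```javascript
// Hypothetical helper (not part of the original extension): merges caller
// overrides into a copy of the defaults, keeping only known keys.
function mergeParams(defaults, overrides = {}) {
    const merged = { ...defaults };
    for (const key of Object.keys(defaults)) {
        // Same truthiness check as in getHomes(): falsy overrides are skipped.
        if (overrides[key]) merged[key] = overrides[key];
    }
    return merged;
}

// Shortened set of defaults, for illustration only.
const defaults = { minBedrooms: "1", maxBedrooms: "2", index: "0" };
const merged = mergeParams(defaults, { minBedrooms: "2", unknownKey: "x" });
console.log(merged);
```

Here `minBedrooms` is replaced by the caller's value, the other defaults are kept, and `unknownKey` is dropped because it is not one of the default keys.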

To try it, install the extension below (manifest.json and background.js) and run the getHomes() function from the service worker.

--- manifest.json ---
{
    "name": "UK Housing - Stackoverflow",
    "description": "Example for how to make network requests and mimic them in background.js to avoid web scraping raw text",
    "version": "1.0.0",
    "manifest_version": 3,
    "background": {
        "service_worker": "background.js"
    },
    "host_permissions": [
        "*://*.rightmove.co.uk/*"
    ]
}

--- background.js ---
async function getHomes(passedParams) {

    const newParams = passedParams ? passedParams : {}; // set to an empty object if no new params passed - avoid error in object.entries loop.

    var params = {
        "locationIdentifier": "REGION%5E27675",
        "maxBedrooms": "2",
        "minBedrooms": "1",
        "numberOfPropertiesPerPage": "25",
        "radius": "0.0",
        "sortType": "6",
        "index": "0",
        "viewType": "LIST",
        "channel": "BUY",
        "areaSizeUnit": "sqft",
        "currencyCode": "GBP",
        "isFetching": "false"
    }

    Object.entries(params).forEach(([key, value]) => {
        if (newParams[key]) params[key] = newParams[key];
    });

    const rightMoveAPISearch = `https://www.rightmove.co.uk/api/_search?
        locationIdentifier=${params['locationIdentifier']}
        &maxBedrooms=${params['maxBedrooms']}
        &minBedrooms=${params['minBedrooms']}
        &numberOfPropertiesPerPage=${params['numberOfPropertiesPerPage']}
        &radius=${params['radius']}
        &sortType=${params['sortType']}
        &index=${params['index']}
        &viewType=${params['viewType']}
        &channel=${params['channel']}
        &areaSizeUnit=${params['areaSizeUnit']}
        &currencyCode=${params['currencyCode']}
        &isFetching=${params['isFetching']}
    `.replace(/\s/g, '');

    // Request the search results and parse the JSON body.
    const response = await fetch(rightMoveAPISearch, { method: "GET" });
    const data = await response.json();
    
    if (data.resultCount) {
        console.log('\x1b[32m%s\x1b[0m', 'Request successful! Result count: ', parseInt(data.resultCount));
        console.log('All data: ', data);
        console.log('Properties: ', data.properties);
    }
    else console.log('\x1b[31m%s\x1b[0m', `Issue with the request:`, data)

    return data

}
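
The URL assembly above can also be written as a join over the parameter object, which avoids repeating every key in the template string. This is only a sketch: `buildSearchUrl` and `fetchJson` are hypothetical names, not part of the extension. The values are inserted without further encoding because the default `locationIdentifier` is already percent-encoded, and passing it through `URLSearchParams` would encode the `%` a second time.

```javascript
// Hypothetical helper: joins key=value pairs onto the endpoint used in getHomes().
// Values are assumed to be URL-safe already (e.g. "REGION%5E27675").
function buildSearchUrl(params) {
    const query = Object.entries(params)
        .map(([key, value]) => `${key}=${value}`)
        .join('&');
    return `https://www.rightmove.co.uk/api/_search?${query}`;
}

// Hypothetical wrapper: throws on non-2xx responses instead of trying to
// parse an error page as JSON.
async function fetchJson(url) {
    const response = await fetch(url, { method: 'GET' });
    if (!response.ok) {
        throw new Error(`Request failed with status ${response.status}`);
    }
    return response.json();
}

const exampleUrl = buildSearchUrl({
    locationIdentifier: 'REGION%5E27675',
    minBedrooms: '2',
    maxBedrooms: '2',
});
console.log(exampleUrl);
```

Inside getHomes(), the result of `buildSearchUrl(params)` could be passed to `fetchJson()` in place of the template string and the bare `fetch` call, so that a failed request surfaces as an error rather than as a JSON parse failure.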

I trust this explanation proves beneficial. Feel free to reach out if you have any additional inquiries.

