Locate the position of a substring within a Uint8Array

Question

Locate the position of a substring within a Uint8Array

I'm working with a Uint8Array that contains the content of a PDF file. My goal is to locate a specific string within this array in order to insert additional content at that particular position.

My current approach involves converting the Uint8Array into a string and then searching for the desired string within that newly created string.

Here's a snippet of my code:

    const pdfStr = new TextDecoder('utf-8').decode(array);
    
    // find ByteRange
            const byteRangePos = this.getSubstringIndex(pdfStr, '/ByteRange [', 1);
            if (byteRangePos === -1) {
                throw new Error(
                    'Failed to locate ByteRange.'
                );
            }
    
           getSubstringIndex = (str, substring, n) => {
            let times = 0, index = null;
    
            while (times < n && index !== -1) {
                index = str.indexOf(substring, index + 1);
                times++;
            }
    
            return index;
        }

array = this.updateArray(array, (byteRangePos + '/ByteRange '.length), byteRange);

The issue I'm facing is that utf-8 characters are encoded in variable-length bytes (ranging from 1 to 4 bytes). As a result, the length of the string I obtain is shorter than the actual length of the UInt8Array. This discrepancy causes the index derived from the string search to not align with where the '/ByteRange' string exists in the UInt8Array, leading to incorrect insertion placement.

Is there a method to obtain a 1-byte string representation of the UInt8Array, similar to ASCII?

javascript arraybuffer uint8array

Answer 1

Answer №1

My solution involved making a modification:

I updated <const pdfStr = new TextDecoder('utf-8').decode(array);>

to

<const pdfStr = new TextDecoder('ascii').decode(array);>

Answer 2

My solution involved making a modification:

I updated <const pdfStr = new TextDecoder('utf-8').decode(array);>

to

<const pdfStr = new TextDecoder('ascii').decode(array);>

Locate the position of a substring within a Uint8Array

Answer №1

Similar questions

Displaying unique input values with ng-model

The angular controller function is failing to set $scope.value

Enhancing the session helper in Silex with additional values

Issues with navigating sliders

Dividing a string in JavaScript to generate fresh arrays

Ways to fix the TypeError that occurs when attempting to convert undefined or null to an object by using Function.keys

Creating TypeScript models from a JSON response in React components

Sometimes, it feels like TypeScript's async await does not actually wait for the task to complete before moving on

Node.js - CSRF Protection Token Undefined

Using AngularJS to pass radio button value to a $http.post request

Issue: TypeError - The function addTicket is not recognized as a valid function. Utilize the useState hook within the modal component

The application is resetting when the "$http" method accesses the initial ADAL "protected" URL for the first time

The absence of AudioPlayer may be responsible for the compilation failure on Vercel

What's the CSS equivalent of Java's window.pack() method?

Tips for ensuring the drop down button remains selected

A step-by-step guide on displaying a loading spinner during the retrieval and assembly of a component framework (Astro Island) with Vue and AstroJS

Using dynamic tag names within React JSX can greatly enhance the flexibility and

Exploring Angular Factory: Creating the getAll method for accessing RESTful APIs

Maintaining the $index value across all pagination pages in AngularJS while matching it with the items in the data list

Updating the load context variable in Django template after making changes via an AJAX call: a step-by-step guide