I have a cluster on a local server on my PC that holds approximately 240,000 entries of transaction data.
Cust_ID stands for Customer ID.
https://i.sstatic.net/5g65l.png
Each file contains transactions made by different customers, with a total of about 3000 customers included.
The number of transactions varies for each customer.
Each customer has a unique, randomly assigned Cust_ID, and the IDs appear in no particular order within the cluster.
How can I retrieve all transactions made by the first 50 customers without manually entering each Customer ID?
By "first", I mean the first distinct Cust_IDs in the order in which they appear in the cluster.
For example, if the entries are as follows:
{
Cust_ID: "1001",
Name: "ABC",
transaction: "1"
}
{
Cust_ID: "1001",
Name: "ABC",
transaction: "3"
}
{
Cust_ID: "1001",
Name: "ABC",
transaction: "6"
}
{
Cust_ID: "1092",
Name: "BCD",
transaction: "23"
}
{
Cust_ID: "1092",
Name: "BCD",
transaction: "12"
}
{
Cust_ID: "2104",
Name: "CDE",
transaction: "234"
}
{
Cust_ID: "1004",
Name: "DEF",
transaction: "3"
}
{
Cust_ID: "1551",
Name: "ASD",
transaction: "54"
}
/* The transaction details of the first 3 customers would be: */
{
Cust_ID: "1001",
Name: "ABC",
transaction: "1"
}
{
Cust_ID: "1001",
Name: "ABC",
transaction: "3"
}
{
Cust_ID: "1001",
Name: "ABC",
transaction: "6"
} //customer 1
{
Cust_ID: "1092",
Name: "BCD",
transaction: "23"
}
{
Cust_ID: "1092",
Name: "BCD",
transaction: "12"
} //customer 2
{
Cust_ID: "2104",
Name: "CDE",
transaction: "234"
} //customer 3
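
To make the expected behaviour concrete, here is a minimal sketch in plain Python of the logic I am after. It assumes the entries have already been loaded, in cluster order, into a list of dictionaries; the function name `first_n_customers` and the variable `entries` are hypothetical names used only for illustration, and this is not tied to any particular database driver.

```python
def first_n_customers(entries, n):
    """Return every entry belonging to the first n distinct Cust_IDs,
    where "first" means order of first appearance in `entries`."""
    # Pass 1: collect the first n distinct IDs (dicts preserve insertion order).
    first_ids = {}
    for entry in entries:
        if len(first_ids) == n:
            break
        first_ids.setdefault(entry["Cust_ID"], None)

    # Pass 2: keep every transaction whose customer is in that set,
    # including any that appear later in the data.
    return [entry for entry in entries if entry["Cust_ID"] in first_ids]


# Sample data copied from the example above (assumed to be in cluster order).
entries = [
    {"Cust_ID": "1001", "Name": "ABC", "transaction": "1"},
    {"Cust_ID": "1001", "Name": "ABC", "transaction": "3"},
    {"Cust_ID": "1001", "Name": "ABC", "transaction": "6"},
    {"Cust_ID": "1092", "Name": "BCD", "transaction": "23"},
    {"Cust_ID": "1092", "Name": "BCD", "transaction": "12"},
    {"Cust_ID": "2104", "Name": "CDE", "transaction": "234"},
    {"Cust_ID": "1004", "Name": "DEF", "transaction": "3"},
    {"Cust_ID": "1551", "Name": "ASD", "transaction": "54"},
]

result = first_n_customers(entries, 3)
for entry in result:
    print(entry)
```

With the sample data this returns the six entries for Cust_IDs 1001, 1092 and 2104. What I am looking for is an equivalent query that runs against the cluster itself, so that I do not have to pull all 240,000 entries into memory first.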