calculate the count field dynamically based on $bucket boundaries in a MongoDB aggregate operation

Question

calculate the count field dynamically based on $bucket boundaries in a MongoDB aggregate operation

I'm currently utilizing the Mongo aggregate framework and have a collection structured like this:

[
  {
    _id: 123,
    name: "john",
    age: 30,
    fruit: "apple",
    
  },
  {
    _id: 345,
    name: "moore",
    age: 45,
    fruit: "mango",
    
  },
  {
    _id: 545,
    name: "carl",
    age: 30,
    fruit: "grape",
    
  },
  {
    _id: 96,
    name: "shelby",
    age: 25,
    fruit: "apple",
    
  },
  {
    _id: 86,
    name: "loris",
    age: 48,
    fruit: "mango",
    
  },
  {
    _id: 76,
    name: "carl",
    age: 55,
    fruit: "grape"
  }
]

My goal is to query and create a pipeline that returns the count of specific fruits falling under certain $bucket boundaries. The desired result should look like this...

[
  {
    "_id": Specific_Boundary,
    "userCount": Number_Of_Users_Falling_Under,
    "fruitsLie": [
                    {fruit_names_of_users_in_this_boundary : fruit_counts},
                  ]
  },
  {
    "_id": 0,
    "userCount": 3,
    "fruitsLie": [
                    {apple: 2},
                    {grape: 1}
                  ]
  },
  {
    "_id": 40,
    "userCount": 2,
    "fruitsLie": [
                    {mango: 2}
                 ]
  },
  {
    "_id": "more than 50",
    "userCount": 1,
    "fruitsLie": [
                    {grape: 1}
                 ]
  }
]

For example, under the age of 30 we have 3 users - 2 eat apples and 1 eats grapes, so the fruitsLie field performs these calculations.

What are the various approaches available to solve this problem with specific $bucket boundaries? Please provide a detailed explanation for each stage as I am new to aggregates and still learning...

javascript mongodb aggregation-framework aggregate aggregate-functions

Answer 1

Answer №1

Here is a method to achieve the desired outcome:

db.collection.aggregate([
  {
    "$bucket": {
      "groupBy": "$age",
      "boundaries": [
        0,
        31,
        41,
        51,
        
      ],
      "default": "More than 50",
      "output": {
        "users": {
          $push: "$$ROOT"
        }
      }
    }
  },
  {
    "$unwind": "$users"
  },
  {
    "$group": {
      "_id": {
        _id: "$_id",
        fruit: "$users.fruit"
      },
      "count": {
        "$sum": 1
      },
      
    }
  },
  {
    "$group": {
      "_id": "$_id._id",
      "fruitsLie": {
        "$push": {
          "$concatArrays": [
            [],
            [
              [
                "$$ROOT._id.fruit",
                "$$ROOT.count"
              ]
            ]
          ]
        }
      },
      usersCount: {
        $sum: "$$ROOT.count"
      }
    }
  },
  {
    "$addFields": {
      "fruitsLie": {
        "$map": {
          "input": "$fruitsLie",
          "as": "item",
          "in": {
            "$arrayToObject": "$$item"
          }
        }
      }
    }
  }
])

Visit the Playground for a hands-on experience.

The query workflow includes the following steps:

Grouping documents by age using $bucket into 4 distinct buckets, (0-30), (31-40), (41-50), and (>50) while aggregating users within each bucket.
Unwinding the users array utilizing the $unwind operator.
Calculating fruit counts within each bucket through the $group stage.
Aggregating counts per bucket into the fruitsLie array with another $group operation.
Converting elements of the fruitsLie array to objects using $arrayToObject.

Answer 2

Here is a method to achieve the desired outcome:

db.collection.aggregate([
  {
    "$bucket": {
      "groupBy": "$age",
      "boundaries": [
        0,
        31,
        41,
        51,
        
      ],
      "default": "More than 50",
      "output": {
        "users": {
          $push: "$$ROOT"
        }
      }
    }
  },
  {
    "$unwind": "$users"
  },
  {
    "$group": {
      "_id": {
        _id: "$_id",
        fruit: "$users.fruit"
      },
      "count": {
        "$sum": 1
      },
      
    }
  },
  {
    "$group": {
      "_id": "$_id._id",
      "fruitsLie": {
        "$push": {
          "$concatArrays": [
            [],
            [
              [
                "$$ROOT._id.fruit",
                "$$ROOT.count"
              ]
            ]
          ]
        }
      },
      usersCount: {
        $sum: "$$ROOT.count"
      }
    }
  },
  {
    "$addFields": {
      "fruitsLie": {
        "$map": {
          "input": "$fruitsLie",
          "as": "item",
          "in": {
            "$arrayToObject": "$$item"
          }
        }
      }
    }
  }
])

Visit the Playground for a hands-on experience.

The query workflow includes the following steps:

Grouping documents by age using $bucket into 4 distinct buckets, (0-30), (31-40), (41-50), and (>50) while aggregating users within each bucket.
Unwinding the users array utilizing the $unwind operator.
Calculating fruit counts within each bucket through the $group stage.
Aggregating counts per bucket into the fruitsLie array with another $group operation.
Converting elements of the fruitsLie array to objects using $arrayToObject.

calculate the count field dynamically based on $bucket boundaries in a MongoDB aggregate operation

Answer №1

Similar questions

The issue with Express connect-flash only showing after a page refresh instead of instantly displaying on the same page needs to be addressed

Issue with retrieving the ID of a dynamically created element with jQuery

Angular animation triggered when a specific condition is satisfied

Preventing the detection of a jshint "error"

What is the best way to showcase a single <ul> list in an infinite number of columns?

Issue: Inability to scroll on div overflow section

The message "Error: Unknown custom element: <router-view> - have you properly registered the component?" is prompting for a solution

Ways to switch out event listener when button is deactivated

Ensure that the date range picker consistently shows dates in a sequential order

The function is not being executed when using $scope.$apply()

The function GetSomething() in sequelize.js is displaying inaccurate values when used in a hasOne relationship, but is showing the correct values in a BelongTo

Exporting a React parent function to a child component

The ins and outs of implementing i18n on an Angular component library

The misleading A*(A-star) algorithm inaccurately produces faulty routes and ultimately collapses

Locate the nested route within one of the child components in React Router that corresponds to a specific id

Tips for resolving the issue of "Warning: useLayoutEffect does not have any effect on the server" when working with Material UI and reactDOMServer

Learn how to update a fixed value by adding the content entered into the Input textfield using Material-UI

Uploading Files Using the Dropbox API Version 2

Guide to running JavaScript in Selenium using JavaScript and retrieving the output

Can you provide tips on identifying children with the same Kineticjs type?