Groupby 2 columns

tankwell · June 21, 2024, 2:47pm

I would like to groupby 2 columns of a big table in order to get a histogram / count for the combination. The table is a bit large, so when I choose to groupby the two columns in the dashboard it crashes (even when I filter the presented columns to 3 colmns, as requested)

2 possible directions I thought to go:

Filtering by the search expression the columns. For instance

runs.summary["table"][["c1","c2","c3"]]

This is throwing a syntax error, and I could not find a proper syntax to choose multiple columns at once to present from a large table.

Keep in that direction groupby
but to groupby multiple values.
Again, it worked well for a single column

runs.summary[“all_predictions”].table.rows.concat.groupby((row) => row[“c1”]).map((row, index) => {c1: row.groupkey, c2: row.groupkey, c3: row.groupkey, count: row.count}).

but it not showing all the combinations (I guess it’s only grouping the first column, and taking the first value of the other columns)

What do you suggest me to do?
p.s I would like to keep this as a dashboard / table / list but not as artifact query, since It’s supposed to be eventually within one of my reports. You may assume the run set is frozen (so that would not be too heavy)

Thanks!

fmamberti-wandb · June 27, 2024, 10:05am

Hi @tankwell. Thank you for reaching out with your question.

Would you mind sharing the URL for the Workspace and the table you are trying to groupby 2 columns for us to have a look at and investigate?

A query like the following:

runs.summary["table_name"].table.rows.concat.groupby((row) => {"c1": row["c1"], "c2": row["c2"]}).map((row, index) => {c1: row.groupkey["c1"], c2: row.groupkey["c2"], c3_count row["c3"].count})

replacing c1,c2,c3 with your columns names should render the table you are looking for.

tankwell · June 28, 2024, 2:38pm

Thanks!

I do get a running query, but I think that not exactly what I was looking for.

the general purpose was to go over all the possible combinations

and to show it as a paged table accross runs or to present the avg. counts across all runs.
The table is a bit heavy, so I thought that by explicitly typing the query and not using the ‘group_by’ option would be more efficient

fmamberti-wandb · July 9, 2024, 9:54am

Hi @tankwell , having a look a the table you shared , you should be able to re-created directly typing the following query:

runs.summary["all_predictions"].table.rows.concat.groupby((row) => {is_the_person_drinking_pred: row["is_the_person_drinking_pred"], is_the_person_eating_pred: row["is_the_person_eating_pred"], is_drinking_bool: row["is_drinking_bool"]}).map((row, index) => {is_the_person_drinking_pred: row.groupkey["is_the_person_drinking_pred"], is_the_person_eating_pred: row.groupkey["is_the_person_eating_pred"],is_drinking_bool: row.groupkey["is_drinking_bool"], Count: row["is_drinking_bool"].count})

The table may still take a couple of seconds to load depending on how many Runs you have selected.

tankwell · July 9, 2024, 11:55am

Thanks!

Now I get the overall count and not paged table with the correct count

fmamberti-wandb · July 10, 2024, 8:50am

Hi @tankwell , would you be able to share a screenshot of the table you are getting or a URL to a saved view with both the tables configured?
Are the same Runs being selected in the Workspace (if more than 100 Runs are visualised, only data from 100 of them will be queried for the table)?

fmamberti-wandb · July 12, 2024, 1:25pm

Hi @tankwell , I wanted to follow up on this request. Please let us know if we can be of further assistance or if your issue has been resolved.

tankwell · July 16, 2024, 9:23am

The issue was resolved with the map + count
I think that the issue was that my original query attempts were not very good
Thanks for supplying the right way to do it!

fmamberti-wandb · July 16, 2024, 9:50am

Great to hear this is now working as you intended! I will mark this as solved, please feel free to reach out in the future for any further questions.

Topic		Replies	Views
The difference between these two ways of joining tables in query panel? W&B Help dashboard , wandb	0	15	October 27, 2024
Plot confusion matrix v2 groupby some config field W&B Help wandb	3	216	May 2, 2024
Show single lines in groups W&B Help	5	356	June 15, 2022
How to create a panel that reports number of successful runs per group W&B Help dashboard	3	931	April 20, 2022
How to Join queries in Query Panel W&B Help	0	20	March 2, 2025

Groupby 2 columns

Related topics