Skip to content

Database
- Oracle
- SQL
C
C++
Java
Java Script
jQuery
PHP

Database
- Oracle
- SQL
C
C++
Java
Java Script
jQuery
PHP

Calculate weighted average using a pandas/dataframe

I think I would do this with two groupbys.

First to calculate the “weighted average”:

In [11]: g = df.groupby('Date')

In [12]: df.value / g.value.transform("sum") * df.wt
Out[12]:
0    0.125000
1    0.250000
2    0.416667
3    0.277778
4    0.444444
dtype: float64

If you set this as a column, you can groupby over it:

In [13]: df['wa'] = df.value / g.value.transform("sum") * df.wt

Now the sum of this column is the desired:

In [14]: g.wa.sum()
Out[14]:
Date
01/01/2012    0.791667
01/02/2012    0.722222
Name: wa, dtype: float64

or potentially:

In [15]: g.wa.transform("sum")
Out[15]:
0    0.791667
1    0.791667
2    0.791667
3    0.722222
4    0.722222
Name: wa, dtype: float64

Related Posts:

ValueError: Unknown label type: ‘continuous’
How to fix IndexError: invalid index to scalar variable
Convert pandas dataframe to NumPy array
ImportError: Missing required dependencies [‘numpy’]
Python Pandas – Missing required dependencies [‘numpy’] 1
‘DataFrame’ object has no attribute ‘sort’
‘DataFrame’ object has no attribute ‘sort’
TypeError: cannot unpack non-iterable int objec
‘DataFrame’ object has no attribute ‘sort’
What is dtype(‘O’), in pandas?
What is dtype(‘O’), in pandas?
TypeError: ‘DataFrame’ object is not callable
ValueError: ‘object too deep for desired array’
What does axis in pandas mean?
How to take column-slices of dataframe in pandas
Normalize data in pandas
Creating a Pandas DataFrame from a Numpy array: How do I specify the index column and column headers?
pandas create new column based on values from other columns / apply a function of multiple columns, row-wise
Building multi-regression model throws error: `Pandas data cast to numpy dtype of object. Check input data with np.asarray(data).`
Coalesce values from 2 columns into a single column in a pandas dataframe
Concat DataFrame Reindexing only valid with uniquely valued Index objects
Replacing Pandas or Numpy Nan with a None to use with MysqlDB
Difference between data type ‘datetime64[ns]’ and ‘

Merging two DataFrames
vectorize conditional assignment in pandas dataframe
how to sort pandas dataframe from one column
TypeError: only integer scalar arrays can be converted to a scalar index with 1D numpy indices array
ImportError: DLL load failed: The specified module could not be found
ImportError: DLL load failed: The specified module could not be found
Renaming column names in Pandas
How to reset index in a pandas dataframe? [duplicate]
Delete a column from a Pandas DataFrame
Import Error: No module named numpy
How to deal with SettingWithCopyWarning in Pandas
How to deal with SettingWithCopyWarning in Pandas
Constructing pandas DataFrame from values in variables gives “ValueError: If using all scalar values, you must pass an index”
How to iterate over rows in a DataFrame in Pandas
pandas read_json: “If using all scalar values, you must pass an index”
How to iterate over rows in a DataFrame in Pandas
ValueError: setting an array element with a sequence
Writing a pandas DataFrame to CSV file
numpy max vs amax vs maximum
Truth value of a Series is ambiguous. Use a.empty, a.bool(), a.item(), a.any() or a.all()
Truth value of a Series is ambiguous. Use a.empty, a.bool(), a.item(), a.any() or a.all()
Writing a pandas DataFrame to CSV file
What exactly does numpy.exp() do? [closed]
The difference between comparison to np.nan and isnull()
Difference between import numpy and import numpy as np
Adding new column to existing DataFrame in Python pandas
Modifing data while using iterrows() does not work
ImportError: No module named pandas
How to change the order of DataFrame columns?
What is the purpose of meshgrid in Python / NumPy?
why numpy.ndarray is object is not callable in my simple for python loop
numpy division with RuntimeWarning: invalid value encountered in double_scalars
Numpy ValueError: setting an array element with a sequence. This message may appear without the existing of a sequence?
How to change the order of DataFrame columns?
numpy: Invalid value encountered in true_divide
ImportError: No module named pandas. Pandas installed pip
python numpy ValueError: operands could not be broadcast together with shapes
What does `ValueError: cannot reindex from a duplicate axis` mean?
Pandas DataFrame Groupby two columns and get counts
How to fix ‘Object arrays cannot be loaded when allow_pickle=False’ for imdb.load_data() function?
How do I create an empty array/matrix in NumPy?
Most efficient way to find mode in numpy array
How can I use the apply() function for a single column?
How to show all columns’ names on a large pandas dataframe?
Convenient way to deal with ValueError: cannot reindex from a duplicate axis
How to groupby based on two columns in pandas?
TypeError: unhashable type: ‘numpy.ndarray’
“Series objects are mutable and cannot be hashed” error
Could not install packages due to a “Environment error :[error 13]: permission denied : ‘usr/local/bin/f2py'”
numpy division with RuntimeWarning: invalid value encountered in double_scalars
How does numpy.newaxis work and when to use it?
How to deal with SettingWithCopyWarning in Pandas
numpy matrix vector multiplication
Converting list to numpy array
Merging dataframes on index with pandas
How do I read CSV data into a record array in NumPy?
ImportError: No module named pandas
TypeError: ‘Series’ objects are mutable, thus they cannot be hashed problemwith column
Create a Pandas Dataframe by appending one row at a time
How to replace NaN values by Zeroes in a column of a Pandas Dataframe?
ValueError: Length of values does not match length of index | Pandas DataFrame.unique()
data type not understood
How do you do natural logs (e.g. “ln()”) with numpy in Python?
How do I read CSV data into a record array in NumPy?
Plotting a 2D heatmap with Matplotlib
How to normalize a NumPy array to a unit vector?
Convert Python dict into a dataframe
Should I use np.absolute or np.abs?
re.sub erroring with “Expected string or bytes-like object”
Creating an empty Pandas DataFrame, then filling it?
How do I select rows from a DataFrame based on column values?
How do I select rows from a DataFrame based on column values?
What does numpy.random.seed(0) do?
DataFrame constructor not properly called! error
Pandas group-by and sum
How do I get the row count of a Pandas DataFrame?
ImportError: numpy.core.multiarray failed to import

Categories Python Tags numpy, pandas, python

How do I escape ampersands in XML so they are rendered as entities in HTML?

Why am I getting ‘Assembly ‘*.dll’ must be strong signed in order to be marked as a prerequisite.’?

Leave a Comment Cancel reply

Comment

Name Email Website

Search for:

Recommended Hostings

Cloudways: Realize Your Website's Potential With Flexible & Affordable Hosting. 24/7/365 Support, Managed Security, Automated Backups, and 24/7 Real-time Monitoring.

FastComet: Fast SSD Hosting, Free Migration, Hack-Free Security, 24/7 Super Fast Support, 45 Day Money Back Guarantee.

Recent Added Topics

Bug in translation system: load_theme_textdomain() returns true, files are available and accessible but the language defaults to english
Custom Elementor controls not appearing in the widget Advanced tab using injection hooks
Get the name of the template/*html file used
Trying to Add Paging to Single Post Page
Sharing media files between live and staging servers
How to display the description of a custom post type in the dashboard?
Critical error on image display
Copying WP data and files into new install?
How to determine the DirectAdmin WordPress backup date?
How to get list of ALL tables in the database?

© 2026 Read For Learn

Database
- Oracle
- SQL
algorithm
asp.net
assembly
binary
c#
Git
hex
HTML
iOS
language angnostic
math
matlab
Tips & Trick
Tools
windows
C
C++
Java
javascript
Python
R
Java Script
jQuery
PHP
WordPress