Pandas DataFrames: Create new rows with calculations across existing rows Announcing the arrival of Valued Associate #679: Cesar Manara Planned maintenance scheduled April 17/18, 2019 at 00:00UTC (8:00pm US/Eastern) The Ask Question Wizard is Live! Data science time! April 2019 and salary with experience Should we burninate the [wrap] tag?Dynamic Expression Evaluation in pandas using pd.eval()Add one row to pandas DataFrameSelecting multiple columns in a pandas dataframeAdding new column to existing DataFrame in Python pandasDelete column from pandas DataFrame by column nameHow to drop rows of Pandas DataFrame whose value in certain columns is NaN“Large data” work flows using pandasHow do I get the row count of a pandas DataFrame?How to iterate over rows in a DataFrame in Pandas?Select rows from a DataFrame based on values in a column in pandasGet list from pandas DataFrame column headers

How to assign captions for two tables in LaTeX?

How discoverable are IPv6 addresses and AAAA names by potential attackers?

How do I stop a creek from eroding my steep embankment?

When to stop saving and start investing?

The logistics of corpse disposal

Bonus calculation: Am I making a mountain out of a molehill?

Why don't the Weasley twins use magic outside of school if the Trace can only find the location of spells cast?

ListPlot join points by nearest neighbor rather than order

Can inflation occur in a positive-sum game currency system such as the Stack Exchange reputation system?

How to recreate this effect in Photoshop?

What are the pros and cons of Aerospike nosecones?

How do I determine if the rules for a long jump or high jump are applicable for Monks?

How do I keep my slimes from escaping their pens?

What are the motives behind Cersei's orders given to Bronn?

"Seemed to had" is it correct?

How can I fade player when goes inside or outside of the area?

Did Kevin spill real chili?

What is the musical term for a note that continously plays through a melody?

When is phishing education going too far?

How to find all the available tools in macOS terminal?

Is the address of a local variable a constexpr?

If a contract sometimes uses the wrong name, is it still valid?

Is there a documented rationale why the House Ways and Means chairman can demand tax info?

Is there a concise way to say "all of the X, one of each"?

Pandas DataFrames: Create new rows with calculations across existing rows

Announcing the arrival of Valued Associate #679: Cesar Manara

Planned maintenance scheduled April 17/18, 2019 at 00:00UTC (8:00pm US/Eastern)

The Ask Question Wizard is Live!

Data science time! April 2019 and salary with experience

Should we burninate the [wrap] tag?Dynamic Expression Evaluation in pandas using pd.eval()Add one row to pandas DataFrameSelecting multiple columns in a pandas dataframeAdding new column to existing DataFrame in Python pandasDelete column from pandas DataFrame by column nameHow to drop rows of Pandas DataFrame whose value in certain columns is NaN“Large data” work flows using pandasHow do I get the row count of a pandas DataFrame?How to iterate over rows in a DataFrame in Pandas?Select rows from a DataFrame based on values in a column in pandasGet list from pandas DataFrame column headers

.everyoneloves__top-leaderboard:empty,.everyoneloves__mid-leaderboard:empty,.everyoneloves__bot-mid-leaderboard:empty height:90px;width:728px;box-sizing:border-box;

How can I create new rows from an existing DataFrame by grouping by certain fields (in the example "Country" and "Industry") and applying some math to another field (in the example "Field" and "Value")?

Source DataFrame

df = pd.DataFrame('Country': ['USA','USA','USA','USA','USA','USA','Canada','Canada'],
 'Industry': ['Finance', 'Finance', 'Retail', 
 'Retail', 'Energy', 'Energy', 
 'Retail', 'Retail'],
 'Field': ['Import', 'Export','Import', 
 'Export','Import', 'Export',
 'Import', 'Export'],
 'Value': [100, 50, 80, 10, 20, 5, 30, 10])

 Country Industry Field Value
0 USA Finance Import 100
1 USA Finance Export 50
2 USA Retail Import 80
3 USA Retail Export 10
4 USA Energy Import 20
5 USA Energy Export 5
6 Canada Retail Import 30
7 Canada Retail Export 10

Target DataFrame

Net = Import - Export

 Country Industry Field Value
0 USA Finance Net 50
1 USA Retail Net 70
2 USA Energy Net 15
3 Canada Retail Net 20

edited 2 days ago

Scott Boston

58.7k73258

asked 2 days ago

Lorenz

625

add a comment |

Source DataFrame

df = pd.DataFrame('Country': ['USA','USA','USA','USA','USA','USA','Canada','Canada'],
 'Industry': ['Finance', 'Finance', 'Retail', 
 'Retail', 'Energy', 'Energy', 
 'Retail', 'Retail'],
 'Field': ['Import', 'Export','Import', 
 'Export','Import', 'Export',
 'Import', 'Export'],
 'Value': [100, 50, 80, 10, 20, 5, 30, 10])

 Country Industry Field Value
0 USA Finance Import 100
1 USA Finance Export 50
2 USA Retail Import 80
3 USA Retail Export 10
4 USA Energy Import 20
5 USA Energy Export 5
6 Canada Retail Import 30
7 Canada Retail Export 10

Target DataFrame

Net = Import - Export

 Country Industry Field Value
0 USA Finance Net 50
1 USA Retail Net 70
2 USA Energy Net 15
3 Canada Retail Net 20

edited 2 days ago

Scott Boston

58.7k73258

asked 2 days ago

Lorenz

625

add a comment |

Source DataFrame

df = pd.DataFrame('Country': ['USA','USA','USA','USA','USA','USA','Canada','Canada'],
 'Industry': ['Finance', 'Finance', 'Retail', 
 'Retail', 'Energy', 'Energy', 
 'Retail', 'Retail'],
 'Field': ['Import', 'Export','Import', 
 'Export','Import', 'Export',
 'Import', 'Export'],
 'Value': [100, 50, 80, 10, 20, 5, 30, 10])

 Country Industry Field Value
0 USA Finance Import 100
1 USA Finance Export 50
2 USA Retail Import 80
3 USA Retail Export 10
4 USA Energy Import 20
5 USA Energy Export 5
6 Canada Retail Import 30
7 Canada Retail Export 10

Target DataFrame

Net = Import - Export

 Country Industry Field Value
0 USA Finance Net 50
1 USA Retail Net 70
2 USA Energy Net 15
3 Canada Retail Net 20

edited 2 days ago

Scott Boston

58.7k73258

asked 2 days ago

Lorenz

625

Source DataFrame

df = pd.DataFrame('Country': ['USA','USA','USA','USA','USA','USA','Canada','Canada'],
 'Industry': ['Finance', 'Finance', 'Retail', 
 'Retail', 'Energy', 'Energy', 
 'Retail', 'Retail'],
 'Field': ['Import', 'Export','Import', 
 'Export','Import', 'Export',
 'Import', 'Export'],
 'Value': [100, 50, 80, 10, 20, 5, 30, 10])

 Country Industry Field Value
0 USA Finance Import 100
1 USA Finance Export 50
2 USA Retail Import 80
3 USA Retail Export 10
4 USA Energy Import 20
5 USA Energy Export 5
6 Canada Retail Import 30
7 Canada Retail Export 10

Target DataFrame

Net = Import - Export

 Country Industry Field Value
0 USA Finance Net 50
1 USA Retail Net 70
2 USA Energy Net 15
3 Canada Retail Net 20

python pandas dataframe

edited 2 days ago

Scott Boston

58.7k73258

asked 2 days ago

Lorenz

625

edited 2 days ago

Scott Boston

58.7k73258

asked 2 days ago

Lorenz

625

edited 2 days ago

Scott Boston

58.7k73258

edited 2 days ago

Scott Boston

58.7k73258

edited 2 days ago

Scott Boston

58.7k73258

asked 2 days ago

Lorenz

625

asked 2 days ago

Lorenz

625

asked 2 days ago

Lorenz

625

add a comment |

5 Answers
5

active

oldest

votes

There are quite possibly many ways. Here's one using groupby and unstack:

(df.groupby(['Country', 'Industry', 'Field'], sort=False)['Value']
 .sum()
 .unstack('Field')
 .eval('Import - Export')
 .reset_index(name='Value'))

 Country Industry Value
0 USA Finance 50
1 USA Retail 70
2 USA Energy 15
3 Canada Retail 20

edited yesterday

answered 2 days ago

coldspeed

142k25159247

1

By far the best answer. The unstack followed by eval is a really nice trick — better than a second groupby and get_group I would have done

– BallpointBen
2 days ago

1

@BallpointBen eval and query are personal favourites of mine from the API. I've made attempts to popularise their use, but their usage is not completely understood. I have a QnA here, if you are interested.

– coldspeed
2 days ago

Works like a charm. Thank you very much. Very small comment - there is a closing bracket missing in the last line.

– Lorenz
yesterday

@Lorenz Oops... fixed, thanks!

– coldspeed
yesterday

@coldspeed Actually I think there’s a better way… see my answer. unstack is expensive because it reshapes. Using the structure of the first groupby is more efficient, although it takes two lines.

– BallpointBen
yesterday

|
show 1 more comment

IIUC

df=df.set_index(['Country','Industry'])

Newdf=(df.loc[df.Field=='Export','Value']-df.loc[df.Field=='Import','Value']).reset_index().assign(Field='Net')
Newdf
 Country Industry Value Field
0 USA Finance -50 Net
1 USA Retail -70 Net
2 USA Energy -15 Net
3 Canada Retail -20 Net

pivot_table

df.pivot_table(index=['Country','Industry'],columns='Field',values='Value',aggfunc='sum').
 diff(axis=1).
 dropna(1).
 rename(columns='Import':'Value').
 reset_index()
Out[112]: 
Field Country Industry Value
0 Canada Retail 20.0
1 USA Energy 15.0
2 USA Finance 50.0
3 USA Retail 70.0

edited 2 days ago

answered 2 days ago

Wen-Ben

126k83872

add a comment |

You can use Groupby.diff() and after that recreate the Field column and finally use DataFrame.dropna:

df['Value'] = df.groupby(['Country', 'Industry'])['Value'].diff().abs()
df['Field'] = 'Net'
df.dropna(inplace=True)
df.reset_index(drop=True, inplace=True)

print(df)
 Country Industry Field Value
0 USA Finance Net 50.0
1 USA Retail Net 70.0
2 USA Energy Net 15.0
3 Canada Retail Net 20.0

answered 2 days ago

Erfan

3,4381419

add a comment |

You can do it this way to add those rows to your original dataframe:

df.set_index(['Country','Industry','Field'])
 .unstack()['Value']
 .eval('Net = Import - Export')
 .stack().rename('Value').reset_index()

Output:

 Country Industry Field Value
0 Canada Retail Export 10
1 Canada Retail Import 30
2 Canada Retail Net 20
3 USA Energy Export 5
4 USA Energy Import 20
5 USA Energy Net 15
6 USA Finance Export 50
7 USA Finance Import 100
8 USA Finance Net 50
9 USA Retail Export 10
10 USA Retail Import 80
11 USA Retail Net 70

answered 2 days ago

Scott Boston

58.7k73258

Thanks - actually, I wanted to append it to the original df. So, nice trick to do this all in one command,

– Lorenz
yesterday

1

Coldspeed‘s answer was a slight better fit to my overall code. Took from your code how you appended the result to the original df. Very tight result, though. Pitty that i can not accept two answers. But thanks again!

– Lorenz
yesterday

add a comment |

This answer takes advantage of the fact that pandas puts the group keys in the multiindex of the resulting dataframe. (If there were only one group key, you could use loc.)

>>> s = df.groupby(['Country', 'Industry', 'Field'])['Value'].sum()
>>> s.xs('Import', axis=0, level='Field') - s.xs('Export', axis=0, level='Field')
Country Industry
Canada Retail 20
USA Energy 15
 Finance 50
 Retail 70
Name: Value, dtype: int64

answered yesterday

BallpointBen

3,7681639

add a comment |

Your Answer

StackExchange.ifUsing("editor", function ()
StackExchange.using("externalEditor", function ()
StackExchange.using("snippets", function ()
StackExchange.snippets.init();
);
);
, "code-snippets");

StackExchange.ready(function()
var channelOptions =
tags: "".split(" "),
id: "1"
;
initTagRenderer("".split(" "), "".split(" "), channelOptions);

StackExchange.using("externalEditor", function()
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled)
StackExchange.using("snippets", function()
createEditor();
);

else
createEditor();

);

function createEditor()
StackExchange.prepareEditor(
heartbeatType: 'answer',
autoActivateHeartbeat: false,
convertImagesToLinks: true,
noModals: true,
showLowRepImageUploadWarning: true,
reputationToPostImages: 10,
bindNavPrevention: true,
postfix: "",
imageUploader:
brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
allowUrls: true
,
onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
);

);

draft saved

draft discarded

StackExchange.ready(
function ()
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f55670192%2fpandas-dataframes-create-new-rows-with-calculations-across-existing-rows%23new-answer', 'question_page');

);

Post as a guest

Name

Required, but never shown

5 Answers
5

active

oldest

votes

5 Answers
5

active

oldest

votes

There are quite possibly many ways. Here's one using groupby and unstack:

(df.groupby(['Country', 'Industry', 'Field'], sort=False)['Value']
 .sum()
 .unstack('Field')
 .eval('Import - Export')
 .reset_index(name='Value'))

 Country Industry Value
0 USA Finance 50
1 USA Retail 70
2 USA Energy 15
3 Canada Retail 20

edited yesterday

answered 2 days ago

coldspeed

142k25159247

1

By far the best answer. The unstack followed by eval is a really nice trick — better than a second groupby and get_group I would have done

– BallpointBen
2 days ago

1

@BallpointBen eval and query are personal favourites of mine from the API. I've made attempts to popularise their use, but their usage is not completely understood. I have a QnA here, if you are interested.

– coldspeed
2 days ago

Works like a charm. Thank you very much. Very small comment - there is a closing bracket missing in the last line.

– Lorenz
yesterday

@Lorenz Oops... fixed, thanks!

– coldspeed
yesterday

@coldspeed Actually I think there’s a better way… see my answer. unstack is expensive because it reshapes. Using the structure of the first groupby is more efficient, although it takes two lines.

– BallpointBen
yesterday

|
show 1 more comment

There are quite possibly many ways. Here's one using groupby and unstack:

(df.groupby(['Country', 'Industry', 'Field'], sort=False)['Value']
 .sum()
 .unstack('Field')
 .eval('Import - Export')
 .reset_index(name='Value'))

 Country Industry Value
0 USA Finance 50
1 USA Retail 70
2 USA Energy 15
3 Canada Retail 20

edited yesterday

answered 2 days ago

coldspeed

142k25159247

1

By far the best answer. The unstack followed by eval is a really nice trick — better than a second groupby and get_group I would have done

– BallpointBen
2 days ago

1

@BallpointBen eval and query are personal favourites of mine from the API. I've made attempts to popularise their use, but their usage is not completely understood. I have a QnA here, if you are interested.

– coldspeed
2 days ago

Works like a charm. Thank you very much. Very small comment - there is a closing bracket missing in the last line.

– Lorenz
yesterday

@Lorenz Oops... fixed, thanks!

– coldspeed
yesterday

@coldspeed Actually I think there’s a better way… see my answer. unstack is expensive because it reshapes. Using the structure of the first groupby is more efficient, although it takes two lines.

– BallpointBen
yesterday

|
show 1 more comment

There are quite possibly many ways. Here's one using groupby and unstack:

(df.groupby(['Country', 'Industry', 'Field'], sort=False)['Value']
 .sum()
 .unstack('Field')
 .eval('Import - Export')
 .reset_index(name='Value'))

 Country Industry Value
0 USA Finance 50
1 USA Retail 70
2 USA Energy 15
3 Canada Retail 20

edited yesterday

answered 2 days ago

coldspeed

142k25159247

There are quite possibly many ways. Here's one using groupby and unstack:

(df.groupby(['Country', 'Industry', 'Field'], sort=False)['Value']
 .sum()
 .unstack('Field')
 .eval('Import - Export')
 .reset_index(name='Value'))

 Country Industry Value
0 USA Finance 50
1 USA Retail 70
2 USA Energy 15
3 Canada Retail 20

edited yesterday

answered 2 days ago

coldspeed

142k25159247

edited yesterday

answered 2 days ago

coldspeed

142k25159247

answered 2 days ago

coldspeed

142k25159247

answered 2 days ago

coldspeed

142k25159247

1

By far the best answer. The unstack followed by eval is a really nice trick — better than a second groupby and get_group I would have done

– BallpointBen
2 days ago

1

@BallpointBen eval and query are personal favourites of mine from the API. I've made attempts to popularise their use, but their usage is not completely understood. I have a QnA here, if you are interested.

– coldspeed
2 days ago

Works like a charm. Thank you very much. Very small comment - there is a closing bracket missing in the last line.

– Lorenz
yesterday

@Lorenz Oops... fixed, thanks!

– coldspeed
yesterday

@coldspeed Actually I think there’s a better way… see my answer. unstack is expensive because it reshapes. Using the structure of the first groupby is more efficient, although it takes two lines.

– BallpointBen
yesterday

|
show 1 more comment

1

By far the best answer. The unstack followed by eval is a really nice trick — better than a second groupby and get_group I would have done

– BallpointBen
2 days ago

1

@BallpointBen eval and query are personal favourites of mine from the API. I've made attempts to popularise their use, but their usage is not completely understood. I have a QnA here, if you are interested.

– coldspeed
2 days ago

Works like a charm. Thank you very much. Very small comment - there is a closing bracket missing in the last line.

– Lorenz
yesterday

@Lorenz Oops... fixed, thanks!

– coldspeed
yesterday

@coldspeed Actually I think there’s a better way… see my answer. unstack is expensive because it reshapes. Using the structure of the first groupby is more efficient, although it takes two lines.

– BallpointBen
yesterday

By far the best answer. The unstack followed by eval is a really nice trick — better than a second groupby and get_group I would have done

– BallpointBen
2 days ago

@BallpointBen eval and query are personal favourites of mine from the API. I've made attempts to popularise their use, but their usage is not completely understood. I have a QnA here, if you are interested.

– coldspeed
2 days ago

Works like a charm. Thank you very much. Very small comment - there is a closing bracket missing in the last line.

– Lorenz
yesterday

@Lorenz Oops... fixed, thanks!

– coldspeed
yesterday

@coldspeed Actually I think there’s a better way… see my answer. unstack is expensive because it reshapes. Using the structure of the first groupby is more efficient, although it takes two lines.

– BallpointBen
yesterday

|
show 1 more comment

IIUC

df=df.set_index(['Country','Industry'])

Newdf=(df.loc[df.Field=='Export','Value']-df.loc[df.Field=='Import','Value']).reset_index().assign(Field='Net')
Newdf
 Country Industry Value Field
0 USA Finance -50 Net
1 USA Retail -70 Net
2 USA Energy -15 Net
3 Canada Retail -20 Net

pivot_table

df.pivot_table(index=['Country','Industry'],columns='Field',values='Value',aggfunc='sum').
 diff(axis=1).
 dropna(1).
 rename(columns='Import':'Value').
 reset_index()
Out[112]: 
Field Country Industry Value
0 Canada Retail 20.0
1 USA Energy 15.0
2 USA Finance 50.0
3 USA Retail 70.0

edited 2 days ago

answered 2 days ago

Wen-Ben

126k83872

add a comment |

IIUC

df=df.set_index(['Country','Industry'])

Newdf=(df.loc[df.Field=='Export','Value']-df.loc[df.Field=='Import','Value']).reset_index().assign(Field='Net')
Newdf
 Country Industry Value Field
0 USA Finance -50 Net
1 USA Retail -70 Net
2 USA Energy -15 Net
3 Canada Retail -20 Net

pivot_table

df.pivot_table(index=['Country','Industry'],columns='Field',values='Value',aggfunc='sum').
 diff(axis=1).
 dropna(1).
 rename(columns='Import':'Value').
 reset_index()
Out[112]: 
Field Country Industry Value
0 Canada Retail 20.0
1 USA Energy 15.0
2 USA Finance 50.0
3 USA Retail 70.0

edited 2 days ago

answered 2 days ago

Wen-Ben

126k83872

add a comment |

IIUC

df=df.set_index(['Country','Industry'])

Newdf=(df.loc[df.Field=='Export','Value']-df.loc[df.Field=='Import','Value']).reset_index().assign(Field='Net')
Newdf
 Country Industry Value Field
0 USA Finance -50 Net
1 USA Retail -70 Net
2 USA Energy -15 Net
3 Canada Retail -20 Net

pivot_table

df.pivot_table(index=['Country','Industry'],columns='Field',values='Value',aggfunc='sum').
 diff(axis=1).
 dropna(1).
 rename(columns='Import':'Value').
 reset_index()
Out[112]: 
Field Country Industry Value
0 Canada Retail 20.0
1 USA Energy 15.0
2 USA Finance 50.0
3 USA Retail 70.0

edited 2 days ago

answered 2 days ago

Wen-Ben

126k83872

IIUC

df=df.set_index(['Country','Industry'])

Newdf=(df.loc[df.Field=='Export','Value']-df.loc[df.Field=='Import','Value']).reset_index().assign(Field='Net')
Newdf
 Country Industry Value Field
0 USA Finance -50 Net
1 USA Retail -70 Net
2 USA Energy -15 Net
3 Canada Retail -20 Net

pivot_table

df.pivot_table(index=['Country','Industry'],columns='Field',values='Value',aggfunc='sum').
 diff(axis=1).
 dropna(1).
 rename(columns='Import':'Value').
 reset_index()
Out[112]: 
Field Country Industry Value
0 Canada Retail 20.0
1 USA Energy 15.0
2 USA Finance 50.0
3 USA Retail 70.0

edited 2 days ago

answered 2 days ago

Wen-Ben

126k83872

edited 2 days ago

answered 2 days ago

Wen-Ben

126k83872

answered 2 days ago

Wen-Ben

126k83872

answered 2 days ago

Wen-Ben

126k83872

add a comment |

You can use Groupby.diff() and after that recreate the Field column and finally use DataFrame.dropna:

df['Value'] = df.groupby(['Country', 'Industry'])['Value'].diff().abs()
df['Field'] = 'Net'
df.dropna(inplace=True)
df.reset_index(drop=True, inplace=True)

print(df)
 Country Industry Field Value
0 USA Finance Net 50.0
1 USA Retail Net 70.0
2 USA Energy Net 15.0
3 Canada Retail Net 20.0

answered 2 days ago

Erfan

3,4381419

add a comment |

You can use Groupby.diff() and after that recreate the Field column and finally use DataFrame.dropna:

df['Value'] = df.groupby(['Country', 'Industry'])['Value'].diff().abs()
df['Field'] = 'Net'
df.dropna(inplace=True)
df.reset_index(drop=True, inplace=True)

print(df)
 Country Industry Field Value
0 USA Finance Net 50.0
1 USA Retail Net 70.0
2 USA Energy Net 15.0
3 Canada Retail Net 20.0

answered 2 days ago

Erfan

3,4381419

add a comment |

You can use Groupby.diff() and after that recreate the Field column and finally use DataFrame.dropna:

df['Value'] = df.groupby(['Country', 'Industry'])['Value'].diff().abs()
df['Field'] = 'Net'
df.dropna(inplace=True)
df.reset_index(drop=True, inplace=True)

print(df)
 Country Industry Field Value
0 USA Finance Net 50.0
1 USA Retail Net 70.0
2 USA Energy Net 15.0
3 Canada Retail Net 20.0

answered 2 days ago

Erfan

3,4381419

You can use Groupby.diff() and after that recreate the Field column and finally use DataFrame.dropna:

df['Value'] = df.groupby(['Country', 'Industry'])['Value'].diff().abs()
df['Field'] = 'Net'
df.dropna(inplace=True)
df.reset_index(drop=True, inplace=True)

print(df)
 Country Industry Field Value
0 USA Finance Net 50.0
1 USA Retail Net 70.0
2 USA Energy Net 15.0
3 Canada Retail Net 20.0

answered 2 days ago

Erfan

3,4381419

answered 2 days ago

Erfan

3,4381419

answered 2 days ago

Erfan

3,4381419

answered 2 days ago

Erfan

3,4381419

add a comment |

You can do it this way to add those rows to your original dataframe:

df.set_index(['Country','Industry','Field'])
 .unstack()['Value']
 .eval('Net = Import - Export')
 .stack().rename('Value').reset_index()

Output:

 Country Industry Field Value
0 Canada Retail Export 10
1 Canada Retail Import 30
2 Canada Retail Net 20
3 USA Energy Export 5
4 USA Energy Import 20
5 USA Energy Net 15
6 USA Finance Export 50
7 USA Finance Import 100
8 USA Finance Net 50
9 USA Retail Export 10
10 USA Retail Import 80
11 USA Retail Net 70

answered 2 days ago

Scott Boston

58.7k73258

Thanks - actually, I wanted to append it to the original df. So, nice trick to do this all in one command,

– Lorenz
yesterday

1

Coldspeed‘s answer was a slight better fit to my overall code. Took from your code how you appended the result to the original df. Very tight result, though. Pitty that i can not accept two answers. But thanks again!

– Lorenz
yesterday

add a comment |

You can do it this way to add those rows to your original dataframe:

df.set_index(['Country','Industry','Field'])
 .unstack()['Value']
 .eval('Net = Import - Export')
 .stack().rename('Value').reset_index()

Output:

 Country Industry Field Value
0 Canada Retail Export 10
1 Canada Retail Import 30
2 Canada Retail Net 20
3 USA Energy Export 5
4 USA Energy Import 20
5 USA Energy Net 15
6 USA Finance Export 50
7 USA Finance Import 100
8 USA Finance Net 50
9 USA Retail Export 10
10 USA Retail Import 80
11 USA Retail Net 70

answered 2 days ago

Scott Boston

58.7k73258

Thanks - actually, I wanted to append it to the original df. So, nice trick to do this all in one command,

– Lorenz
yesterday

1

Coldspeed‘s answer was a slight better fit to my overall code. Took from your code how you appended the result to the original df. Very tight result, though. Pitty that i can not accept two answers. But thanks again!

– Lorenz
yesterday

add a comment |

You can do it this way to add those rows to your original dataframe:

df.set_index(['Country','Industry','Field'])
 .unstack()['Value']
 .eval('Net = Import - Export')
 .stack().rename('Value').reset_index()

Output:

 Country Industry Field Value
0 Canada Retail Export 10
1 Canada Retail Import 30
2 Canada Retail Net 20
3 USA Energy Export 5
4 USA Energy Import 20
5 USA Energy Net 15
6 USA Finance Export 50
7 USA Finance Import 100
8 USA Finance Net 50
9 USA Retail Export 10
10 USA Retail Import 80
11 USA Retail Net 70

answered 2 days ago

Scott Boston

58.7k73258

You can do it this way to add those rows to your original dataframe:

df.set_index(['Country','Industry','Field'])
 .unstack()['Value']
 .eval('Net = Import - Export')
 .stack().rename('Value').reset_index()

Output:

 Country Industry Field Value
0 Canada Retail Export 10
1 Canada Retail Import 30
2 Canada Retail Net 20
3 USA Energy Export 5
4 USA Energy Import 20
5 USA Energy Net 15
6 USA Finance Export 50
7 USA Finance Import 100
8 USA Finance Net 50
9 USA Retail Export 10
10 USA Retail Import 80
11 USA Retail Net 70

answered 2 days ago

Scott Boston

58.7k73258

answered 2 days ago

Scott Boston

58.7k73258

answered 2 days ago

Scott Boston

58.7k73258

answered 2 days ago

Scott Boston

58.7k73258

Thanks - actually, I wanted to append it to the original df. So, nice trick to do this all in one command,

– Lorenz
yesterday

1

Coldspeed‘s answer was a slight better fit to my overall code. Took from your code how you appended the result to the original df. Very tight result, though. Pitty that i can not accept two answers. But thanks again!

– Lorenz
yesterday

add a comment |

Thanks - actually, I wanted to append it to the original df. So, nice trick to do this all in one command,

– Lorenz
yesterday

1

Coldspeed‘s answer was a slight better fit to my overall code. Took from your code how you appended the result to the original df. Very tight result, though. Pitty that i can not accept two answers. But thanks again!

– Lorenz
yesterday

Thanks - actually, I wanted to append it to the original df. So, nice trick to do this all in one command,

– Lorenz
yesterday

Coldspeed‘s answer was a slight better fit to my overall code. Took from your code how you appended the result to the original df. Very tight result, though. Pitty that i can not accept two answers. But thanks again!

– Lorenz
yesterday

add a comment |

This answer takes advantage of the fact that pandas puts the group keys in the multiindex of the resulting dataframe. (If there were only one group key, you could use loc.)

>>> s = df.groupby(['Country', 'Industry', 'Field'])['Value'].sum()
>>> s.xs('Import', axis=0, level='Field') - s.xs('Export', axis=0, level='Field')
Country Industry
Canada Retail 20
USA Energy 15
 Finance 50
 Retail 70
Name: Value, dtype: int64

answered yesterday

BallpointBen

3,7681639

add a comment |

This answer takes advantage of the fact that pandas puts the group keys in the multiindex of the resulting dataframe. (If there were only one group key, you could use loc.)

>>> s = df.groupby(['Country', 'Industry', 'Field'])['Value'].sum()
>>> s.xs('Import', axis=0, level='Field') - s.xs('Export', axis=0, level='Field')
Country Industry
Canada Retail 20
USA Energy 15
 Finance 50
 Retail 70
Name: Value, dtype: int64

answered yesterday

BallpointBen

3,7681639

add a comment |

This answer takes advantage of the fact that pandas puts the group keys in the multiindex of the resulting dataframe. (If there were only one group key, you could use loc.)

>>> s = df.groupby(['Country', 'Industry', 'Field'])['Value'].sum()
>>> s.xs('Import', axis=0, level='Field') - s.xs('Export', axis=0, level='Field')
Country Industry
Canada Retail 20
USA Energy 15
 Finance 50
 Retail 70
Name: Value, dtype: int64

answered yesterday

BallpointBen

3,7681639

This answer takes advantage of the fact that pandas puts the group keys in the multiindex of the resulting dataframe. (If there were only one group key, you could use loc.)

>>> s = df.groupby(['Country', 'Industry', 'Field'])['Value'].sum()
>>> s.xs('Import', axis=0, level='Field') - s.xs('Export', axis=0, level='Field')
Country Industry
Canada Retail 20
USA Energy 15
 Finance 50
 Retail 70
Name: Value, dtype: int64

answered yesterday

BallpointBen

3,7681639

answered yesterday

BallpointBen

3,7681639

answered yesterday

BallpointBen

3,7681639

answered yesterday

BallpointBen

3,7681639

add a comment |

draft saved

draft discarded

Thanks for contributing an answer to Stack Overflow!

Please be sure to answer the question. Provide details and share your research!

But avoid …

Asking for help, clarification, or responding to other answers.

Making statements based on opinion; back them up with references or personal experience.

To learn more, see our tips on writing great answers.

draft saved

draft discarded

Post as a guest

Name

Required, but never shown

Name

Required, but never shown

Name

Required, but never shown

This page is only for reference, If you need detailed information, please check here

搜尋此網誌

Ttdfjt

5 Answers
5

Your Answer

Post as a guest

5 Answers
5

5 Answers
5

Post as a guest

Popular posts from this blog

Grendel Contents Story Scholarship Depictions Notes References Navigation menu10.1093/notesj/gjn112Berserkeree

Log in Navigation menu

5 Answers 5

Your Answer

Sign up or log in

Post as a guest

Post as a guest

5 Answers 5

5 Answers 5

Sign up or log in

Post as a guest

Post as a guest

Sign up or log in

Post as a guest

Sign up or log in

Post as a guest

Sign up or log in

Post as a guest

Popular posts from this blog

Grendel Contents Story Scholarship Depictions Notes References Navigation menu10.1093/notesj/gjn112Berserkeree

Log in Navigation menu

5 Answers
5

5 Answers
5

5 Answers
5