fixed issue #209 (CCGT power plant 4079) defined as STORE instead of PP #221

gincrement · 2025-01-29T13:59:31Z

Closes # (if applicable).

Changes proposed in this Pull Request

Checklist

[ x] Code changes are sufficiently documented; i.e. new functions contain docstrings and further explanations may be given in doc.
Unit tests for new features were added (if applicable).
A note for the release notes doc/release_notes.rst of the upcoming release is included.
[ x] I consent to the release of this PR's code under the MIT license.

…d of PP

lkstrp

powerplant.csv' is generated automatically. This is basically the standard output of powerplantsmatching, so you can use the data without re-running it.
Which means that fixing things in here will just be overwritten by the next release. We would need to look at GEM and GPD, see where the problem is coming from and make a manual adjustment (manual_corrections.csv) or improve the general pre-processing (data.py)

for more information, see https://pre-commit.ci

lkstrp · 2025-01-30T09:19:18Z

powerplantmatching/data.py

+    # fix a bug within the data source GPD related with the power plant at 'Creyke Beck'
+    # Technology: CCGT  -> Combustion Engine
+    # Set:        Store -> PP
+    df.loc[df["Gppd_Idnr"] == "GBR2001173", "Technology"] = "Combustion Engine"
+    df.loc[df["Gppd_Idnr"] == "GBR2001173", "Set"] = "PP"
+


Can you just add that in here?
https://github.com/PyPSA/powerplantmatching/blob/master/powerplantmatching/package_data/manual_corrections.csv

It will basically do the same, those changes will be updated via

powerplantmatching/powerplantmatching/utils.py

Lines 143 to 172 in f593581

def correct_manually(df, name, config=None):

"""

Update powerplant data based on stored corrections in

powerplantmatching/data/in/manual_corrections.csv. Specify the name

of the data by the second argument.

Parameters

----------

df : pandas.DataFrame

Powerplant data

name : str

Name of the data source, should be in columns of manual_corrections.csv

"""

if config is None:

config = get_config()

corrections_fn = _package_data("manual_corrections.csv")

corrections = pd.read_csv(corrections_fn)

corrections = (

corrections.query("Source == @name")

.drop(columns="Source")

.set_index("projectID")

)

if corrections.empty:

return df

df = df.set_index("projectID").copy()

df.update(corrections)

return df.reset_index()

It is just much cleaner if we keep single manual corrections in a file, instead of bloating up data.csv which should just contain general preprocessing logic

Could be done if I am allowed to add a column as well.

True, but yes go for it. The function looks already generic enough

…nstead of Combustion Engine/PP

lkstrp · 2025-01-30T14:55:45Z

Thank you @gincrement !

fixed issue PyPSA#209 (CCGT power plant 4079) defined as STORE instea…

c50e92e

…d of PP

lkstrp requested changes Jan 29, 2025

View reviewed changes

gincrement and others added 2 commits January 29, 2025 20:36

bug fix for issue PyPSA#209 via adjustment in data.py in function GPD()

1f16879

[pre-commit.ci] auto fixes from pre-commit.com hooks

0b3380a

for more information, see https://pre-commit.ci

lkstrp reviewed Jan 30, 2025

View reviewed changes

fixed issue PyPSA#209 (CCGT power plant 4079) defined as CCGT/STORE i…

8ea84b8

…nstead of Combustion Engine/PP

lkstrp merged commit bc7c047 into PyPSA:master Jan 30, 2025
14 of 16 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fixed issue #209 (CCGT power plant 4079) defined as STORE instead of PP #221

fixed issue #209 (CCGT power plant 4079) defined as STORE instead of PP #221

gincrement commented Jan 29, 2025

lkstrp left a comment

lkstrp Jan 30, 2025

gincrement Jan 30, 2025

lkstrp Jan 30, 2025

lkstrp commented Jan 30, 2025

	def correct_manually(df, name, config=None):
	"""
	Update powerplant data based on stored corrections in
	powerplantmatching/data/in/manual_corrections.csv. Specify the name
	of the data by the second argument.

	Parameters
	----------
	df : pandas.DataFrame
	Powerplant data
	name : str
	Name of the data source, should be in columns of manual_corrections.csv
	"""
	if config is None:
	config = get_config()

	corrections_fn = _package_data("manual_corrections.csv")
	corrections = pd.read_csv(corrections_fn)

	corrections = (
	corrections.query("Source == @name")
	.drop(columns="Source")
	.set_index("projectID")
	)
	if corrections.empty:
	return df

	df = df.set_index("projectID").copy()
	df.update(corrections)
	return df.reset_index()

fixed issue #209 (CCGT power plant 4079) defined as STORE instead of PP #221

fixed issue #209 (CCGT power plant 4079) defined as STORE instead of PP #221

Conversation

gincrement commented Jan 29, 2025

Changes proposed in this Pull Request

Checklist

lkstrp left a comment

Choose a reason for hiding this comment

lkstrp Jan 30, 2025

Choose a reason for hiding this comment

gincrement Jan 30, 2025

Choose a reason for hiding this comment

lkstrp Jan 30, 2025

Choose a reason for hiding this comment

lkstrp commented Jan 30, 2025