
out_eia__monthly_generators

package: pudl

Monthly time series of all generator attributes including calculated capacity factor, heat rate, fuel cost per MMBTU and fuel cost per MWh.

Processing:

Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.

Source:

EIA -- Mix of multiple EIA Forms

Primary key:

report_date, plant_id_eia, generator_id

Usage Warnings

  • Contains estimated values.

  • Due to coverage problems in other tables, some generators have heat rate estimates but not fuel cost estimates.

  • Not all columns are originally reported in or calculable from the input tables. Expect nulls.

  • Data has been drawn from several EIA sources which are not always consistent with each other, and PUDL chooses the most consistent or relevant value to facilitate cross-referencing even if that means some values will differ from the raw sources. See Harvesting for details.

  • Contains information from multiple raw inputs.

Additional Details

This table includes all monthly attributes for all generators reported to EIA-860 and EIA-923.

To provide a complete picture of generator data, this table compiles data from many different EIA tables, including directly reported, imputed and calculated columns. We suggest using this table if you want to explore monthly attributes about generators and would rather use a more complete and denormalized table. If you are more interested in the originally reported values, we recommend searching for core_eia with the column you are most interested in.

The calculations of capacity factor, heat rate, fuel cost per MMBTU and fuel cost per MWh are based on the allocation of net generation reported on the basis of plant, prime mover and energy source to individual generators. That allocation comes from the out_eia923__monthly_generation_fuel_by_generator table.

Monthly heat rates for each generator are estimated from the allocated per-generator net generation and fuel consumption, together with the core_eia923__monthly_boiler_fuel table, which reports fuel consumed by boiler. Heat rates are necessary to estimate the amount of fuel consumed by a generation unit, and thus the fuel cost per MWh generated.

Plant specific fuel prices are taken from the core_eia923__monthly_fuel_receipts_costs table, which only has ~70% coverage, leading to some generators with heat rate estimates still lacking fuel cost estimates.
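The relationship between these derived columns can be sketched as follows. This is a minimal illustration, not PUDL's implementation: it assumes heat rate is fuel consumed (MMBtu) divided by net generation (MWh), and that fuel cost per MWh is heat rate multiplied by fuel cost per MMBtu. The function names and sample numbers are hypothetical.

```python
from typing import Optional


def heat_rate_mmbtu_per_mwh(
    fuel_consumed_mmbtu: Optional[float],
    net_generation_mwh: Optional[float],
) -> Optional[float]:
    """Fuel energy consumed per MWh generated; None if inputs are missing or zero."""
    if fuel_consumed_mmbtu is None or not net_generation_mwh:
        return None
    return fuel_consumed_mmbtu / net_generation_mwh


def fuel_cost_per_mwh(
    heat_rate: Optional[float],
    fuel_cost_per_mmbtu: Optional[float],
) -> Optional[float]:
    """Fuel cost per MWh; None when either the heat rate or the price is missing."""
    if heat_rate is None or fuel_cost_per_mmbtu is None:
        return None
    return heat_rate * fuel_cost_per_mmbtu


# Hypothetical generator-month: 50,000 MMBtu burned, 5,000 MWh generated.
hr = heat_rate_mmbtu_per_mwh(50_000.0, 5_000.0)
cost = fuel_cost_per_mwh(hr, 3.0)

# A generator with a heat rate but no plant-specific fuel price gets no cost.
no_price = fuel_cost_per_mwh(hr, None)
```

The last call mirrors the coverage gap described above: a heat rate estimate can exist while the fuel cost estimate stays null.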

This table inherits canonicalized values for generators, plants, and utilities. EIA reports many attributes in many different tables across EIA-860 and EIA-923. In order to compile tidy, well-normalized database tables, PUDL collects all instances of these values and chooses a canonical value. By default, PUDL chooses the most consistently reported value of a given attribute as long as it is at least 70% of the given instances reported. If an attribute was reported inconsistently across the original EIA tables, then it will show up as a null value.
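The 70% rule can be illustrated with a short sketch. This is not PUDL's harvesting code: the function name canonical_value is hypothetical, the sample values are invented, and the choice to exclude nulls from the denominator is an assumption made for the example.

```python
from collections import Counter
from typing import Hashable, Iterable, Optional


def canonical_value(
    reported: Iterable[Optional[Hashable]],
    threshold: float = 0.7,
) -> Optional[Hashable]:
    """Return the most common reported value if it meets the threshold, else None."""
    # Assumption: null reports are ignored when computing the share.
    values = [v for v in reported if v is not None]
    if not values:
        return None
    value, count = Counter(values).most_common(1)[0]
    return value if count / len(values) >= threshold else None


# Three of four reports agree (75%), so the majority value is kept.
consistent = canonical_value(["Big Bend", "Big Bend", "Big Bend", "Big Bend Stn"])

# A 50/50 split falls below 70%, so the attribute becomes null.
inconsistent = canonical_value(["NY", "NJ", "NY", "NJ"])
```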

Columns
plant_id_eia

The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.

generator_id

Generator ID is usually numeric, but sometimes includes letters. Make sure you treat it as a string!
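A small sketch of why the string type matters, using invented sample rows (the IDs below are hypothetical, not taken from EIA data). Python's csv module keeps every field as text, so distinct IDs survive; casting to integers would merge some and fail on others.

```python
import csv
import io

# Hypothetical extract with three distinct generators at one plant.
raw = io.StringIO(
    "plant_id_eia,generator_id,capacity_mw\n"
    "3,1,153.1\n"
    "3,01,50.0\n"
    "3,GT1,20.0\n"
)

rows = list(csv.DictReader(raw))

# Kept as strings, all three IDs remain distinct.
generator_ids = [row["generator_id"] for row in rows]

# Cast to int, "1" and "01" collapse into one value and "GT1" cannot be cast.
as_int = {int(g) for g in generator_ids if g.isdigit()}
```

With a reader that infers column types, request a string type for this column explicitly; pandas.read_csv, for example, accepts a dtype mapping for this purpose.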

report_date

Date reported.

unit_id_pudl

Dynamically assigned PUDL unit id. WARNING: This ID is not guaranteed to be static long term as the input data and algorithm may evolve over time.

plant_id_pudl

A manually assigned PUDL plant ID. May not be constant over time.

plant_name_eia

Plant name.

utility_id_eia

The EIA Utility Identification number.

utility_id_pudl

A manually assigned PUDL utility ID. May not be stable over time.

utility_name_eia

The name of the utility.

balancing_authority_code_eia

EIA short code identifying a balancing authority. May include Canadian and Mexican BAs.

balancing_authority_name_eia

Name of the balancing authority.

technology_description

High level description of the technology used by the generator to produce electricity.

energy_source_code_1

The code representing the most predominant type of energy that fuels the generator.

prime_mover_code

Code for the type of prime mover (e.g. CT, CG)

generator_operating_date

Date the generator began commercial operation. If harvested values are inconsistent, we default to using the most recently reported date.

generator_retirement_date

Date of the scheduled or effected retirement of the generator.

operational_status

The operating status of the asset. For generators this is based on the tab of EIA 860 in which the generator was listed.

capacity_mw

Total installed (nameplate) capacity, in megawatts.

fuel_type_code_pudl

Simplified fuel type code used in PUDL

planned_generator_retirement_date

Planned effective date of the scheduled retirement of the generator.

capacity_factor

Fraction of potential generation that was actually reported for a plant part.
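As a rough sketch of what this fraction means for a monthly record, assume potential generation is nameplate capacity multiplied by the number of hours in the report month. That assumption, the function names, and the sample numbers are illustrative only and may not match PUDL's exact calculation.

```python
import calendar
from datetime import date
from typing import Optional


def hours_in_month(report_date: date) -> int:
    """Number of hours in the calendar month containing report_date."""
    days = calendar.monthrange(report_date.year, report_date.month)[1]
    return days * 24


def capacity_factor(
    net_generation_mwh: Optional[float],
    capacity_mw: Optional[float],
    report_date: date,
) -> Optional[float]:
    """Actual generation divided by potential generation; None if inputs are missing."""
    if net_generation_mwh is None or not capacity_mw:
        return None
    return net_generation_mwh / (capacity_mw * hours_in_month(report_date))


# Hypothetical 100 MW generator producing 37,200 MWh in January 2020 (744 hours).
cf = capacity_factor(37_200.0, 100.0, date(2020, 1, 1))
```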

fuel_cost_per_mmbtu_source

Indicates the source of the values in the fuel_cost_per_mmbtu column. The fuel cost either comes directly from the EIA forms (original), was filled in from the EIA's API using state-level averages (eiaapi), was filled in using a rolling average (rolling_avg), or reflects records that were aggregated together and contain multiple sources (mixed).

fuel_cost_per_mmbtu

Average fuel cost per mmBTU of heat content in nominal USD.

fuel_cost_per_mwh

Derived from MCOE, a unit level value. Average fuel cost per MWh of electricity generated, in nominal USD.

unit_heat_rate_mmbtu_per_mwh

Fuel content per unit of electricity generated. Coming from MCOE calculation.

net_generation_mwh

Net electricity generation for the specified period in megawatt-hours (MWh).

total_fuel_cost

Total annual reported fuel costs for the plant part. Includes costs from all fuels.

total_mmbtu

Total annual heat content of fuel consumed by a plant part record in the plant parts list.

associated_combined_heat_power

Indicates whether the generator is associated with a combined heat and power system

bga_source

The source from which the unit_id_pudl was compiled. The unit_id_pudl comes directly from EIA 860, from string association (which looks at all the boilers and generators that are not associated with a unit and tries to find a matching string in the respective collection of boilers or generators), or from a unit connection (where the unit_id_eia is used to find additional boiler-generator connections).

bypass_heat_recovery

Can this generator operate while bypassing the heat recovery steam generator?

carbon_capture

Indicates whether the generator uses carbon capture technology.

city

Name of the city.

can_cofire_fuels

Whether the generator can co-fire fuels.

county

County name.

current_planned_generator_operating_date

The most recently updated effective date on which the generator is scheduled to start operation

data_maturity

Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.

deliver_power_transgrid

Indicates whether the generator can deliver power to the transmission grid.

distributed_generation

Whether the generator is considered distributed generation

duct_burners

Indicates whether the unit has duct-burners for supplementary firing of the turbine exhaust gas

energy_source_1_transport_1

Primary mode of transport for energy source 1.

energy_source_1_transport_2

Secondary mode of transport for energy source 1.

energy_source_1_transport_3

Tertiary mode of transport for energy source 1.

energy_source_2_transport_1

Primary mode of transport for energy source 2.

energy_source_2_transport_2

Secondary mode of transport for energy source 2.

energy_source_2_transport_3

Tertiary mode of transport for energy source 2.

energy_source_code_2

The code representing the second most predominant type of energy that fuels the generator

energy_source_code_3

The code representing the third most predominant type of energy that fuels the generator

energy_source_code_4

The code representing the fourth most predominant type of energy that fuels the generator

energy_source_code_5

The code representing the fifth most predominant type of energy that fuels the generator

energy_source_code_6

The code representing the sixth most predominant type of energy that fuels the generator

energy_storage_capacity_mwh

Energy storage capacity in MWh (e.g. for batteries).

ferc_qualifying_facility

Indicates whether or not a generator is a qualifying FERC cogeneration facility.

fluidized_bed_tech

Indicates whether the generator uses fluidized bed technology

fuel_type_count

A count of how many different simple energy sources there are associated with a generator.

latitude

Latitude of the plant's location, in degrees.

longitude

Longitude of the plant's location, in degrees.

minimum_load_mw

The minimum load at which the generator can operate continuously.

can_burn_multiple_fuels

Whether the generator can burn multiple fuels.

nameplate_power_factor

The nameplate power factor of the generator.

net_capacity_mwdc

Generation capacity in megawatts of direct current that is subject to a net metering agreement. Typically used for behind-the-meter solar PV.

can_switch_when_operating

Indicates whether a fuel switching generator can switch fuels while operating.

operational_status_code

The operating status of the asset.

original_planned_generator_operating_date

The date the generator was originally scheduled to be operational

other_combustion_tech

Indicates whether the generator uses other combustion technologies

other_modifications_date

Planned effective date that the generator is scheduled to enter commercial operation after any other planned modification is complete.

other_planned_modifications

Indicates whether there are other modifications planned for the generator.

owned_by_non_utility

Whether any part of the generator is owned by a non-utility

ownership_code

Identifies the ownership for each generator.

planned_derate_date

Planned effective month that the generator is scheduled to enter operation after the derate modification.

planned_energy_source_code_1

New energy source code for the planned repowered generator.

planned_modifications

Indicates whether there are any planned capacity uprates/derates, repowering, other modifications, or generator retirements scheduled for the next 5 years.

planned_net_summer_capacity_derate_mw

Decrease in summer capacity expected to be realized from the derate modification to the equipment.

planned_net_summer_capacity_uprate_mw

Increase in summer capacity expected to be realized from the modification to the equipment.

planned_net_winter_capacity_derate_mw

Decrease in winter capacity expected to be realized from the derate modification to the equipment.

planned_net_winter_capacity_uprate_mw

Increase in winter capacity expected to be realized from the uprate modification to the equipment.

planned_new_capacity_mw

The expected new nameplate capacity for the generator.

planned_new_prime_mover_code

New prime mover for the planned repowered generator.

planned_repower_date

Planned effective date that the generator is scheduled to enter operation after the repowering is complete.

planned_uprate_date

Planned effective date that the generator is scheduled to enter operation after the uprate modification.

previously_canceled

Indicates whether the generator was previously reported as indefinitely postponed or canceled

pulverized_coal_tech

Indicates whether the generator uses pulverized coal technology

reactive_power_output_mvar

Reactive Power Output (MVAr)

rto_iso_lmp_node_id

The designation used to identify the price node in RTO/ISO Locational Marginal Price reports

rto_iso_location_wholesale_reporting_id

The designation used to report the specific location of the wholesale sales transactions to FERC for the Electric Quarterly Report

solid_fuel_gasification

Indicates whether the generator is part of a solid fuel gasification system

startup_source_code_1

The code representing the first, second, third or fourth start-up and flame stabilization energy source used by the combustion unit(s) associated with this generator.

startup_source_code_2

The code representing the first, second, third or fourth start-up and flame stabilization energy source used by the combustion unit(s) associated with this generator.

startup_source_code_3

The code representing the first, second, third or fourth start-up and flame stabilization energy source used by the combustion unit(s) associated with this generator.

startup_source_code_4

The code representing the first, second, third or fourth start-up and flame stabilization energy source used by the combustion unit(s) associated with this generator.

state

Two letter US state abbreviation.

stoker_tech

Indicates whether the generator uses stoker technology

street_address

Physical street address.

subcritical_tech

Indicates whether the generator uses subcritical technology

summer_capacity_estimate

Whether the summer capacity value was an estimate

summer_capacity_mw

The net summer capacity.

summer_estimated_capability_mw

EIA estimated summer capacity (in MW).

supercritical_tech

Indicates whether the generator uses supercritical technology

can_switch_oil_gas

Whether the generator can switch between oil and natural gas.

synchronized_transmission_grid

Indicates whether standby generators (SB status) can be synchronized to the grid.

time_cold_shutdown_full_load_code

The minimum amount of time required to bring the unit to full load from shutdown.

timezone

IANA timezone name

topping_bottoming_code

If the generator is associated with a combined heat and power system, indicates whether the generator is part of a topping cycle or a bottoming cycle

turbines_inverters_hydrokinetics

Number of wind turbines, or hydrokinetic buoys.

turbines_num

Number of wind turbines, or hydrokinetic buoys.

ultrasupercritical_tech

Indicates whether the generator uses ultra-supercritical technology

uprate_derate_completed_date

The date when the uprate or derate was completed.

uprate_derate_during_year

Was an uprate or derate completed on this generator during the reporting year?

winter_capacity_estimate

Whether the winter capacity value was an estimate

winter_capacity_mw

The net winter capacity.

winter_estimated_capability_mw

EIA estimated winter capacity (in MW).

zip_code

Five digit US Zip Code.

out_eia923__fuel_receipts_costs

package: pudl

Fuel deliveries to power plants.

Processing:

Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.

Source:

EIA Form 923 -- Power Plant Operations Report (Schedule 2 - Part A)

Primary key:

This table has no primary key. Each record describes an individual fuel delivery. There can be multiple deliveries of the same type of fuel from the same supplier to the same plant in a single month, so the table has no natural primary key.

Usage Warnings

  • Date column arbitrarily uses the first of the month.

  • Some values have been redacted.

  • Time of fuel deliveries is not necessarily connected with time of fuel consumption.

  • Data has been drawn from several EIA sources which are not always consistent with each other, and PUDL chooses the most consistent or relevant value to facilitate cross-referencing even if that means some values will differ from the raw sources. See Harvesting for details.

Additional Details

There can be a significant delay between the receipt of fuel and its consumption, so using this table to infer monthly attributes associated with power generation may not be entirely accurate. However, this is the most granular data we have describing fuel costs, and we use it in calculating the marginal cost of electricity for individual generation units.
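Because each record is a single delivery, any monthly figure has to be aggregated from several rows. The sketch below computes a heat-content-weighted average fuel cost per plant, month, and energy source. It uses column names from this table, but the sample records are invented, and weighting by fuel_received_units times fuel_mmbtu_per_unit is an assumption for illustration rather than a description of PUDL's own method.

```python
from collections import defaultdict

# Hypothetical deliveries: two coal deliveries and one gas delivery with no price.
deliveries = [
    {"report_date": "2020-01-01", "plant_id_eia": 3, "energy_source_code": "BIT",
     "fuel_received_units": 1000.0, "fuel_mmbtu_per_unit": 20.0,
     "fuel_cost_per_mmbtu": 2.0},
    {"report_date": "2020-01-01", "plant_id_eia": 3, "energy_source_code": "BIT",
     "fuel_received_units": 3000.0, "fuel_mmbtu_per_unit": 20.0,
     "fuel_cost_per_mmbtu": 3.0},
    {"report_date": "2020-01-01", "plant_id_eia": 3, "energy_source_code": "NG",
     "fuel_received_units": 500.0, "fuel_mmbtu_per_unit": 1.0,
     "fuel_cost_per_mmbtu": None},
]


def monthly_weighted_cost(records):
    """Average cost per MMBtu, weighted by delivered heat content, per group."""
    totals = defaultdict(lambda: [0.0, 0.0])  # key -> [total cost USD, total MMBtu]
    for r in records:
        if r["fuel_cost_per_mmbtu"] is None:
            continue  # skip deliveries without a price
        mmbtu = r["fuel_received_units"] * r["fuel_mmbtu_per_unit"]
        key = (r["report_date"], r["plant_id_eia"], r["energy_source_code"])
        totals[key][0] += mmbtu * r["fuel_cost_per_mmbtu"]
        totals[key][1] += mmbtu
    return {k: cost / mmbtu for k, (cost, mmbtu) in totals.items() if mmbtu > 0}


weighted = monthly_weighted_cost(deliveries)
```

Keep in mind the caveat above: the month in which fuel is delivered is not necessarily the month in which it is burned.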

Under some circumstances utilities are allowed to treat the price of fuel as proprietary business data, meaning it is redacted from the publicly available spreadsheets. It's still reported to EIA and influences the aggregated (state, region, annual, etc.) fuel prices they publish. From 2009-2021 about 1/3 of all prices are redacted. The missing data is not randomly distributed. Deregulated markets dominated by merchant generators (independent power producers) redact much more data, and natural gas is by far the most likely fuel to have its price redacted. This means, for instance, that the entire Northeastern US reports essentially no fine-grained data about its natural gas prices.
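One way to gauge how uneven the reported price coverage is in a given extract is to count, per energy source, the share of records whose price did not come directly from the EIA forms. The sketch below uses the fuel_cost_per_mmbtu_source column described in this table. The sample records are invented, and treating every value other than original as a filled-in price is an assumption.

```python
from collections import Counter

# Hypothetical records: gas prices are mostly filled in, coal prices are reported.
deliveries = [
    {"energy_source_code": "NG", "fuel_cost_per_mmbtu_source": "eiaapi"},
    {"energy_source_code": "NG", "fuel_cost_per_mmbtu_source": "rolling_avg"},
    {"energy_source_code": "NG", "fuel_cost_per_mmbtu_source": "original"},
    {"energy_source_code": "NG", "fuel_cost_per_mmbtu_source": "eiaapi"},
    {"energy_source_code": "BIT", "fuel_cost_per_mmbtu_source": "original"},
    {"energy_source_code": "BIT", "fuel_cost_per_mmbtu_source": "original"},
]


def share_not_original(records):
    """Fraction of records per energy source whose price is not directly reported."""
    total = Counter()
    filled = Counter()
    for r in records:
        code = r["energy_source_code"]
        total[code] += 1
        if r["fuel_cost_per_mmbtu_source"] != "original":
            filled[code] += 1
    return {code: filled[code] / n for code, n in total.items()}


shares = share_not_original(deliveries)
```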

Additional data which we haven't yet integrated is available in a similar format from 2002-2008 via the EIA-423, and going back as far as 1972 from the FERC-423.

This table inherits canonicalized values for plants and utilities. EIA reports many attributes in many different tables across EIA-860 and EIA-923. In order to compile tidy, well-normalized database tables, PUDL collects all instances of these values and chooses a canonical value. By default, PUDL chooses the most consistently reported value of a given attribute as long as it is at least 70% of the given instances reported. If an attribute was reported inconsistently across the original EIA tables, then it will show up as a null value.

Columns
report_date

Date reported.

plant_id_eia

The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.

plant_id_pudl

A manually assigned PUDL plant ID. May not be constant over time.

plant_name_eia

Plant name.

utility_id_eia

The EIA Utility Identification number.

utility_id_pudl

A manually assigned PUDL utility ID. May not be stable over time.

utility_name_eia

The name of the utility.

state

Two letter US state abbreviation.

contract_type_code

Purchase type under which receipts occurred in the reporting month. C: Contract, NC: New Contract, S: Spot Purchase, T: Tolling Agreement.

contract_expiration_date

Date contract expires. Format: MMYY.

energy_source_code

A 2-3 letter code indicating the energy source (e.g. fuel type) associated with the record.

fuel_type_code_pudl

Simplified fuel type code used in PUDL

fuel_group_code

Fuel groups used in the Electric Power Monthly

supplier_name

Company that sold the fuel to the plant or, in the case of Natural Gas, pipeline owner.

fuel_received_units

Quantity of fuel received in tons, barrels, or Mcf.

fuel_mmbtu_per_unit

Heat content of the fuel in millions of Btus per physical unit.

fuel_cost_per_mmbtu

Average fuel cost per mmBTU of heat content in nominal USD.

bulk_agg_fuel_cost_per_mmbtu

Fuel cost per mmbtu reported in the EIA bulk electricity data. This is an aggregate average fuel price for a whole state, region, month, sector, etc. Used to fill in missing fuel prices.

fuel_consumed_mmbtu

Total consumption of fuel in physical unit, year to date. Note: this is the total quantity consumed for both electricity and, in the case of combined heat and power plants, process steam production.

total_fuel_cost

Total annual reported fuel costs for the plant part. Includes costs from all fuels.

fuel_cost_per_mmbtu_source

Indicates the source of the values in the fuel_cost_per_mmbtu column. The fuel cost either comes directly from the EIA forms (original), was filled in from the EIA's API using state-level averages (eiaapi), was filled in using a rolling average (rolling_avg), or reflects records that were aggregated together and contain multiple sources (mixed).

sulfur_content_pct

Sulfur content percentage by weight to the nearest 0.01 percent.

ash_content_pct

Ash content percentage by weight to the nearest 0.1 percent.

mercury_content_ppm

Mercury content in parts per million (ppm) to the nearest 0.001 ppm.

primary_transportation_mode_code

Transportation mode for the longest distance transported.

secondary_transportation_mode_code

Transportation mode for the second longest distance transported.

natural_gas_transport_code

Contract type for natural gas transportation service.

natural_gas_delivery_contract_type_code

Contract type for natural gas delivery service.

moisture_content_pct

For coal only: the moisture content of the fuel in terms of moisture percentage by weight. Reported to the nearest 0.01 percent.

chlorine_content_ppm

For coal only: the chlorine content in parts per million (ppm) to the nearest 0.001 ppm. If lab tests of the coal do not include the chlorine content, this field contains the amount specified in the contract with the supplier.

data_maturity

Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.

mine_id_msha

MSHA issued mine identifier.

mine_name

Coal mine name.

mine_state

State where the coal mine is located. Two letter abbreviation.

coalmine_county_id_fips

County ID from the Federal Information Processing Standard Publication 6-4. This is the county where the coal mine is located.

mine_type_code

Type of coal mine.

out_ferc1__yearly_all_plants

package: pudl

Annual time series of plants reported in the steam, small generators, hydro, and pumped storage tables.

Processing:

Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.

Source:

FERC Form 1 -- Annual Report of Major Electric Utilities (Schedules 402, 406, 408 and 410)

Primary key:

record_id

Usage Warnings

  • Not all columns are originally reported in all of the input plant tables. Expect nulls.

  • FERC does not require respondents to report unique, non-duplicative plant records. Respondents sporadically report portions of a plant along with the plant total (e.g. unit 1, unit 2, and total). Use caution when aggregating.

  • FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, unlabeled total rows, and missing IDs. See Notable Irregularities for details.

Additional Details

This table is a concatenation of the following plant tables:

  • core_ferc1__yearly_steam_plants_sched402

  • core_ferc1__yearly_hydroelectric_plants_sched406

  • core_ferc1__yearly_small_plants_sched410

  • core_ferc1__yearly_pumped_storage_plants_sched408
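The effect of concatenating tables with different columns can be sketched as follows. This is an illustration only: the rows are invented, each source table is reduced to a couple of columns, and the helper concat_records is hypothetical. It shows why columns that exist in only one source table are null for rows from the others.

```python
def concat_records(tables):
    """Stack rows from several tables, filling columns a table lacks with None."""
    # Collect the union of column names, preserving first-seen order.
    columns = []
    for rows in tables.values():
        for row in rows:
            for col in row:
                if col not in columns:
                    columns.append(col)
    combined = []
    for rows in tables.values():
        for row in rows:
            combined.append({col: row.get(col) for col in columns})
    return combined


# Hypothetical single-row inputs; real tables have many more columns.
tables = {
    "core_ferc1__yearly_steam_plants_sched402": [
        {"record_id": "steam_a", "capacity_mw": 500.0, "opex_boiler": 1200.0},
    ],
    "core_ferc1__yearly_hydroelectric_plants_sched406": [
        {"record_id": "hydro_a", "capacity_mw": 80.0, "opex_dams": 300.0},
    ],
}

all_plants = concat_records(tables)
steam_row, hydro_row = all_plants
```

Here steam_row has no value for opex_dams and hydro_row has no value for opex_boiler, which is the pattern of nulls the usage warning refers to.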

Columns
report_year

Four-digit year in which the data was reported.

utility_id_ferc1

PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.

utility_id_ferc1_dbf

FERC-assigned respondent_id from DBF reporting years, identifying the reporting entity. Stable from year to year.

utility_id_ferc1_xbrl

FERC-assigned entity_id from XBRL reporting years, identifying the reporting entity. Stable from year to year.

utility_id_pudl

A manually assigned PUDL utility ID. May not be stable over time.

utility_name_ferc1

Name of the responding utility, as it is reported in FERC Form 1. For human readability only.

plant_id_pudl

A manually assigned PUDL plant ID. May not be constant over time.

plant_id_ferc1

Algorithmically assigned PUDL FERC Plant ID. WARNING: NOT STABLE BETWEEN PUDL DB INITIALIZATIONS.

plant_name_ferc1

Name of the plant, as reported to FERC. This is a freeform string, not guaranteed to be consistent across references to the same plant.

asset_retirement_cost

Asset retirement cost (USD).

avg_num_employees

The average number of employees assigned to each plant.

capacity_factor

Fraction of potential generation that was actually reported for a plant part.

capacity_mw

Total installed (nameplate) capacity, in megawatts.

capex_annual_addition

Annual capital addition into capex_total.

capex_annual_addition_rolling

Year-to-date capital addition into capex_total.

capex_annual_per_kw

Annual capital addition into capex_total per kw.

capex_annual_per_mw

Annual capital addition into capex_total per MW.

capex_annual_per_mw_rolling

Year-to-date capital addition into capex_total per MW.

capex_annual_per_mwh

Annual capital addition into capex_total per MWh.

capex_annual_per_mwh_rolling

Year-to-date capital addition into capex_total per MWh.

capex_equipment

Cost of plant: equipment (USD).

capex_land

Cost of plant: land and land rights (USD).

capex_per_mw

Cost of plant per megawatt of installed (nameplate) capacity. Nominal USD.

capex_structures

Cost of plant: structures and improvements (USD).

capex_total

Total cost of plant (USD).

capex_wo_retirement_total

Total cost of plant (USD) without retirements.

construction_type

Type of plant construction ('outdoor', 'semioutdoor', or 'conventional'). Categorized by PUDL based on our best guess of intended value in FERC1 freeform strings.

construction_year

Year the plant's oldest still operational unit was built.

installation_year

Year the plant's most recently built unit was installed.

net_generation_mwh

Net electricity generation for the specified period in megawatt-hours (MWh).

not_water_limited_capacity_mw

Plant capacity in MW when not limited by condenser water.

opex_allowances

Allowances.

opex_boiler

Maintenance of boiler (or reactor) plant.

opex_coolants

Cost of coolants and water (nuclear plants only)

opex_electric

Production expenses: electric expenses (USD).

opex_engineering

Production expenses: maintenance, supervision, and engineering (USD).

opex_fuel

Production expenses: fuel (USD).

fuel_cost_per_mwh

Derived from MCOE, a unit level value. Average fuel cost per MWh of electricity generated, in nominal USD.

opex_misc_power

Miscellaneous steam (or nuclear) expenses.

opex_misc_steam

Maintenance of miscellaneous steam (or nuclear) plant.

opex_nonfuel_per_mwh

Investments in non-fuel production expenses per MWh.

opex_operations

Production expenses: operations, supervision, and engineering (USD).

opex_per_mwh

Total production expenses (USD per MWh generated).

opex_plant

Production expenses: maintenance of electric plant (USD).

opex_production_total

Total operating expenses.

opex_rents

Production expenses: rents (USD).

opex_steam

Steam expenses.

opex_steam_other

Steam from other sources.

opex_structures

Production expenses: maintenance of structures (USD).

opex_total_nonfuel

Total production expenses, excluding fuel (USD).

opex_transfer

Steam transferred (Credit).

peak_demand_mw

Net peak demand for 60 minutes. Note: in some cases peak demand for other time periods may have been reported instead, if hourly peak demand was unavailable.

plant_capability_mw

Net plant capability in megawatts.

plant_hours_connected_while_generating

Hours the plant was connected to load while generating in the report year.

plant_type

Type of plant.

record_id

Identifier indicating original FERC Form 1 source record. Format: {table_name}_{report_year}_{report_prd}_{respondent_id}_{spplmnt_num}_{row_number}. Unique within FERC Form 1 DB tables which are not row-mapped.
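For IDs that follow the pattern above, the components can be recovered by splitting from the right, since the table name may itself contain underscores. This is a sketch under the assumption that the five trailing fields are all integers; the class name, function name, and example ID are hypothetical, and real IDs may not all follow this pattern.

```python
from typing import NamedTuple


class Ferc1RecordId(NamedTuple):
    table_name: str
    report_year: int
    report_prd: int
    respondent_id: int
    spplmnt_num: int
    row_number: int


def parse_record_id(record_id: str) -> Ferc1RecordId:
    """Split a record_id into its parts, taking the last five fields as integers."""
    parts = record_id.rsplit("_", 5)
    if len(parts) != 6:
        raise ValueError(f"unexpected record_id format: {record_id!r}")
    table_name, *numbers = parts
    # int() raises ValueError if a trailing field is not numeric.
    return Ferc1RecordId(table_name, *(int(n) for n in numbers))


# Hypothetical ID whose table name contains an underscore.
parsed = parse_record_id("f1_steam_2019_12_134_0_1")
```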

water_limited_capacity_mw

Plant capacity in MW when limited by condenser water.

fuel_cost_per_mmbtu

Average fuel cost per mmBTU of heat content in nominal USD.

fuel_type

Type of fuel.

license_id_ferc1

FERC issued operating license ID for the facility, if available. This value is extracted from the original plant name where possible.

opex_maintenance

Production expenses: Maintenance (USD).

opex_total

Total production expenses, excluding fuel (USD).

capex_facilities

Cost of plant: reservoirs, dams, and waterways (USD).

capex_roads

Cost of plant: roads, railroads, and bridges (USD).

net_capacity_adverse_conditions_mw

Net plant capability under the least favorable operating conditions, in megawatts.

net_capacity_favorable_conditions_mw

Net plant capability under the most favorable operating conditions, in megawatts.

opex_dams

Production expenses: maintenance of reservoirs, dams, and waterways (USD).

opex_generation_misc

Production expenses: miscellaneous power generation expenses (USD).

opex_hydraulic

Production expenses: hydraulic expenses (USD).

opex_misc_plant

Production expenses: maintenance of miscellaneous hydraulic plant (USD).

opex_water_for_power

Production expenses: water for power (USD).

ferc_license_id

The FERC license ID of a project.

capex_equipment_electric

Cost of plant: accessory electric equipment (USD).

capex_equipment_misc

Cost of plant: miscellaneous power plant equipment (USD).

capex_wheels_turbines_generators

Cost of plant: water wheels, turbines, and generators (USD).

energy_used_for_pumping_mwh

Energy used for pumping, in megawatt-hours.

net_load_mwh

Net output for load (net generation - energy used for pumping) in megawatt-hours.

opex_production_before_pumping

Total production expenses before pumping (USD).

opex_pumped_storage

Production expenses: pumped storage (USD).

opex_pumping

Production expenses: pumping (USD).

out_eia__yearly_generators

package: pudl

Annual time series of all generator attributes including calculated capacity factor, heat rate, fuel cost per MMBTU and fuel cost per MWh.

Processing:

Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.

Source:

EIA -- Mix of multiple EIA Forms

Primary key:

report_date, plant_id_eia, generator_id

Usage Warnings

  • Contains estimated values.

  • Due to coverage problems in other tables, some generators have heat rate estimates but not fuel cost estimates.

  • Not all columns are originally reported in or calculable from the input tables. Expect nulls.

  • Data has been drawn from several EIA sources which are not always consistent with each other, and PUDL chooses the most consistent or relevant value to facilitate cross-referencing even if that means some values will differ from the raw sources. See Harvesting for details.

  • Contains information from multiple raw inputs.

Additional Details

This table includes all yearly attributes for all generators reported to EIA-860 and EIA-923.

To provide a complete picture of generator data, this table compiles data from many different EIA tables, including directly reported, imputed and calculated columns. We suggest using this table if you want to explore yearly attributes about generators and would rather use a more complete and denormalized table. If you are more interested in the originally reported values, we recommend searching for core_eia with the column you are most interested in.

The calculations of capacity factor, heat rate, fuel cost per MMBTU and fuel cost per MWh are based on the allocation of net generation reported on the basis of plant, prime mover and energy source to individual generators. That allocation comes from the out_eia923__yearly_generation_fuel_by_generator table.

Yearly heat rates for each generator are estimated from the allocated per-generator net generation and fuel consumption, together with the core_eia923__monthly_boiler_fuel table, which reports fuel consumed by boiler. Heat rates are necessary to estimate the amount of fuel consumed by a generation unit, and thus the fuel cost per MWh generated.

Plant specific fuel prices are taken from the core_eia923__monthly_fuel_receipts_costs table, which only has ~70% coverage, leading to some generators with heat rate estimates still lacking fuel cost estimates.

This table inherits canonicalized values for generators, plants, and utilities. EIA reports many attributes in many different tables across EIA-860 and EIA-923. In order to compile tidy, well-normalized database tables, PUDL collects all instances of these values and chooses a canonical value. By default, PUDL chooses the most consistently reported value of a given attribute as long as it accounts for at least 70% of the reported instances. If an attribute was reported inconsistently across the original EIA tables, then it will show up as a null value.
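
As a rough sketch of the selection rule described above (not PUDL's actual harvesting code), the following picks the most frequently reported value and returns a null when it falls short of the 70% threshold. The function name, the handling of missing values, and the sample data are assumptions made for illustration.

```python
from collections import Counter
from typing import Hashable, Iterable, Optional


def canonical_value(
    values: Iterable[Optional[Hashable]], threshold: float = 0.7
) -> Optional[Hashable]:
    """Return the most common reported value if it meets the consistency threshold."""
    # Assumption: missing values are ignored rather than counted as instances.
    reported = [v for v in values if v is not None]
    if not reported:
        return None
    value, count = Counter(reported).most_common(1)[0]
    return value if count / len(reported) >= threshold else None


# Hypothetical reported plant names collected from several tables.
consistent = canonical_value(["Plant A"] * 8 + ["Plant a"] * 2)
inconsistent = canonical_value(["A", "A", "B", "B"])
```

Here the first attribute is reported the same way in 80% of instances and is kept, while the second is split evenly and becomes null.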

Columns
plant_id_eia

The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.

generator_id

Generator ID is usually numeric, but sometimes includes letters. Make sure you treat it as a string!
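
A small illustration of why this matters, using made-up IDs and only the Python standard library: reading the column as text keeps every ID intact, while converting a numeric-looking ID to an integer changes it.

```python
import csv
import io

# Hypothetical sample rows; these generator IDs are made up for illustration.
raw = "plant_id_eia,generator_id\n3,0001\n3,1A\n"

rows = list(csv.DictReader(io.StringIO(raw)))

# csv.DictReader yields every field as a string, so IDs are preserved as written.
as_strings = [row["generator_id"] for row in rows]

# Round-tripping a numeric-looking ID through int drops its leading zeros.
lossy = str(int(as_strings[0]))
```

With pandas, the same effect can be had by passing dtype={"generator_id": str} to read_csv.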

report_date

Date reported.

unit_id_pudl

Dynamically assigned PUDL unit id. WARNING: This ID is not guaranteed to be static long term as the input data and algorithm may evolve over time.

plant_id_pudl

A manually assigned PUDL plant ID. May not be constant over time.

plant_name_eia

Plant name.

utility_id_eia

The EIA Utility Identification number.

utility_id_pudl

A manually assigned PUDL utility ID. May not be stable over time.

utility_name_eia

The name of the utility.

balancing_authority_code_eia

EIA short code identifying a balancing authority. May include Canadian and Mexican BAs.

balancing_authority_name_eia

Name of the balancing authority.

technology_description

High level description of the technology used by the generator to produce electricity.

energy_source_code_1

The code representing the most predominant type of energy that fuels the generator.

prime_mover_code

Code for the type of prime mover (e.g. CT, CG)

generator_operating_date

Date the generator began commercial operation. If harvested values are inconsistent, we default to using the most recently reported date.

generator_retirement_date

Date of the scheduled or effected retirement of the generator.

operational_status

The operating status of the asset. For generators this is based on which tab of EIA 860 the generator was listed in.

capacity_mw

Total installed (nameplate) capacity, in megawatts.

fuel_type_code_pudl

Simplified fuel type code used in PUDL

planned_generator_retirement_date

Planned effective date of the scheduled retirement of the generator.

capacity_factor

Fraction of potential generation that was actually reported for a plant part.
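
One way to read this definition, sketched under the assumption that potential generation is nameplate capacity multiplied by the number of hours in the reporting year. The function names are hypothetical and the figures are made up; PUDL's own calculation may differ in detail.

```python
import calendar
from typing import Optional


def hours_in_year(year: int) -> int:
    """Number of hours in a calendar year, accounting for leap years."""
    return (366 if calendar.isleap(year) else 365) * 24


def capacity_factor(
    net_generation_mwh: float, capacity_mw: float, year: int
) -> Optional[float]:
    """Reported net generation as a fraction of potential generation."""
    potential_mwh = capacity_mw * hours_in_year(year)
    if potential_mwh <= 0:
        return None  # undefined without positive capacity
    return net_generation_mwh / potential_mwh


# A 100 MW generator producing 438,000 MWh in a non-leap year.
cf = capacity_factor(net_generation_mwh=438_000.0, capacity_mw=100.0, year=2021)
```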

fuel_cost_per_mmbtu_source

Indicates the source of the values in the fuel_cost_per_mmbtu column. The fuel cost either comes directly from the EIA forms (original), was filled in from the EIA's API using state-level averages (eiaapi), was filled in using a rolling average (rolling_avg), or was aggregated from records with multiple sources (mixed).
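
The labelling of aggregated records can be pictured with a short sketch. This is an assumption about how such a rule could work, not PUDL's code: the function name is hypothetical, and only the four labels named above are taken from the description.

```python
from typing import Iterable, Optional


def combine_fuel_cost_sources(sources: Iterable[Optional[str]]) -> Optional[str]:
    """Label an aggregated record by the sources of the records it combines."""
    distinct = {s for s in sources if s is not None}
    if not distinct:
        return None  # no fuel cost source available for any record
    if len(distinct) == 1:
        return next(iter(distinct))  # all records share a single source
    return "mixed"


same = combine_fuel_cost_sources(["original", "original"])
several = combine_fuel_cost_sources(["original", "eiaapi", "rolling_avg"])
```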

fuel_cost_per_mmbtu

Average fuel cost per mmBTU of heat content in nominal USD.

fuel_cost_per_mwh

Derived from MCOE, a unit level value. Average fuel cost per MWh of heat content in nominal USD.

unit_heat_rate_mmbtu_per_mwh

Fuel content per unit of electricity generated. Coming from MCOE calculation.

net_generation_mwh

Net electricity generation for the specified period in megawatt-hours (MWh).

total_fuel_cost

Total annual reported fuel costs for the plant part. Includes costs from all fuels.

total_mmbtu

Total annual heat content of fuel consumed by a plant part record in the plant parts list.

associated_combined_heat_power

Indicates whether the generator is associated with a combined heat and power system

bga_source

The source from which the unit_id_pudl is compiled. The unit_id_pudl comes either directly from EIA 860, from string association (which looks at all the boilers and generators that are not associated with a unit and tries to find a matching string in the respective collection of boilers or generators), or from a unit connection (where the unit_id_eia is employed to find additional boiler-generator connections).

bypass_heat_recovery

Can this generator operate while bypassing the heat recovery steam generator?

carbon_capture

Indicates whether the generator uses carbon capture technology.

city

Name of the city.

can_cofire_fuels

Whether the generator can co-fire fuels.

county

County name.

current_planned_generator_operating_date

The most recently updated effective date on which the generator is scheduled to start operation

data_maturity

Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.

deliver_power_transgrid

Indicates whether the generator can deliver power to the transmission grid.

distributed_generation

Whether the generator is considered distributed generation

duct_burners

Indicates whether the unit has duct-burners for supplementary firing of the turbine exhaust gas

energy_source_1_transport_1

Primary mode of transport for energy source 1.

energy_source_1_transport_2

Secondary mode of transport for energy source 1.

energy_source_1_transport_3

Tertiary mode of transport for energy source 1.

energy_source_2_transport_1

Primary mode of transport for energy source 2.

energy_source_2_transport_2

Secondary mode of transport for energy source 2.

energy_source_2_transport_3

Tertiary mode of transport for energy source 2.

energy_source_code_2

The code representing the second most predominant type of energy that fuels the generator

energy_source_code_3

The code representing the third most predominant type of energy that fuels the generator

energy_source_code_4

The code representing the fourth most predominant type of energy that fuels the generator

energy_source_code_5

The code representing the fifth most predominant type of energy that fuels the generator

energy_source_code_6

The code representing the sixth most predominant type of energy that fuels the generator

energy_storage_capacity_mwh

Energy storage capacity in MWh (e.g. for batteries).

ferc_qualifying_facility

Indicates whether or not a generator is a qualifying FERC cogeneration facility.

fluidized_bed_tech

Indicates whether the generator uses fluidized bed technology

fuel_type_count

A count of how many different simple energy sources there are associated with a generator.

latitude

Latitude of the plant's location, in degrees.

longitude

Longitude of the plant's location, in degrees.

minimum_load_mw

The minimum load at which the generator can operate continuously.

can_burn_multiple_fuels

Whether the generator can burn multiple fuels.

nameplate_power_factor

The nameplate power factor of the generator.

net_capacity_mwdc

Generation capacity in megawatts of direct current that is subject to a net metering agreement. Typically used for behind-the-meter solar PV.

can_switch_when_operating

Indicates whether a fuel switching generator can switch fuels while operating.

operational_status_code

The operating status of the asset.

original_planned_generator_operating_date

The date the generator was originally scheduled to be operational

other_combustion_tech

Indicates whether the generator uses other combustion technologies

other_modifications_date

Planned effective date that the generator is scheduled to enter commercial operation after any other planned modification is complete.

other_planned_modifications

Indicates whether there are other modifications planned for the generator.

owned_by_non_utility

Whether any part of the generator is owned by a nonutility

ownership_code

Identifies the ownership for each generator.

planned_derate_date

Planned effective month that the generator is scheduled to enter operation after the derate modification.

planned_energy_source_code_1

New energy source code for the planned repowered generator.

planned_modifications

Indicates whether there are any planned capacity uprates/derates, repowering, other modifications, or generator retirements scheduled for the next 5 years.

planned_net_summer_capacity_derate_mw

Decrease in summer capacity expected to be realized from the derate modification to the equipment.

planned_net_summer_capacity_uprate_mw

Increase in summer capacity expected to be realized from the modification to the equipment.

planned_net_winter_capacity_derate_mw

Decrease in winter capacity expected to be realized from the derate modification to the equipment.

planned_net_winter_capacity_uprate_mw

Increase in winter capacity expected to be realized from the uprate modification to the equipment.

planned_new_capacity_mw

The expected new nameplate capacity for the generator.

planned_new_prime_mover_code

New prime mover for the planned repowered generator.

planned_repower_date

Planned effective date that the generator is scheduled to enter operation after the repowering is complete.

planned_uprate_date

Planned effective date that the generator is scheduled to enter operation after the uprate modification.

previously_canceled

Indicates whether the generator was previously reported as indefinitely postponed or canceled

pulverized_coal_tech

Indicates whether the generator uses pulverized coal technology

reactive_power_output_mvar

Reactive Power Output (MVAr)

rto_iso_lmp_node_id

The designation used to identify the price node in RTO/ISO Locational Marginal Price reports

rto_iso_location_wholesale_reporting_id

The designation used to report the specific location of the wholesale sales transactions to FERC for the Electric Quarterly Report

solid_fuel_gasification

Indicates whether the generator is part of a solid fuel gasification system

startup_source_code_1

The code representing the first, second, third or fourth start-up and flame stabilization energy source used by the combustion unit(s) associated with this generator.

startup_source_code_2

The code representing the first, second, third or fourth start-up and flame stabilization energy source used by the combustion unit(s) associated with this generator.

startup_source_code_3

The code representing the first, second, third or fourth start-up and flame stabilization energy source used by the combustion unit(s) associated with this generator.

startup_source_code_4

The code representing the first, second, third or fourth start-up and flame stabilization energy source used by the combustion unit(s) associated with this generator.

state

Two letter US state abbreviation.

stoker_tech

Indicates whether the generator uses stoker technology

street_address

Physical street address.

subcritical_tech

Indicates whether the generator uses subcritical technology

summer_capacity_estimate

Whether the summer capacity value was an estimate

summer_capacity_mw

The net summer capacity.

summer_estimated_capability_mw

EIA estimated summer capacity (in MW).

supercritical_tech

Indicates whether the generator uses supercritical technology

can_switch_oil_gas

Whether the generator can switch between oil and natural gas.

synchronized_transmission_grid

Indicates whether standby generators (SB status) can be synchronized to the grid.

time_cold_shutdown_full_load_code

The minimum amount of time required to bring the unit to full load from shutdown.

timezone

IANA timezone name

topping_bottoming_code

If the generator is associated with a combined heat and power system, indicates whether the generator is part of a topping cycle or a bottoming cycle

turbines_inverters_hydrokinetics

Number of wind turbines, or hydrokinetic buoys.

turbines_num

Number of wind turbines, or hydrokinetic buoys.

ultrasupercritical_tech

Indicates whether the generator uses ultra-supercritical technology

uprate_derate_completed_date

The date when the uprate or derate was completed.

uprate_derate_during_year

Was an uprate or derate completed on this generator during the reporting year?

winter_capacity_estimate

Whether the winter capacity value was an estimate

winter_capacity_mw

The net winter capacity.

winter_estimated_capability_mw

EIA estimated winter capacity (in MW).

zip_code

Five digit US Zip Code.

out_censusdp1tract__counties

package: pudl

Processing:

Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.

Source:

Census DP1 -- Profile of General Demographic Characteristics

Primary key:

county_id_fips

Additional Details

County-level summarized US Census demographic information, along with a geometry describing county boundaries. Based on the 2010 Census. Used as a convenient source of geospatial information within PUDL, but should not be treated as authoritative.

Columns
county_id_fips

County ID from the Federal Information Processing Standard Publication 6-4.
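
County FIPS codes are best handled as zero-padded strings rather than integers. The sketch below assumes the common five-digit form, in which the first two digits are the state FIPS code and the last three identify the county. The helper names are hypothetical.

```python
def normalize_county_fips(value) -> str:
    """Return a five-character, zero-padded county FIPS string."""
    text = str(value).strip().zfill(5)
    if len(text) != 5 or not text.isdigit():
        raise ValueError(f"not a valid county FIPS code: {value!r}")
    return text


def state_fips(county_fips) -> str:
    """Extract the two-digit state FIPS prefix from a county FIPS code."""
    return normalize_county_fips(county_fips)[:2]


# 08013 is Boulder County, Colorado; stored as an integer it loses its leading zero.
restored = normalize_county_fips(8013)
prefix = state_fips("08013")
```

The two-digit prefix gives a way to relate county records to the state-level table through its state_id_fips column.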

geometry

Geospatial representation of the feature.

county

County name.

functional_status_code_census

The functional status (FUNCSTAT) code defines the current functional status of a geographic entity. These codes can be found in the TIGER/Line products, gazetteer files, and other products.

land_area

Land area in square meters.

water_area

Water area in square meters.
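
Because the areas are in square meters, deriving something like population density needs a unit conversion first. A minimal sketch, assuming total population comes from the dp0010001 column and using made-up numbers (the function name is hypothetical):

```python
from typing import Optional

SQ_M_PER_SQ_KM = 1_000_000


def population_density_per_sq_km(
    total_population: float, land_area_sq_m: float
) -> Optional[float]:
    """People per square kilometer of land area."""
    if land_area_sq_m <= 0:
        return None  # density is undefined without land area
    return total_population / (land_area_sq_m / SQ_M_PER_SQ_KM)


# Hypothetical county: 50,000 people on 2,000 square kilometers of land.
density = population_density_per_sq_km(50_000, 2_000_000_000)
```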

internal_point_latitude

Internal point latitude in decimal degrees.

internal_point_longitude

Internal point longitude in decimal degrees.

shape_length

Length of the feature's perimeter in degrees.

shape_area

Area of the feature in square degrees.

dp0010001

DPSF1. Sex and age - Total population

dp0010002

DPSF1. Sex and age - Total population under 5 years

dp0010003

DPSF1. Sex and age - Total population 5 to 9 years

dp0010004

DPSF1. Sex and age - Total population 10 to 14 years

dp0010005

DPSF1. Sex and age - Total population 15 to 19 years

dp0010006

DPSF1. Sex and age - Total population 20 to 24 years

dp0010007

DPSF1. Sex and age - Total population 25 to 29 years

dp0010008

DPSF1. Sex and age - Total population 30 to 34 years

dp0010009

DPSF1. Sex and age - Total population 35 to 39 years

dp0010010

DPSF1. Sex and age - Total population 40 to 44 years

dp0010011

DPSF1. Sex and age - Total population 45 to 49 years

dp0010012

DPSF1. Sex and age - Total population 50 to 54 years

dp0010013

DPSF1. Sex and age - Total population 55 to 59 years

dp0010014

DPSF1. Sex and age - Total population 60 to 64 years

dp0010015

DPSF1. Sex and age - Total population 65 to 69 years

dp0010016

DPSF1. Sex and age - Total population 70 to 74 years

dp0010017

DPSF1. Sex and age - Total population 75 to 79 years

dp0010018

DPSF1. Sex and age - Total population 80 to 84 years

dp0010019

DPSF1. Sex and age - Total population 85 years and over

dp0010020

DPSF1. Sex and age - Male population total

dp0010021

DPSF1. Sex and age - Male population under 5 years

dp0010022

DPSF1. Sex and age - Male population 5 to 9 years

dp0010023

DPSF1. Sex and age - Male population 10 to 14 years

dp0010024

DPSF1. Sex and age - Male population 15 to 19 years

dp0010025

DPSF1. Sex and age - Male population 20 to 24 years

dp0010026

DPSF1. Sex and age - Male population 25 to 29 years

dp0010027

DPSF1. Sex and age - Male population 30 to 34 years

dp0010028

DPSF1. Sex and age - Male population 35 to 39 years

dp0010029

DPSF1. Sex and age - Male population 40 to 44 years

dp0010030

DPSF1. Sex and age - Male population 45 to 49 years

dp0010031

DPSF1. Sex and age - Male population 50 to 54 years

dp0010032

DPSF1. Sex and age - Male population 55 to 59 years

dp0010033

DPSF1. Sex and age - Male population 60 to 64 years

dp0010034

DPSF1. Sex and age - Male population 65 to 69 years

dp0010035

DPSF1. Sex and age - Male population 70 to 74 years

dp0010036

DPSF1. Sex and age - Male population 75 to 79 years

dp0010037

DPSF1. Sex and age - Male population 80 to 84 years

dp0010038

DPSF1. Sex and age - Male population 85 years and over

dp0010039

DPSF1. Sex and age - Female population total

dp0010040

DPSF1. Sex and age - Female population under 5 years

dp0010041

DPSF1. Sex and age - Female population 5 to 9 years

dp0010042

DPSF1. Sex and age - Female population 10 to 14 years

dp0010043

DPSF1. Sex and age - Female population 15 to 19 years

dp0010044

DPSF1. Sex and age - Female population 20 to 24 years

dp0010045

DPSF1. Sex and age - Female population 25 to 29 years

dp0010046

DPSF1. Sex and age - Female population 30 to 34 years

dp0010047

DPSF1. Sex and age - Female population 35 to 39 years

dp0010048

DPSF1. Sex and age - Female population 40 to 44 years

dp0010049

DPSF1. Sex and age - Female population 45 to 49 years

dp0010050

DPSF1. Sex and age - Female population 50 to 54 years

dp0010051

DPSF1. Sex and age - Female population 55 to 59 years

dp0010052

DPSF1. Sex and age - Female population 60 to 64 years

dp0010053

DPSF1. Sex and age - Female population 65 to 69 years

dp0010054

DPSF1. Sex and age - Female population 70 to 74 years

dp0010055

DPSF1. Sex and age - Female population 75 to 79 years

dp0010056

DPSF1. Sex and age - Female population 80 to 84 years

dp0010057

DPSF1. Sex and age - Female population 85 years and over
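
The DPSF1 columns follow a regular naming pattern, which makes simple consistency checks easy to write. The sketch below uses a made-up record and assumes that the total-population age bands (dp0010002 to dp0010019) and the male and female totals (dp0010020 and dp0010039) should each add up to dp0010001. The helper names are hypothetical.

```python
from typing import Dict, List


def age_band_columns(first: int, last: int) -> List[str]:
    """Build DPSF1 column names such as dp0010002 for a range of item numbers."""
    return [f"dp001{n:04d}" for n in range(first, last + 1)]


def check_sex_and_age(record: Dict[str, int]) -> Dict[str, bool]:
    """Compare subtotals in a DPSF1 record against the total population."""
    total = record["dp0010001"]
    bands = sum(record[col] for col in age_band_columns(2, 19))
    sexes = record["dp0010020"] + record["dp0010039"]
    return {
        "bands_sum_to_total": bands == total,
        "sexes_sum_to_total": sexes == total,
    }


# Made-up record: 18 age bands of 10 people each, split evenly by sex.
record = {col: 10 for col in age_band_columns(2, 19)}
record.update({"dp0010001": 180, "dp0010020": 90, "dp0010039": 90})
checks = check_sex_and_age(record)
```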

dp0020001

DPSF2. Median age by sex - Both sexes

dp0020002

DPSF2. Median age by sex - Male

dp0020003

DPSF2. Median age by sex - Female

dp0030001

DPSF3. Sex for population 16 years and over - Total

dp0030002

DPSF3. Sex for population 16 years and over - Male

dp0030003

DPSF3. Sex for population 16 years and over - Female

dp0040001

DPSF4. Sex for population 18 years and over - Total

dp0040002

DPSF4. Sex for population 18 years and over - Male

dp0040003

DPSF4. Sex for population 18 years and over - Female

dp0050001

DPSF5. Sex for population 21 years and over - Total

dp0050002

DPSF5. Sex for population 21 years and over - Male

dp0050003

DPSF5. Sex for population 21 years and over - Female

dp0060001

DPSF6. Sex for population 62 years and over - Total

dp0060002

DPSF6. Sex for population 62 years and over - Male

dp0060003

DPSF6. Sex for population 62 years and over - Female

dp0070001

DPSF7. Sex for population 65 years and over - Total

dp0070002

DPSF7. Sex for population 65 years and over - Male

dp0070003

DPSF7. Sex for population 65 years and over - Female

dp0080001

DPSF8. Race - Total population

dp0080002

DPSF8. Race - Population of one race

dp0080003

DPSF8. Race - White

dp0080004

DPSF8. Race - Black or African American

dp0080005

DPSF8. Race - American Indian and Alaska Native

dp0080006

DPSF8. Race - Asian total

dp0080007

DPSF8. Race - Asian Indian

dp0080008

DPSF8. Race - Chinese

dp0080009

DPSF8. Race - Filipino

dp0080010

DPSF8. Race - Japanese

dp0080011

DPSF8. Race - Korean

dp0080012

DPSF8. Race - Vietnamese

dp0080013

DPSF8. Race - Other Asian

dp0080014

DPSF8. Race - Native Hawaiian and Other Pacific Islander total

dp0080015

DPSF8. Race - Native Hawaiian

dp0080016

DPSF8. Race - Guamanian or Chamorro

dp0080017

DPSF8. Race - Samoan

dp0080018

DPSF8. Race - Other Pacific Islander

dp0080019

DPSF8. Race - Some Other Race

dp0080020

DPSF8. Race - Population of Two or More Races

dp0080021

DPSF8. Race - White; American Indian and Alaska Native

dp0080022

DPSF8. Race - White; Asian

dp0080023

DPSF8. Race - White; Black or African American

dp0080024

DPSF8. Race - White; Some Other Race

dp0090001

DPSF9. Race (total races tallied) - White alone or in combination with one or more other races

dp0090002

DPSF9. Race (total races tallied) - Black or African American alone or in combination with one or more other races

dp0090003

DPSF9. Race (total races tallied) - American Indian and Alaska Native alone or in combination with one or more other races

dp0090004

DPSF9. Race (total races tallied) - Asian alone or in combination with one or more other races

dp0090005

DPSF9. Race (total races tallied) - Native Hawaiian and Other Pacific Islander alone or in combination with one or more other races

dp0090006

DPSF9. Race (total races tallied) - Some Other Race alone or in combination with one or more other races

dp0100001

DPSF10. Hispanic or Latino by specific origin - Total population

dp0100002

DPSF10. Hispanic or Latino by specific origin - Hispanic or Latino (of any race)

dp0100003

DPSF10. Hispanic or Latino by specific origin - Mexican

dp0100004

DPSF10. Hispanic or Latino by specific origin - Puerto Rican

dp0100005

DPSF10. Hispanic or Latino by specific origin - Cuban

dp0100006

DPSF10. Hispanic or Latino by specific origin - Other Hispanic or Latino

dp0100007

DPSF10. Hispanic or Latino by specific origin - Not Hispanic or Latino

dp0110001

DPSF11. Hispanic or Latino and race - Total population

dp0110002

DPSF11. Hispanic or Latino and race - Hispanic or Latino total

dp0110003

DPSF11. Hispanic or Latino and race - Hispanic or Latino: White alone

dp0110004

DPSF11. Hispanic or Latino and race - Hispanic or Latino: Black or African American alone

dp0110005

DPSF11. Hispanic or Latino and race - Hispanic or Latino: American Indian and Alaska Native alone

dp0110006

DPSF11. Hispanic or Latino and race - Hispanic or Latino: Asian alone

dp0110007

DPSF11. Hispanic or Latino and race - Hispanic or Latino: Native Hawaiian and Other Pacific Islander alone

dp0110008

DPSF11. Hispanic or Latino and race - Hispanic or Latino: Some Other Race alone

dp0110009

DPSF11. Hispanic or Latino and race - Hispanic or Latino: Two or More Races

dp0110010

DPSF11. Hispanic or Latino and race - Not Hispanic or Latino total

dp0110011

DPSF11. Hispanic or Latino and race - Not Hispanic or Latino: White alone

dp0110012

DPSF11. Hispanic or Latino and race - Not Hispanic or Latino: Black or African American alone

dp0110013

DPSF11. Hispanic or Latino and race - Not Hispanic or Latino: American Indian and Alaska Native alone

dp0110014

DPSF11. Hispanic or Latino and race - Not Hispanic or Latino: Asian alone

dp0110015

DPSF11. Hispanic or Latino and race - Not Hispanic or Latino: Native Hawaiian and Other Pacific Islander alone

dp0110016

DPSF11. Hispanic or Latino and race - Not Hispanic or Latino: Some Other Race alone

dp0110017

DPSF11. Hispanic or Latino and race - Not Hispanic or Latino: Two or More Races

dp0120001

DPSF12. Relationship - Total population

dp0120002

DPSF12. Relationship - In households

dp0120003

DPSF12. Relationship - Householder

dp0120004

DPSF12. Relationship - Spouse

dp0120005

DPSF12. Relationship - Child

dp0120006

DPSF12. Relationship - Own child under 18 years

dp0120007

DPSF12. Relationship - Other relatives

dp0120008

DPSF12. Relationship - Other relatives under 18 years

dp0120009

DPSF12. Relationship - Other relatives 65 years and over

dp0120010

DPSF12. Relationship - Nonrelatives

dp0120011

DPSF12. Relationship - Nonrelatives under 18 years

dp0120012

DPSF12. Relationship - Nonrelatives 65 years and over

dp0120013

DPSF12. Relationship - Unmarried partner

dp0120014

DPSF12. Relationship - In group quarters

dp0120015

DPSF12. Relationship - Institutionalized population

dp0120016

DPSF12. Relationship - Institutionalized population: Male

dp0120017

DPSF12. Relationship - Institutionalized population: Female

dp0120018

DPSF12. Relationship - Noninstitutionalized population

dp0120019

DPSF12. Relationship - Noninstitutionalized population: Male

dp0120020

DPSF12. Relationship - Noninstitutionalized population: Female

dp0130001

DPSF13. Households by type - Total households

dp0130002

DPSF13. Households by type - Family households (families)

dp0130003

DPSF13. Households by type - Family households with own children under 18 years

dp0130004

DPSF13. Households by type - Husband-wife family

dp0130005

DPSF13. Households by type - Husband-wife family with own children under 18 years

dp0130006

DPSF13. Households by type - Male householder, no wife present

dp0130007

DPSF13. Households by type - Male householder, no wife present, with own children under 18 years

dp0130008

DPSF13. Households by type - Female householder, no husband present

dp0130009

DPSF13. Households by type - Female householder, no husband present, with own children under 18 years

dp0130010

DPSF13. Households by type - Nonfamily households

dp0130011

DPSF13. Households by type - Householder living alone

dp0130012

DPSF13. Households by type - Householder living alone: Male

dp0130013

DPSF13. Households by type - Householder living alone: Male 65 years and over

dp0130014

DPSF13. Households by type - Householder living alone: Female

dp0130015

DPSF13. Households by type - Householder living alone: Female 65 years and over

dp0140001

DPSF14. Households with individuals under 18 years - Total

dp0150001

DPSF15. Households with individuals 65 years and over - Total

dp0160001

DPSF16. Average household size - Average household size

dp0170001

DPSF17. Average family size - Average family size

dp0180001

DPSF18. Housing occupancy - Total housing units

dp0180002

DPSF18. Housing occupancy - Occupied housing units

dp0180003

DPSF18. Housing occupancy - Vacant housing units

dp0180004

DPSF18. Housing occupancy - Vacant housing units for rent

dp0180005

DPSF18. Housing occupancy - Vacant housing units rented, not occupied

dp0180006

DPSF18. Housing occupancy - Vacant housing units for sale only

dp0180007

DPSF18. Housing occupancy - Vacant housing units sold, not occupied

dp0180008

DPSF18. Housing occupancy - Vacant housing units for seasonal, recreational, or occasional use

dp0180009

DPSF18. Housing occupancy - All other vacant housing units

dp0190001

DPSF19. Homeowner vacancy rate - Homeowner vacancy rate (percent)

dp0200001

DPSF20. Rental vacancy rate - Rental vacancy rate (percent)

dp0210001

DPSF21. Housing tenure - Total occupied housing units

dp0210002

DPSF21. Housing tenure - Owner-occupied housing units

dp0210003

DPSF21. Housing tenure - Renter-occupied housing units

dp0220001

DPSF22. Population in occupied housing units by tenure - Owner-occupied housing units

dp0220002

DPSF22. Population in occupied housing units by tenure - Renter-occupied housing units

dp0230001

DPSF23. Average household size by tenure - Owner occupied

dp0230002

DPSF23. Average household size by tenure - Renter occupied

out_censusdp1tract__states

package: pudl

Processing:

Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.

Source:

Census DP1 -- Profile of General Demographic Characteristics

Primary key:

state_id_fips

Additional Details

State-level summarized US Census demographic information, along with a geometry describing state boundaries. Based on the 2010 Census. Used as a convenient source of geospatial information within PUDL, but should not be treated as authoritative.

Columns
state_id_fips

Two digit state FIPS code.

geometry

Geospatial representation of the feature.

state_name

Full name of the state.

state

Two letter US state abbreviation.

land_area

Land area in square meters.

water_area

Water area in square meters.

internal_point_latitude

Internal point latitude in decimal degrees.

internal_point_longitude

Internal point longitude in decimal degrees.

shape_length

Length of the feature's perimeter in degrees.

shape_area

Area of the feature in square degrees.

dp0010001

DPSF1. Sex and age - Total population

dp0010002

DPSF1. Sex and age - Total population under 5 years

dp0010003

DPSF1. Sex and age - Total population 5 to 9 years

dp0010004

DPSF1. Sex and age - Total population 10 to 14 years

dp0010005

DPSF1. Sex and age - Total population 15 to 19 years

dp0010006

DPSF1. Sex and age - Total population 20 to 24 years

dp0010007

DPSF1. Sex and age - Total population 25 to 29 years

dp0010008

DPSF1. Sex and age - Total population 30 to 34 years

dp0010009

DPSF1. Sex and age - Total population 35 to 39 years

dp0010010

DPSF1. Sex and age - Total population 40 to 44 years

dp0010011

DPSF1. Sex and age - Total population 45 to 49 years

dp0010012

DPSF1. Sex and age - Total population 50 to 54 years

dp0010013

DPSF1. Sex and age - Total population 55 to 59 years

dp0010014

DPSF1. Sex and age - Total population 60 to 64 years

dp0010015

DPSF1. Sex and age - Total population 65 to 69 years

dp0010016

DPSF1. Sex and age - Total population 70 to 74 years

dp0010017

DPSF1. Sex and age - Total population 75 to 79 years

dp0010018

DPSF1. Sex and age - Total population 80 to 84 years

dp0010019

DPSF1. Sex and age - Total population 85 years and over

dp0010020

DPSF1. Sex and age - Male population total

dp0010021

DPSF1. Sex and age - Male population under 5 years

dp0010022

DPSF1. Sex and age - Male population 5 to 9 years

dp0010023

DPSF1. Sex and age - Male population 10 to 14 years

dp0010024

DPSF1. Sex and age - Male population 15 to 19 years

dp0010025

DPSF1. Sex and age - Male population 20 to 24 years

dp0010026

DPSF1. Sex and age - Male population 25 to 29 years

dp0010027

DPSF1. Sex and age - Male population 30 to 34 years

dp0010028

DPSF1. Sex and age - Male population 35 to 39 years

dp0010029

DPSF1. Sex and age - Male population 40 to 44 years

dp0010030

DPSF1. Sex and age - Male population 45 to 49 years

dp0010031

DPSF1. Sex and age - Male population 50 to 54 years

dp0010032

DPSF1. Sex and age - Male population 55 to 59 years

dp0010033

DPSF1. Sex and age - Male population 60 to 64 years

dp0010034

DPSF1. Sex and age - Male population 65 to 69 years

dp0010035

DPSF1. Sex and age - Male population 70 to 74 years

dp0010036

DPSF1. Sex and age - Male population 75 to 79 years

dp0010037

DPSF1. Sex and age - Male population 80 to 84 years

dp0010038

DPSF1. Sex and age - Male population 85 years and over

dp0010039

DPSF1. Sex and age - Female population total

dp0010040

DPSF1. Sex and age - Female population under 5 years

dp0010041

DPSF1. Sex and age - Female population 5 to 9 years

dp0010042

DPSF1. Sex and age - Female population 10 to 14 years

dp0010043

DPSF1. Sex and age - Female population 15 to 19 years

dp0010044

DPSF1. Sex and age - Female population 20 to 24 years

dp0010045

DPSF1. Sex and age - Female population 25 to 29 years

dp0010046

DPSF1. Sex and age - Female population 30 to 34 years

dp0010047

DPSF1. Sex and age - Female population 35 to 39 years

dp0010048

DPSF1. Sex and age - Female population 40 to 44 years

dp0010049

DPSF1. Sex and age - Female population 45 to 49 years

dp0010050

DPSF1. Sex and age - Female population 50 to 54 years

dp0010051

DPSF1. Sex and age - Female population 55 to 59 years

dp0010052

DPSF1. Sex and age - Female population 60 to 64 years

dp0010053

DPSF1. Sex and age - Female population 65 to 69 years

dp0010054

DPSF1. Sex and age - Female population 70 to 74 years

dp0010055

DPSF1. Sex and age - Female population 75 to 79 years

dp0010056

DPSF1. Sex and age - Female population 80 to 84 years

dp0010057

DPSF1. Sex and age - Female population 85 years and over

dp0020001

DPSF2. Median age by sex - Both sexes

dp0020002

DPSF2. Median age by sex - Male

dp0020003

DPSF2. Median age by sex - Female

dp0030001

DPSF3. Sex for population 16 years and over - Total

dp0030002

DPSF3. Sex for population 16 years and over - Male

dp0030003

DPSF3. Sex for population 16 years and over - Female

dp0040001

DPSF4. Sex for population 18 years and over - Total

dp0040002

DPSF4. Sex for population 18 years and over - Male

dp0040003

DPSF4. Sex for population 18 years and over - Female

dp0050001

DPSF5. Sex for population 21 years and over - Total

dp0050002

DPSF5. Sex for population 21 years and over - Male

dp0050003

DPSF5. Sex for population 21 years and over - Female

dp0060001

DPSF6. Sex for population 62 years and over - Total

dp0060002

DPSF6. Sex for population 62 years and over - Male

dp0060003

DPSF6. Sex for population 62 years and over - Female

dp0070001

DPSF7. Sex for population 65 years and over - Total

dp0070002

DPSF7. Sex for population 65 years and over - Male

dp0070003

DPSF7. Sex for population 65 years and over - Female

dp0080001

DPSF8. Race - Total population

dp0080002

DPSF8. Race - Population of one race

dp0080003

DPSF8. Race - White

dp0080004

DPSF8. Race - Black or African American

dp0080005

DPSF8. Race - American Indian and Alaska Native

dp0080006

DPSF8. Race - Asian total

dp0080007

DPSF8. Race - Asian Indian

dp0080008

DPSF8. Race - Chinese

dp0080009

DPSF8. Race - Filipino

dp0080010

DPSF8. Race - Japanese

dp0080011

DPSF8. Race - Korean

dp0080012

DPSF8. Race - Vietnamese

dp0080013

DPSF8. Race - Other Asian

dp0080014

DPSF8. Race - Native Hawaiian and Other Pacific Islander total

dp0080015

DPSF8. Race - Native Hawaiian

dp0080016

DPSF8. Race - Guamanian or Chamorro

dp0080017

DPSF8. Race - Samoan

dp0080018

DPSF8. Race - Other Pacific Islander

dp0080019

DPSF8. Race - Some Other Race

dp0080020

DPSF8. Race - Population of Two or More Races

dp0080021

DPSF8. Race - White; American Indian and Alaska Native

dp0080022

DPSF8. Race - White; Asian

dp0080023

DPSF8. Race - White; Black or African American

dp0080024

DPSF8. Race - White; Some Other Race

dp0090001

DPSF9. Race (total races tallied) - White alone or in combination with one or more other races

dp0090002

DPSF9. Race (total races tallied) - Black or African American alone or in combination with one or more other races

dp0090003

DPSF9. Race (total races tallied) - American Indian and Alaska Native alone or in combination with one or more other races

dp0090004

DPSF9. Race (total races tallied) - Asian alone or in combination with one or more other races

dp0090005

DPSF9. Race (total races tallied) - Native Hawaiian and Other Pacific Islander alone or in combination with one or more other races

dp0090006

DPSF9. Race (total races tallied) - Some Other Race alone or in combination with one or more other races

dp0100001

DPSF10. Hispanic or Latino by specific origin - Total population

dp0100002

DPSF10. Hispanic or Latino by specific origin - Hispanic or Latino (of any race)

dp0100003

DPSF10. Hispanic or Latino by specific origin - Mexican

dp0100004

DPSF10. Hispanic or Latino by specific origin - Puerto Rican

dp0100005

DPSF10. Hispanic or Latino by specific origin - Cuban

dp0100006

DPSF10. Hispanic or Latino by specific origin - Other Hispanic or Latino

dp0100007

DPSF10. Hispanic or Latino by specific origin - Not Hispanic or Latino

dp0110001

DPSF11. Hispanic or Latino and race - Total population

dp0110002

DPSF11. Hispanic or Latino and race - Hispanic or Latino total

dp0110003

DPSF11. Hispanic or Latino and race - Hispanic or Latino: White alone

dp0110004

DPSF11. Hispanic or Latino and race - Hispanic or Latino: Black or African American alone

dp0110005

DPSF11. Hispanic or Latino and race - Hispanic or Latino: American Indian and Alaska Native alone

dp0110006

DPSF11. Hispanic or Latino and race - Hispanic or Latino: Asian alone

dp0110007

DPSF11. Hispanic or Latino and race - Hispanic or Latino: Native Hawaiian and Other Pacific Islander alone

dp0110008

DPSF11. Hispanic or Latino and race - Hispanic or Latino: Some Other Race alone

dp0110009

DPSF11. Hispanic or Latino and race - Hispanic or Latino: Two or More Races

dp0110010

DPSF11. Hispanic or Latino and race - Not Hispanic or Latino total

dp0110011

DPSF11. Hispanic or Latino and race - Not Hispanic or Latino: White alone

dp0110012

DPSF11. Hispanic or Latino and race - Not Hispanic or Latino: Black or African American alone

dp0110013

DPSF11. Hispanic or Latino and race - Not Hispanic or Latino: American Indian and Alaska Native alone

dp0110014

DPSF11. Hispanic or Latino and race - Not Hispanic or Latino: Asian alone

dp0110015

DPSF11. Hispanic or Latino and race - Not Hispanic or Latino: Native Hawaiian and Other Pacific Islander alone

dp0110016

DPSF11. Hispanic or Latino and race - Not Hispanic or Latino: Some Other Race alone

dp0110017

DPSF11. Hispanic or Latino and race - Not Hispanic or Latino: Two or More Races

dp0120001

DPSF12. Relationship - Total population

dp0120002

DPSF12. Relationship - In households

dp0120003

DPSF12. Relationship - Householder

dp0120004

DPSF12. Relationship - Spouse

dp0120005

DPSF12. Relationship - Child

dp0120006

DPSF12. Relationship - Own child under 18 years

dp0120007

DPSF12. Relationship - Other relatives

dp0120008

DPSF12. Relationship - Other relatives under 18 years

dp0120009

DPSF12. Relationship - Other relatives 65 years and over

dp0120010

DPSF12. Relationship - Nonrelatives

dp0120011

DPSF12. Relationship - Nonrelatives under 18 years

dp0120012

DPSF12. Relationship - Nonrelatives 65 years and over

dp0120013

DPSF12. Relationship - Unmarried partner

dp0120014

DPSF12. Relationship - In group quarters

dp0120015

DPSF12. Relationship - Institutionalized population

dp0120016

DPSF12. Relationship - Institutionalized population: Male

dp0120017

DPSF12. Relationship - Institutionalized population: Female

dp0120018

DPSF12. Relationship - Noninstitutionalized population

dp0120019

DPSF12. Relationship - Noninstitutionalized population: Male

dp0120020

DPSF12. Relationship - Noninstitutionalized population: Female

dp0130001

DPSF13. Households by type - Total households

dp0130002

DPSF13. Households by type - Family households (families)

dp0130003

DPSF13. Households by type - Family households with own children under 18 years

dp0130004

DPSF13. Households by type - Husband-wife family

dp0130005

DPSF13. Households by type - Husband-wife family with own children under 18 years

dp0130006

DPSF13. Households by type - Male householder, no wife present

dp0130007

DPSF13. Households by type - Male householder, no wife present, with own children under 18 years

dp0130008

DPSF13. Households by type - Female householder, no husband present

dp0130009

DPSF13. Households by type - Female householder, no husband present, with own children under 18 years

dp0130010

DPSF13. Households by type - Nonfamily households

dp0130011

DPSF13. Households by type - Householder living alone

dp0130012

DPSF13. Households by type - Householder living alone: Male

dp0130013

DPSF13. Households by type - Householder living alone: Male 65 years and over

dp0130014

DPSF13. Households by type - Householder living alone: Female

dp0130015

DPSF13. Households by type - Householder living alone: Female 65 years and over

dp0140001

DPSF14. Households with individuals under 18 years - Total

dp0150001

DPSF15. Households with individuals 65 years and over - Total

dp0160001

DPSF16. Average household size - Average household size

dp0170001

DPSF17. Average family size - Average family size

dp0180001

DPSF18. Housing occupancy - Total housing units

dp0180002

DPSF18. Housing occupancy - Occupied housing units

dp0180003

DPSF18. Housing occupancy - Vacant housing units

dp0180004

DPSF18. Housing occupancy - Vacant housing units for rent

dp0180005

DPSF18. Housing occupancy - Vacant housing units rented, not occupied

dp0180006

DPSF18. Housing occupancy - Vacant housing units for sale only

dp0180007

DPSF18. Housing occupancy - Vacant housing units sold, not occupied

dp0180008

DPSF18. Housing occupancy - Vacant housing units for seasonal, recreational, or occasional use

dp0180009

DPSF18. Housing occupancy - All other vacant housing units

dp0190001

DPSF19. Homeowner vacancy rate - Homeowner vacancy rate (percent)

dp0200001

DPSF20. Rental vacancy rate - Rental vacancy rate (percent)

dp0210001

DPSF21. Housing tenure - Total occupied housing units

dp0210002

DPSF21. Housing tenure - Owner-occupied housing units

dp0210003

DPSF21. Housing tenure - Renter-occupied housing units

dp0220001

DPSF22. Population in occupied housing units by tenure - Owner-occupied housing units

dp0220002

DPSF22. Population in occupied housing units by tenure - Renter-occupied housing units

dp0230001

DPSF23. Average household size by tenure - Owner occupied

dp0230002

DPSF23. Average household size by tenure - Renter occupied

out_censusdp1tract__tracts

package: pudl

Processing:

Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.

Source:

Census DP1 -- Profile of General Demographic Characteristics

Primary key:

tract_id_fips

Additional Details

Census tract-level summarized US Census demographic information, along with a geometry describing tract boundaries. Based on the 2010 Census. Used as a convenient source of geospatial information within PUDL, but should not be treated as authoritative.
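Because tract_id_fips is a string identifier, it can be split into its geographic parts without a lookup table. The sketch below assumes the standard Census tract identifier layout (2-character state code, 3-character county code, 6-character tract code). The function name and the sample ID are illustrative and are not part of PUDL.

```python
def split_tract_fips(tract_id_fips: str) -> dict:
    """Split a tract identifier into state, county, and tract components.

    Assumes the standard Census layout: 2 + 3 + 6 = 11 characters.
    """
    if len(tract_id_fips) != 11 or not tract_id_fips.isdigit():
        raise ValueError(f"expected an 11-digit tract ID, got {tract_id_fips!r}")
    return {
        "state": tract_id_fips[:2],
        "county": tract_id_fips[:5],  # state prefix plus 3-character county code
        "tract": tract_id_fips[5:],
    }


# Hypothetical tract ID used only for illustration.
parts = split_tract_fips("08013012100")
```

Keeping the ID as a string matters here: casting it to an integer would drop the leading zero of states such as "08".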

Columns
tract_id_fips

Census tract 11-digit FIPS code (2-digit state, 3-digit county, 6-digit tract).

geometry

Geospatial representation of the feature.

tract_name

Census tract legal/statistical area description

land_area

Land area in square meters.

water_area

Water area in square meters.

internal_point_latitude

Internal point latitude in decimal degrees.

internal_point_longitude

Internal point longitude in decimal degrees.

shape_length

Length of the feature's perimeter in degrees.

shape_area

Area of the feature in square degrees.

dp0010001

DPSF1. Sex and age - Total population

dp0010002

DPSF1. Sex and age - Total population under 5 years

dp0010003

DPSF1. Sex and age - Total population 5 to 9 years

dp0010004

DPSF1. Sex and age - Total population 10 to 14 years

dp0010005

DPSF1. Sex and age - Total population 15 to 19 years

dp0010006

DPSF1. Sex and age - Total population 20 to 24 years

dp0010007

DPSF1. Sex and age - Total population 25 to 29 years

dp0010008

DPSF1. Sex and age - Total population 30 to 34 years

dp0010009

DPSF1. Sex and age - Total population 35 to 39 years

dp0010010

DPSF1. Sex and age - Total population 40 to 44 years

dp0010011

DPSF1. Sex and age - Total population 45 to 49 years

dp0010012

DPSF1. Sex and age - Total population 50 to 54 years

dp0010013

DPSF1. Sex and age - Total population 55 to 59 years

dp0010014

DPSF1. Sex and age - Total population 60 to 64 years

dp0010015

DPSF1. Sex and age - Total population 65 to 69 years

dp0010016

DPSF1. Sex and age - Total population 70 to 74 years

dp0010017

DPSF1. Sex and age - Total population 75 to 79 years

dp0010018

DPSF1. Sex and age - Total population 80 to 84 years

dp0010019

DPSF1. Sex and age - Total population 85 years and over

dp0010020

DPSF1. Sex and age - Male population total

dp0010021

DPSF1. Sex and age - Male population under 5 years

dp0010022

DPSF1. Sex and age - Male population 5 to 9 years

dp0010023

DPSF1. Sex and age - Male population 10 to 14 years

dp0010024

DPSF1. Sex and age - Male population 15 to 19 years

dp0010025

DPSF1. Sex and age - Male population 20 to 24 years

dp0010026

DPSF1. Sex and age - Male population 25 to 29 years

dp0010027

DPSF1. Sex and age - Male population 30 to 34 years

dp0010028

DPSF1. Sex and age - Male population 35 to 39 years

dp0010029

DPSF1. Sex and age - Male population 40 to 44 years

dp0010030

DPSF1. Sex and age - Male population 45 to 49 years

dp0010031

DPSF1. Sex and age - Male population 50 to 54 years

dp0010032

DPSF1. Sex and age - Male population 55 to 59 years

dp0010033

DPSF1. Sex and age - Male population 60 to 64 years

dp0010034

DPSF1. Sex and age - Male population 65 to 69 years

dp0010035

DPSF1. Sex and age - Male population 70 to 74 years

dp0010036

DPSF1. Sex and age - Male population 75 to 79 years

dp0010037

DPSF1. Sex and age - Male population 80 to 84 years

dp0010038

DPSF1. Sex and age - Male population 85 years and over

dp0010039

DPSF1. Sex and age - Female population total

dp0010040

DPSF1. Sex and age - Female population under 5 years

dp0010041

DPSF1. Sex and age - Female population 5 to 9 years

dp0010042

DPSF1. Sex and age - Female population 10 to 14 years

dp0010043

DPSF1. Sex and age - Female population 15 to 19 years

dp0010044

DPSF1. Sex and age - Female population 20 to 24 years

dp0010045

DPSF1. Sex and age - Female population 25 to 29 years

dp0010046

DPSF1. Sex and age - Female population 30 to 34 years

dp0010047

DPSF1. Sex and age - Female population 35 to 39 years

dp0010048

DPSF1. Sex and age - Female population 40 to 44 years

dp0010049

DPSF1. Sex and age - Female population 45 to 49 years

dp0010050

DPSF1. Sex and age - Female population 50 to 54 years

dp0010051

DPSF1. Sex and age - Female population 55 to 59 years

dp0010052

DPSF1. Sex and age - Female population 60 to 64 years

dp0010053

DPSF1. Sex and age - Female population 65 to 69 years

dp0010054

DPSF1. Sex and age - Female population 70 to 74 years

dp0010055

DPSF1. Sex and age - Female population 75 to 79 years

dp0010056

DPSF1. Sex and age - Female population 80 to 84 years

dp0010057

DPSF1. Sex and age - Female population 85 years and over

dp0020001

DPSF2. Median age by sex - Both sexes

dp0020002

DPSF2. Median age by sex - Male

dp0020003

DPSF2. Median age by sex - Female

dp0030001

DPSF3. Sex for population 16 years and over - Total

dp0030002

DPSF3. Sex for population 16 years and over - Male

dp0030003

DPSF3. Sex for population 16 years and over - Female

dp0040001

DPSF4. Sex for population 18 years and over - Total

dp0040002

DPSF4. Sex for population 18 years and over - Male

dp0040003

DPSF4. Sex for population 18 years and over - Female

dp0050001

DPSF5. Sex for population 21 years and over - Total

dp0050002

DPSF5. Sex for population 21 years and over - Male

dp0050003

DPSF5. Sex for population 21 years and over - Female

dp0060001

DPSF6. Sex for population 62 years and over - Total

dp0060002

DPSF6. Sex for population 62 years and over - Male

dp0060003

DPSF6. Sex for population 62 years and over - Female

dp0070001

DPSF7. Sex for population 65 years and over - Total

dp0070002

DPSF7. Sex for population 65 years and over - Male

dp0070003

DPSF7. Sex for population 65 years and over - Female

dp0080001

DPSF8. Race - Total population

dp0080002

DPSF8. Race - Population of one race

dp0080003

DPSF8. Race - White

dp0080004

DPSF8. Race - Black or African American

dp0080005

DPSF8. Race - American Indian and Alaska Native

dp0080006

DPSF8. Race - Asian total

dp0080007

DPSF8. Race - Asian Indian

dp0080008

DPSF8. Race - Chinese

dp0080009

DPSF8. Race - Filipino

dp0080010

DPSF8. Race - Japanese

dp0080011

DPSF8. Race - Korean

dp0080012

DPSF8. Race - Vietnamese

dp0080013

DPSF8. Race - Other Asian

dp0080014

DPSF8. Race - Native Hawaiian and Other Pacific Islander total

dp0080015

DPSF8. Race - Native Hawaiian

dp0080016

DPSF8. Race - Guamanian or Chamorro

dp0080017

DPSF8. Race - Samoan

dp0080018

DPSF8. Race - Other Pacific Islander

dp0080019

DPSF8. Race - Some Other Race

dp0080020

DPSF8. Race - Population of Two or More Races

dp0080021

DPSF8. Race - White; American Indian and Alaska Native

dp0080022

DPSF8. Race - White; Asian

dp0080023

DPSF8. Race - White; Black or African American

dp0080024

DPSF8. Race - White; Some Other Race

dp0090001

DPSF9. Race (total races tallied) - White alone or in combination with one or more other races

dp0090002

DPSF9. Race (total races tallied) - Black or African American alone or in combination with one or more other races

dp0090003

DPSF9. Race (total races tallied) - American Indian and Alaska Native alone or in combination with one or more other races

dp0090004

DPSF9. Race (total races tallied) - Asian alone or in combination with one or more other races

dp0090005

DPSF9. Race (total races tallied) - Native Hawaiian and Other Pacific Islander alone or in combination with one or more other races

dp0090006

DPSF9. Race (total races tallied) - Some Other Race alone or in combination with one or more other races

dp0100001

DPSF10. Hispanic or Latino by specific origin - Total population

dp0100002

DPSF10. Hispanic or Latino by specific origin - Hispanic or Latino (of any race)

dp0100003

DPSF10. Hispanic or Latino by specific origin - Mexican

dp0100004

DPSF10. Hispanic or Latino by specific origin - Puerto Rican

dp0100005

DPSF10. Hispanic or Latino by specific origin - Cuban

dp0100006

DPSF10. Hispanic or Latino by specific origin - Other Hispanic or Latino

dp0100007

DPSF10. Hispanic or Latino by specific origin - Not Hispanic or Latino

dp0110001

DPSF11. Hispanic or Latino and race - Total population

dp0110002

DPSF11. Hispanic or Latino and race - Hispanic or Latino total

dp0110003

DPSF11. Hispanic or Latino and race - Hispanic or Latino: White alone

dp0110004

DPSF11. Hispanic or Latino and race - Hispanic or Latino: Black or African American alone

dp0110005

DPSF11. Hispanic or Latino and race - Hispanic or Latino: American Indian and Alaska Native alone

dp0110006

DPSF11. Hispanic or Latino and race - Hispanic or Latino: Asian alone

dp0110007

DPSF11. Hispanic or Latino and race - Hispanic or Latino: Native Hawaiian and Other Pacific Islander alone

dp0110008

DPSF11. Hispanic or Latino and race - Hispanic or Latino: Some Other Race alone

dp0110009

DPSF11. Hispanic or Latino and race - Hispanic or Latino: Two or More Races

dp0110010

DPSF11. Hispanic or Latino and race - Not Hispanic or Latino total

dp0110011

DPSF11. Hispanic or Latino and race - Not Hispanic or Latino: White alone

dp0110012

DPSF11. Hispanic or Latino and race - Not Hispanic or Latino: Black or African American alone

dp0110013

DPSF11. Hispanic or Latino and race - Not Hispanic or Latino: American Indian and Alaska Native alone

dp0110014

DPSF11. Hispanic or Latino and race - Not Hispanic or Latino: Asian alone

dp0110015

DPSF11. Hispanic or Latino and race - Not Hispanic or Latino: Native Hawaiian and Other Pacific Islander alone

dp0110016

DPSF11. Hispanic or Latino and race - Not Hispanic or Latino: Some Other Race alone

dp0110017

DPSF11. Hispanic or Latino and race - Not Hispanic or Latino: Two or More Races

dp0120001

DPSF12. Relationship - Total population

dp0120002

DPSF12. Relationship - In households

dp0120003

DPSF12. Relationship - Householder

dp0120004

DPSF12. Relationship - Spouse

dp0120005

DPSF12. Relationship - Child

dp0120006

DPSF12. Relationship - Own child under 18 years

dp0120007

DPSF12. Relationship - Other relatives

dp0120008

DPSF12. Relationship - Other relatives under 18 years

dp0120009

DPSF12. Relationship - Other relatives 65 years and over

dp0120010

DPSF12. Relationship - Nonrelatives

dp0120011

DPSF12. Relationship - Nonrelatives under 18 years

dp0120012

DPSF12. Relationship - Nonrelatives 65 years and over

dp0120013

DPSF12. Relationship - Unmarried partner

dp0120014

DPSF12. Relationship - In group quarters

dp0120015

DPSF12. Relationship - Institutionalized population

dp0120016

DPSF12. Relationship - Institutionalized population: Male

dp0120017

DPSF12. Relationship - Institutionalized population: Female

dp0120018

DPSF12. Relationship - Noninstitutionalized population

dp0120019

DPSF12. Relationship - Noninstitutionalized population: Male

dp0120020

DPSF12. Relationship - Noninstitutionalized population: Female

dp0130001

DPSF13. Households by type - Total households

dp0130002

DPSF13. Households by type - Family households (families)

dp0130003

DPSF13. Households by type - Family households with own children under 18 years

dp0130004

DPSF13. Households by type - Husband-wife family

dp0130005

DPSF13. Households by type - Husband-wife family with own children under 18 years

dp0130006

DPSF13. Households by type - Male householder, no wife present

dp0130007

DPSF13. Households by type - Male householder, no wife present, with own children under 18 years

dp0130008

DPSF13. Households by type - Female householder, no husband present

dp0130009

DPSF13. Households by type - Female householder, no husband present, with own children under 18 years

dp0130010

DPSF13. Households by type - Nonfamily households

dp0130011

DPSF13. Households by type - Householder living alone

dp0130012

DPSF13. Households by type - Householder living alone: Male

dp0130013

DPSF13. Households by type - Householder living alone: Male 65 years and over

dp0130014

DPSF13. Households by type - Householder living alone: Female

dp0130015

DPSF13. Households by type - Householder living alone: Female 65 years and over

dp0140001

DPSF14. Households with individuals under 18 years - Total

dp0150001

DPSF15. Households with individuals 65 years and over - Total

dp0160001

DPSF16. Average household size - Average household size

dp0170001

DPSF17. Average family size - Average family size

dp0180001

DPSF18. Housing occupancy - Total housing units

dp0180002

DPSF18. Housing occupancy - Occupied housing units

dp0180003

DPSF18. Housing occupancy - Vacant housing units

dp0180004

DPSF18. Housing occupancy - Vacant housing units for rent

dp0180005

DPSF18. Housing occupancy - Vacant housing units rented, not occupied

dp0180006

DPSF18. Housing occupancy - Vacant housing units for sale only

dp0180007

DPSF18. Housing occupancy - Vacant housing units sold, not occupied

dp0180008

DPSF18. Housing occupancy - Vacant housing units for seasonal, recreational, or occasional use

dp0180009

DPSF18. Housing occupancy - All other vacant housing units

dp0190001

DPSF19. Homeowner vacancy rate - Homeowner vacancy rate (percent)

dp0200001

DPSF20. Rental vacancy rate - Rental vacancy rate (percent)

dp0210001

DPSF21. Housing tenure - Total occupied housing units

dp0210002

DPSF21. Housing tenure - Owner-occupied housing units

dp0210003

DPSF21. Housing tenure - Renter-occupied housing units

dp0220001

DPSF22. Population in occupied housing units by tenure - Owner-occupied housing units

dp0220002

DPSF22. Population in occupied housing units by tenure - Renter-occupied housing units

dp0230001

DPSF23. Average household size by tenure - Owner occupied

dp0230002

DPSF23. Average household size by tenure - Renter occupied
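The coded DP1 column names are easier to work with once a few of them are combined into derived values. The following is a minimal sketch using a hypothetical tract row; the numbers are invented, and the helper name share is an assumption rather than anything provided by PUDL.

```python
# Hypothetical values for one tract row, keyed by DP1 column code.
row = {
    "dp0010001": 4200,  # Sex and age - Total population
    "dp0010020": 2050,  # Sex and age - Male population total
    "dp0010039": 2150,  # Sex and age - Female population total
    "dp0210001": 1600,  # Housing tenure - Total occupied housing units
    "dp0210002": 1000,  # Housing tenure - Owner-occupied housing units
    "dp0210003": 600,   # Housing tenure - Renter-occupied housing units
}


def share(part, whole):
    """Return part / whole, or None when the denominator is zero or missing."""
    if not whole:
        return None
    return part / whole


# Fraction of occupied housing units that are renter-occupied.
renter_share = share(row["dp0210003"], row["dp0210001"])

# Simple consistency check: male and female totals should add up to the total.
sexes_sum_to_total = row["dp0010020"] + row["dp0010039"] == row["dp0010001"]
```

Guarding against a zero denominator is worthwhile because some tracts may have no occupied housing units.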

out_eia860__yearly_emissions_control_equipment

package: pudl

Annual time series of the cost, type, operating status, retirement date, and install year of emissions control equipment reported to EIA.

Processing:

Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.

Source:

EIA Form 860 -- Annual Electric Generator Report

Primary key:

report_year, plant_id_eia, emission_control_id_pudl

Additional Details

Includes control IDs for sulfur dioxide (SO2), particulate matter, mercury, nitrogen oxide (NOX), and acid (HCl) gas monitoring.

This denormalized version includes plant name, utility ID, PUDL ID, and utility name columns.

Columns
report_year

Four-digit year in which the data was reported.

plant_id_eia

The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.

plant_id_pudl

A manually assigned PUDL plant ID. May not be constant over time.

plant_name_eia

Plant name.

utility_id_eia

The EIA Utility Identification number.

utility_id_pudl

A manually assigned PUDL utility ID. May not be stable over time.

utility_name_eia

The name of the utility.

emission_control_id_pudl

A PUDL-generated ID used to distinguish emission control units in the same report year and plant id. This ID should not be used to track units over time or between plants.

data_maturity

Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.

emission_control_equipment_type_code

Short code indicating the type of emission control equipment installed.

operational_status_code

The operating status of the asset.

operational_status

The operating status of the asset. For generators this is based on the tab of EIA 860 in which the generator was listed.

mercury_control_id_eia

Mercury control identification number. This ID is not a unique identifier.

nox_control_id_eia

Nitrogen oxide control identification number. This ID is not a unique identifier.

particulate_control_id_eia

Particulate matter control identification number. This ID is not a unique identifier.

so2_control_id_eia

Sulfur dioxide control identification number. This ID is not a unique identifier.

acid_gas_control

Indicates whether the emissions control equipment controls acid (HCl) gas.

emission_control_equipment_cost

The total cost to install a piece of emission control equipment.

emission_control_operating_date

The date a piece of emissions control equipment began operating. Derived from month and year columns in the raw data.

emission_control_retirement_date

The expected or actual retirement date for a piece of emissions control equipment. Derived from month and year columns in the raw data.

out_eia860__yearly_ownership

package: pudl

Annual time series of generator ownership. Includes only jointly or third-party owned generators.

Processing:

Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.

Source:

EIA Form 860 -- Annual Electric Generator Report (Schedule 4)

Primary key:

report_date, plant_id_eia, generator_id, owner_utility_id_eia

Additional Details

Denormalized to include plant and utility names and other associated IDs.

Columns
report_date

Date reported.

plant_id_eia

The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.

plant_id_pudl

A manually assigned PUDL plant ID. May not be constant over time.

plant_name_eia

Plant name.

owner_utility_id_eia

The EIA utility Identification number for the owner company that is responsible for the day-to-day operations of the generator, not the operator utility.

operator_utility_id_eia

The EIA utility Identification number for the operator utility.

utility_id_pudl

A manually assigned PUDL utility ID for the owner company that is responsible for the day-to-day operations of the generator, not the operator utility. May not be stable over time.

owner_utility_name_eia

The name of the EIA owner utility.

generator_id

Generator ID is usually numeric, but sometimes includes letters. Make sure you treat it as a string!

owner_state

Two letter ISO-3166 political subdivision code.

owner_city

City of owner.

owner_country

Three letter ISO-3166 country code.

owner_street_address

Street address of owner.

owner_zip_code

Zip code of owner.

fraction_owned

Proportion of generator ownership attributable to this utility.

data_maturity

Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.
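One common check on this table is whether the ownership shares reported for each generator add up to a whole. The sketch below groups hypothetical records by the generator portion of the primary key and sums fraction_owned. It assumes fraction_owned is on a 0 to 1 scale, and all IDs and values are invented for illustration.

```python
from collections import defaultdict

# Hypothetical ownership records. generator_id is kept as a string,
# since IDs such as "GT1" cannot be represented as numbers.
records = [
    {"report_date": "2022-01-01", "plant_id_eia": 1001, "generator_id": "1",
     "owner_utility_id_eia": 11, "fraction_owned": 0.5},
    {"report_date": "2022-01-01", "plant_id_eia": 1001, "generator_id": "1",
     "owner_utility_id_eia": 22, "fraction_owned": 0.5},
    {"report_date": "2022-01-01", "plant_id_eia": 1001, "generator_id": "GT1",
     "owner_utility_id_eia": 11, "fraction_owned": 0.6},
]


def total_fraction_owned(rows):
    """Sum fraction_owned per (report_date, plant_id_eia, generator_id)."""
    totals = defaultdict(float)
    for r in rows:
        key = (r["report_date"], r["plant_id_eia"], r["generator_id"])
        totals[key] += r["fraction_owned"]
    return dict(totals)


totals = total_fraction_owned(records)

# Generators whose reported shares do not sum to roughly 1.0.
incomplete = {k: v for k, v in totals.items() if abs(v - 1.0) > 0.01}
```

The 0.01 tolerance is an arbitrary choice for this sketch; reported fractions may not sum exactly to one because of rounding.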

out_eia861__yearly_balancing_authority_service_territory

package: pudl

Annual time series of counties in balancing authority service territories.

Processing:

Data is ready for use in analyses, but for practical reasons has not been denormalized and remains in narrow format.

Source:

EIA Form 861 -- Annual Electric Power Industry Report

Primary key:

balancing_authority_id_eia, report_date, county_id_fips, county

Columns
county_id_fips

County ID from the Federal Information Processing Standard Publication 6-4.

county_name_census

County name as specified in Census DP1 Data.

population

County population, sourced from Census DP1 data.

area_km2

County area in square kilometers (km²).

report_date

Date reported.

balancing_authority_id_eia

EIA balancing authority ID. This is often (but not always!) the same as the utility ID associated with the same legal entity.

state

Two letter US state abbreviation.

county

County name.

state_id_fips

Two digit state FIPS code.

out_eia861__yearly_utility_service_territory

package: pudl

Annual time series of counties in utility service territories.

Processing:

Data is ready for use in analyses, but for practical reasons has not been denormalized and remains in narrow format.

Source:

EIA Form 861 -- Annual Electric Power Industry Report

Primary key:

utility_id_eia, report_date, county_id_fips

Additional Details

Contains additional information about counties.

Columns
county_id_fips

County ID from the Federal Information Processing Standard Publication 6-4.

county_name_census

County name as specified in Census DP1 Data.

population

County population, sourced from Census DP1 data.

area_km2

County area in square kilometers (km²).

report_date

Date reported.

utility_id_eia

The EIA Utility Identification number.

state

Two letter US state abbreviation.

county

County name.

state_id_fips

Two digit state FIPS code.
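Since each row pairs a utility and report date with one county, per-territory totals can be built by summing the county-level columns. The following is a rough sketch over hypothetical rows; the function name and all values are assumptions. A county listed in a territory is not necessarily served in full by that utility, so these sums are best read as upper bounds.

```python
from collections import defaultdict

# Hypothetical service territory rows (one per utility, date, and county).
rows = [
    {"utility_id_eia": 11, "report_date": "2020-01-01",
     "county_id_fips": "08013", "population": 300000, "area_km2": 1900.0},
    {"utility_id_eia": 11, "report_date": "2020-01-01",
     "county_id_fips": "08014", "population": 70000, "area_km2": 85.5},
    {"utility_id_eia": 11, "report_date": "2020-01-01",
     "county_id_fips": "08999", "population": None, "area_km2": None},
    {"utility_id_eia": 22, "report_date": "2020-01-01",
     "county_id_fips": "08013", "population": 300000, "area_km2": 1900.0},
]


def territory_totals(rows):
    """Sum county population and area per (utility_id_eia, report_date)."""
    totals = defaultdict(lambda: {"population": 0, "area_km2": 0.0, "counties": 0})
    for r in rows:
        t = totals[(r["utility_id_eia"], r["report_date"])]
        t["population"] += r["population"] or 0  # treat nulls as zero
        t["area_km2"] += r["area_km2"] or 0.0
        t["counties"] += 1
    return dict(totals)


totals = territory_totals(rows)
```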

out_eia923__boiler_fuel

package: pudl

Boiler fuel consumption and emissions.

Processing:

Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.

Source:

EIA Form 923 -- Power Plant Operations Report (Schedule 3)

Primary key:

plant_id_eia, boiler_id, energy_source_code, prime_mover_code, report_date

Usage Warnings

  • Date column arbitrarily uses the first of the month.

  • A small number of respondents only report annual fuel consumption, and all of it is reported in December.

  • Data has been drawn from several EIA sources which are not always consistent with each other, and PUDL chooses the most consistent or relevant value to facilitate cross-referencing even if that means some values will differ from the raw sources. See Harvesting for details.
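The December warning above can be screened for with a simple heuristic: flag any boiler and year whose only nonzero fuel consumption falls in December. This is a sketch under that assumption, not a PUDL feature; the function name and sample rows are hypothetical, and a boiler that genuinely ran only in December would be flagged as well.

```python
from collections import defaultdict
from datetime import date


def december_only_boiler_years(rows):
    """Return (plant_id_eia, boiler_id, year) keys whose only nonzero
    fuel consumption is reported in December."""
    months = defaultdict(set)
    for r in rows:
        if r["fuel_consumed_units"]:  # skip nulls and zeros
            d = r["report_date"]
            months[(r["plant_id_eia"], r["boiler_id"], d.year)].add(d.month)
    return {key for key, seen in months.items() if seen == {12}}


# Hypothetical rows; report_date uses the first of the month.
rows = [
    {"plant_id_eia": 1001, "boiler_id": "B1",
     "report_date": date(2021, 6, 1), "fuel_consumed_units": 0.0},
    {"plant_id_eia": 1001, "boiler_id": "B1",
     "report_date": date(2021, 12, 1), "fuel_consumed_units": 120000.0},
    {"plant_id_eia": 1001, "boiler_id": "B2",
     "report_date": date(2021, 11, 1), "fuel_consumed_units": 9000.0},
    {"plant_id_eia": 1001, "boiler_id": "B2",
     "report_date": date(2021, 12, 1), "fuel_consumed_units": 9500.0},
]

flagged = december_only_boiler_years(rows)
```

Flagged boiler-years are candidates for annual rather than monthly analysis.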

Additional Details

Reports the quantity of each type of fuel consumed by each boiler on a monthly basis, as well as the sulfur and ash content of those fuels. Fuel quantity is reported in standard EIA fuel units (tons, barrels, Mcf). Heat content per unit of fuel is also reported, making this table useful for calculating the thermal efficiency (heat rate) of various generation units.

This table provides better coverage of the entire fleet of generators than the core_eia923__monthly_generation_fuel table, but the fuel consumption reported here is not directly associated with a generator. This complicates the heat rate calculation, since the associations between individual boilers and generators are incomplete and can be complex.
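The arithmetic behind a heat rate estimate can be sketched as follows: multiply fuel quantity by heat content to get MMBtu, sum across the boilers in a group, and divide by net generation. The sketch assumes the boilers have already been matched to a set of generators and that net generation in MWh comes from a separate source; both function names and all values are hypothetical.

```python
def fuel_mmbtu(fuel_consumed_units, fuel_mmbtu_per_unit):
    """Heat input in MMBtu for one record, or None if either input is null."""
    if fuel_consumed_units is None or fuel_mmbtu_per_unit is None:
        return None
    return fuel_consumed_units * fuel_mmbtu_per_unit


def heat_rate_mmbtu_per_mwh(total_fuel_mmbtu, net_generation_mwh):
    """Heat rate in MMBtu per MWh, or None when generation is missing
    or not positive."""
    if total_fuel_mmbtu is None or not net_generation_mwh or net_generation_mwh <= 0:
        return None
    return total_fuel_mmbtu / net_generation_mwh


# Hypothetical monthly records for two boilers feeding the same generators:
# (fuel_consumed_units, fuel_mmbtu_per_unit)
boiler_fuel = [(10_000.0, 20.0), (5_000.0, 22.0)]

total_mmbtu = sum(fuel_mmbtu(units, heat) for units, heat in boiler_fuel)
heat_rate = heat_rate_mmbtu_per_mwh(total_mmbtu, 31_000.0)
```

Returning None for zero or negative generation avoids reporting a meaningless heat rate for months in which a unit consumed fuel but produced no net output.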

This table inherits canonicalized values for plants and utilities. EIA reports many attributes in many different tables across EIA-860 and EIA-923. In order to compile tidy, well-normalized database tables, PUDL collects all instances of these values and chooses a canonical value. By default, PUDL chooses the most consistently reported value of a given attribute, as long as that value accounts for at least 70% of the reported instances. If an attribute was reported inconsistently across the original EIA tables, it will show up as a null value.
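The 70% rule described above can be illustrated with a small sketch. This is not PUDL's implementation; the function name, the choice to ignore null reports when computing the share, and the sample plant names are all assumptions made for illustration.

```python
from collections import Counter


def canonical_value(reported, threshold=0.7):
    """Return the most common non-null value if it accounts for at least
    `threshold` of the non-null reports; otherwise return None."""
    observed = [v for v in reported if v is not None]  # assumption: nulls ignored
    if not observed:
        return None
    value, count = Counter(observed).most_common(1)[0]
    return value if count / len(observed) >= threshold else None


# Hypothetical plant names reported across several source tables.
consistent = canonical_value(["Plant A", "Plant A", "Plant A", "Plant A (Unit 1)"])
inconsistent = canonical_value(["Plant A", "Plant A", "Plant B"])
```

In the first call the leading value covers 3 of 4 reports (75%), so it is kept; in the second it covers only 2 of 3 (about 67%), so the result is null.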

Columns
report_date

Date reported.

plant_id_eia

The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.

plant_id_pudl

A manually assigned PUDL plant ID. May not be constant over time.

plant_name_eia

Plant name.

utility_id_eia

The EIA Utility Identification number.

utility_id_pudl

A manually assigned PUDL utility ID. May not be stable over time.

utility_name_eia

The name of the utility.

boiler_id

Alphanumeric boiler ID.

unit_id_pudl

Dynamically assigned PUDL unit id. WARNING: This ID is not guaranteed to be static long term as the input data and algorithm may evolve over time.

energy_source_code

A 2-3 letter code indicating the energy source (e.g. fuel type) associated with the record.

prime_mover_code

Code for the type of prime mover (e.g. CT, CG)

fuel_type_code_pudl

Simplified fuel type code used in PUDL

fuel_consumed_units

Consumption of the fuel type in physical unit. Note: this is the total quantity consumed for both electricity and, in the case of combined heat and power plants, process steam production.

fuel_mmbtu_per_unit

Heat content of the fuel in millions of Btus per physical unit.

fuel_consumed_mmbtu

Total consumption of fuel in physical unit, year to date. Note: this is the total quantity consumed for both electricity and, in the case of combined heat and power plants, process steam production.

sulfur_content_pct

Sulfur content percentage by weight to the nearest 0.01 percent.

ash_content_pct

Ash content percentage by weight to the nearest 0.1 percent.

data_maturity

Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.

out_eia923__generation

package: pudl

Net electricity generated by power plant generators.

Processing:

Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.

Source:

EIA Form 923 -- Power Plant Operations Report (Schedule 3)

Primary key:

plant_id_eia, generator_id, report_date

Usage Warnings

  • Date column arbitrarily uses the first of the month.

  • Not all IDs are present.

  • Table has known low coverage - either geographic or temporal or otherwise.

  • A small number of respondents only report annual fuel consumption, and all of it is reported in December.

  • Data has been drawn from several EIA sources which are not always consistent with each other, and PUDL chooses the most consistent or relevant value to facilitate cross-referencing even if that means some values will differ from the raw sources. See Harvesting for details.

Additional Details

This is the most granular information we have about how much electricity individual generators are producing, but only about half of all the generation reported in the core_eia923__monthly_generation_fuel table appears in this table due to the different reporting requirements imposed on different types and sizes of generators.
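To check how much of the plant-level generation is covered at the generator level for a subset of the data, a comparison along the following lines may help. This is a minimal sketch on invented values: the coverage_share function and the simplified plant-level totals (one number per month rather than one per prime mover and energy source) are assumptions for illustration.

```python
# Invented plant-level monthly totals (MWh), standing in for values
# summed from core_eia923__monthly_generation_fuel.
plant_level_mwh = {"2022-01-01": 1000.0, "2022-02-01": 800.0}

# Invented generator-level records, standing in for rows of this table.
generator_level = [
    {"report_date": "2022-01-01", "generator_id": "1", "net_generation_mwh": 400.0},
    {"report_date": "2022-01-01", "generator_id": "2A", "net_generation_mwh": 100.0},
    {"report_date": "2022-02-01", "generator_id": "1", "net_generation_mwh": 400.0},
]


def coverage_share(plant_totals, generator_rows):
    """Fraction of plant-level net generation also reported per generator."""
    reported = sum(
        row["net_generation_mwh"]
        for row in generator_rows
        if row["net_generation_mwh"] is not None
    )
    total = sum(plant_totals.values())
    return reported / total if total else None


share = coverage_share(plant_level_mwh, generator_level)
```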

Whenever possible, we use this generator-level net generation to estimate the heat rates of generation units and the marginal cost of electricity on a per-generator basis, but those calculations depend on plant-level fuel costs and sometimes uncertain or incomplete boiler-generator associations.

This table inherits canonicalized values for plants and utilities. EIA reports many attributes in many different tables across EIA-860 and EIA-923. In order to compile tidy, well-normalized database tables, PUDL collects all instances of these values and chooses a canonical value. By default, PUDL chooses the most consistently reported value of a given attribute as long as it accounts for at least 70% of the reported instances. If an attribute was reported inconsistently across the original EIA tables, then it will show up as a null value.

Columns
report_date

Date reported.

plant_id_eia

The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.

plant_id_pudl

A manually assigned PUDL plant ID. May not be constant over time.

plant_name_eia

Plant name.

utility_id_eia

The EIA Utility Identification number.

utility_id_pudl

A manually assigned PUDL utility ID. May not be stable over time.

utility_name_eia

The name of the utility.

generator_id

Generator ID is usually numeric, but sometimes includes letters. Make sure you treat it as a string!

unit_id_pudl

Dynamically assigned PUDL unit id. WARNING: This ID is not guaranteed to be static long term as the input data and algorithm may evolve over time.

net_generation_mwh

Net electricity generation for the specified period in megawatt-hours (MWh).

data_maturity

Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.
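The generator_id column listed above needs to stay a string. The short sketch below shows what can go wrong when it is cast to a number. The sample IDs and the to_int_or_none helper are invented for illustration and are not taken from the data.

```python
# Hypothetical generator IDs as they might appear in an export.
raw_ids = ["1", "01", "GT1", "2A"]

# Keeping the IDs as strings preserves every distinct identifier.
as_strings = set(raw_ids)


def to_int_or_none(value):
    """Mimic a lossy numeric cast: non-numeric IDs become None."""
    try:
        return int(value)
    except ValueError:
        return None


# A numeric cast merges "1" with "01" and loses "GT1" and "2A".
as_numbers = {to_int_or_none(value) for value in raw_ids}
```

Four distinct identifiers collapse to two values after the cast, which would silently break joins on the primary key.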

out_eia923__generation_fuel_combined

package: pudl

Generation and fuel consumption of all generation units.

Processing:

Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.

Source:

EIA Form 923 -- Power Plant Operations Report (Schedule 3)

Primary key:

plant_id_eia, report_date, prime_mover_code, energy_source_code

Usage Warnings

  • Date column arbitrarily uses the first of the month.

  • A small number of respondents only report annual fuel consumption, and all of it is reported in December.

  • Data has been drawn from several EIA sources which are not always consistent with each other, and PUDL chooses the most consistent or relevant value to facilitate cross-referencing even if that means some values will differ from the raw sources. See Harvesting for details.
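Because of the December reporting warning above, it can be useful to flag plant-years whose only nonzero fuel consumption falls in December before treating the data as a true monthly series. The following is a sketch on invented rows; the december_only_plant_years function is hypothetical and not part of PUDL.

```python
from collections import defaultdict

# Invented records: (plant_id_eia, report_date, fuel_consumed_mmbtu).
# Plant 101 reports everything in December; plant 202 reports monthly.
rows = [(101, f"2022-{m:02d}-01", 12000.0 if m == 12 else 0.0) for m in range(1, 13)]
rows += [(202, f"2022-{m:02d}-01", 1000.0) for m in range(1, 13)]


def december_only_plant_years(records):
    """Return (plant_id_eia, year) pairs whose only nonzero month is December."""
    nonzero_months = defaultdict(set)
    for plant_id, report_date, mmbtu in records:
        year, month = int(report_date[:4]), int(report_date[5:7])
        if mmbtu:  # skips both None and 0.0
            nonzero_months[(plant_id, year)].add(month)
    return sorted(key for key, months in nonzero_months.items() if months == {12})


flagged = december_only_plant_years(rows)
```

Note that a plant which genuinely operated only in December would be flagged as well, so the result is a list of candidates to inspect rather than a definitive list of annual reporters.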

Additional Details

Denormalized, combined data from the core_eia923__monthly_generation_fuel and core_eia923__monthly_generation_fuel_nuclear with nuclear generation aggregated from the nuclear generation unit level up to the plant prime mover level, so as to be compatible with fossil fuel generation data.

This data can't be easily linked to individual boilers, generators, and generation units, but it provides the most complete coverage of fuel consumption and electricity generation for the entire generation fleet. We use the primary fuels and prime movers reported for each generator along with their capacities to attribute fuel consumption and generation when it isn't directly reported in the core_eia923__monthly_generation and core_eia923__monthly_boiler_fuel tables in order to calculate capacity factors, heat rates, and the marginal cost of electricity.

The table makes a distinction between all fuel consumed and fuel consumed for electricity generation because some units are also combined heat and power (CHP) units, and also provide high temperature process heat at the expense of net electricity generation.
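The practical effect of this distinction can be seen by computing a heat rate both ways. The sketch below uses one invented record with the column names from this table; the two helper functions are hypothetical.

```python
# One invented record for a CHP unit.
record = {
    "fuel_consumed_mmbtu": 10000.0,
    "fuel_consumed_for_electricity_mmbtu": 8000.0,
    "net_generation_mwh": 800.0,
}


def electricity_fuel_share(row):
    """Share of total fuel heat content that went to electricity generation."""
    total = row["fuel_consumed_mmbtu"]
    return row["fuel_consumed_for_electricity_mmbtu"] / total if total else None


def heat_rate_mmbtu_per_mwh(row, electricity_only=True):
    """Heat rate in MMBtu per MWh, using either fuel column."""
    fuel_col = (
        "fuel_consumed_for_electricity_mmbtu" if electricity_only else "fuel_consumed_mmbtu"
    )
    mwh = row["net_generation_mwh"]
    return row[fuel_col] / mwh if mwh else None


elec_share = electricity_fuel_share(record)
heat_rate_electric = heat_rate_mmbtu_per_mwh(record)
heat_rate_total = heat_rate_mmbtu_per_mwh(record, electricity_only=False)
```

Using total fuel for a CHP unit overstates the heat rate of its electricity production, since part of that fuel went to process heat.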

This table inherits canonicalized values for plants and utilities. EIA reports many attributes in many different tables across EIA-860 and EIA-923. In order to compile tidy, well-normalized database tables, PUDL collects all instances of these values and chooses a canonical value. By default, PUDL chooses the most consistently reported value of a given attribute as long as it accounts for at least 70% of the reported instances. If an attribute was reported inconsistently across the original EIA tables, then it will show up as a null value.

Columns
report_date

Date reported.

plant_id_eia

The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.

plant_id_pudl

A manually assigned PUDL plant ID. May not be constant over time.

plant_name_eia

Plant name.

utility_id_eia

The EIA Utility Identification number.

utility_id_pudl

A manually assigned PUDL utility ID. May not be stable over time.

utility_name_eia

The name of the utility.

energy_source_code

A 2-3 letter code indicating the energy source (e.g. fuel type) associated with the record.

fuel_type_code_pudl

Simplified fuel type code used in PUDL

fuel_type_code_agg

A partial aggregation of the reported fuel type codes into larger categories used by EIA in, for example, the Annual Energy Review (AER) or Monthly Energy Review (MER). Two or three letter alphanumeric.

prime_mover_code

Code for the type of prime mover (e.g. CT, CG)

fuel_consumed_units

Consumption of the fuel type in physical unit. Note: this is the total quantity consumed for both electricity and, in the case of combined heat and power plants, process steam production.

fuel_consumed_for_electricity_units

Consumption for electric generation of the fuel type in physical unit.

fuel_mmbtu_per_unit

Heat content of the fuel in millions of Btus per physical unit.

fuel_consumed_mmbtu

Total consumption of fuel in physical unit, year to date. Note: this is the total quantity consumed for both electricity and, in the case of combined heat and power plants, process steam production.

fuel_consumed_for_electricity_mmbtu

Total consumption of fuel to produce electricity, in physical unit, year to date.

net_generation_mwh

Net electricity generation for the specified period in megawatt-hours (MWh).

data_maturity

Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.

out_eia923__monthly_boiler_fuel

package: pudl

Monthly time series of boiler fuel consumption and emissions.

Processing:

Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.

Source:

EIA Form 923 -- Power Plant Operations Report (Schedule 3)

Primary key:

plant_id_eia, boiler_id, energy_source_code, prime_mover_code, report_date

Usage Warnings

  • Date column arbitrarily uses the first of the month.

  • A small number of respondents only report annual fuel consumption, and all of it is reported in December.

  • Data has been drawn from several EIA sources which are not always consistent with each other, and PUDL chooses the most consistent or relevant value to facilitate cross-referencing even if that means some values will differ from the raw sources. See Harvesting for details.

Additional Details

Reports the quantity of each type of fuel consumed by each boiler on a monthly basis, as well as the sulfur and ash content of those fuels. Fuel quantity is reported in standard EIA fuel units (tons, barrels, Mcf). Heat content per unit of fuel is also reported, making this table useful for calculating the thermal efficiency (heat rate) of various generation units.

This table provides better coverage of the entire fleet of generators than the core_eia923__monthly_generation_fuel table, but the fuel consumption reported here is not directly associated with a generator. This complicates the heat rate calculation, since the associations between individual boilers and generators are incomplete and can be complex.
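For the simplest case, where one boiler is known to feed exactly one generator, the heat rate arithmetic looks roughly like the sketch below. All values are invented, and the one-to-one boiler-generator association is an assumption that often does not hold in the real data.

```python
# Invented fuel records for a single boiler in a single month.
boiler_fuel = [
    {"boiler_id": "B1", "energy_source_code": "BIT",
     "fuel_consumed_units": 1000.0, "fuel_mmbtu_per_unit": 20.0},
    {"boiler_id": "B1", "energy_source_code": "DFO",
     "fuel_consumed_units": 100.0, "fuel_mmbtu_per_unit": 5.5},
]

# Invented net generation for the one generator assumed to be fed by B1.
net_generation_mwh = 2055.0

# Heat input per fuel is quantity (tons, barrels, Mcf) times heat content per unit.
total_mmbtu = sum(
    row["fuel_consumed_units"] * row["fuel_mmbtu_per_unit"] for row in boiler_fuel
)

# Heat rate in MMBtu per MWh; lower values mean higher thermal efficiency.
heat_rate = total_mmbtu / net_generation_mwh
```

When several boilers feed several generators, fuel and generation would first need to be summed over the whole group of associated units before dividing.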

This table inherits canonicalized values for plants and utilities. EIA reports many attributes in many different tables across EIA-860 and EIA-923. In order to compile tidy, well-normalized database tables, PUDL collects all instances of these values and chooses a canonical value. By default, PUDL chooses the most consistently reported value of a given attribute as long as it accounts for at least 70% of the reported instances. If an attribute was reported inconsistently across the original EIA tables, then it will show up as a null value.

Columns
report_date

Date reported.

plant_id_eia

The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.

plant_id_pudl

A manually assigned PUDL plant ID. May not be constant over time.

plant_name_eia

Plant name.

utility_id_eia

The EIA Utility Identification number.

utility_id_pudl

A manually assigned PUDL utility ID. May not be stable over time.

utility_name_eia

The name of the utility.

boiler_id

Alphanumeric boiler ID.

unit_id_pudl

Dynamically assigned PUDL unit id. WARNING: This ID is not guaranteed to be static long term as the input data and algorithm may evolve over time.

energy_source_code

A 2-3 letter code indicating the energy source (e.g. fuel type) associated with the record.

prime_mover_code

Code for the type of prime mover (e.g. CT, CG)

fuel_type_code_pudl

Simplified fuel type code used in PUDL

fuel_consumed_units

Consumption of the fuel type in physical unit. Note: this is the total quantity consumed for both electricity and, in the case of combined heat and power plants, process steam production.

fuel_mmbtu_per_unit

Heat content of the fuel in millions of Btus per physical unit.

fuel_consumed_mmbtu

Total consumption of fuel in physical unit, year to date. Note: this is the total quantity consumed for both electricity and, in the case of combined heat and power plants, process steam production.

sulfur_content_pct

Sulfur content percentage by weight to the nearest 0.01 percent.

ash_content_pct

Ash content percentage by weight to the nearest 0.1 percent.

data_maturity

Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.

out_eia923__monthly_fuel_receipts_costs

package: pudl

Monthly time series of fuel deliveries to power plants.

Processing:

Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.

Source:

EIA Form 923 -- Power Plant Operations Report (Schedule 2 - Part A)

Primary key:

This table has no primary key. Each record describes an individual fuel delivery. There can be multiple deliveries of the same type of fuel from the same supplier to the same plant in a single month, so the table has no natural primary key.

Usage Warnings

  • Date column arbitrarily uses the first of the month.

  • Some values have been redacted.

  • Time of fuel deliveries is not necessarily connected with time of fuel consumption.

  • Data has been drawn from several EIA sources which are not always consistent with each other, and PUDL chooses the most consistent or relevant value to facilitate cross-referencing even if that means some values will differ from the raw sources. See Harvesting for details.

Additional Details

There can be a significant delay between the receipt of fuel and its consumption, so using this table to infer monthly attributes associated with power generation may not be entirely accurate. However, this is the most granular data we have describing fuel costs, and we use it in calculating the marginal cost of electricity for individual generation units.

Under some circumstances utilities are allowed to treat the price of fuel as proprietary business data, meaning it is redacted from the publicly available spreadsheets. It's still reported to EIA and influences the aggregated (state, region, annual, etc.) fuel prices they publish. From 2009-2021 about 1/3 of all prices are redacted. The missing data is not randomly distributed. Deregulated markets dominated by merchant generators (independent power producers) redact much more data, and natural gas is by far the most likely fuel to have its price redacted. This means, for instance, that the entire Northeastern US reports essentially no fine-grained data about its natural gas prices.
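When summarizing fuel prices, it helps to report how much of the delivered heat content has no usable price alongside the average itself. The sketch below computes a heat-content-weighted average price and the redacted share for a few invented deliveries. It assumes redacted prices appear as nulls (None here); the summarize_deliveries function is hypothetical.

```python
# Invented deliveries; a price of None stands in for a redacted value.
deliveries = [
    {"fuel_received_units": 100.0, "fuel_mmbtu_per_unit": 20.0, "fuel_cost_per_mmbtu": 2.0},
    {"fuel_received_units": 50.0, "fuel_mmbtu_per_unit": 20.0, "fuel_cost_per_mmbtu": 3.0},
    {"fuel_received_units": 1000.0, "fuel_mmbtu_per_unit": 1.0, "fuel_cost_per_mmbtu": None},
]


def summarize_deliveries(rows):
    """Return the MMBtu-weighted average price and the redacted MMBtu share."""
    priced_mmbtu = priced_cost = redacted_mmbtu = 0.0
    for row in rows:
        mmbtu = row["fuel_received_units"] * row["fuel_mmbtu_per_unit"]
        price = row["fuel_cost_per_mmbtu"]
        if price is None:
            redacted_mmbtu += mmbtu
        else:
            priced_mmbtu += mmbtu
            priced_cost += mmbtu * price
    total_mmbtu = priced_mmbtu + redacted_mmbtu
    return {
        "avg_cost_per_mmbtu": priced_cost / priced_mmbtu if priced_mmbtu else None,
        "redacted_share": redacted_mmbtu / total_mmbtu if total_mmbtu else None,
    }


summary = summarize_deliveries(deliveries)
```

Because redactions are not randomly distributed, an average computed only from the priced deliveries may not be representative of the redacted ones.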

Additional data which we haven't yet integrated is available in a similar format from 2002-2008 via the EIA-423, and going back as far as 1972 from the FERC-423.

This table inherits canonicalized values for plants and utilities. EIA reports many attributes in many different tables across EIA-860 and EIA-923. In order to compile tidy, well-normalized database tables, PUDL collects all instances of these values and chooses a canonical value. By default, PUDL chooses the most consistently reported value of a given attribute as long as it accounts for at least 70% of the reported instances. If an attribute was reported inconsistently across the original EIA tables, then it will show up as a null value.

Columns
report_date

Date reported.

plant_id_eia

The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.

plant_id_pudl

A manually assigned PUDL plant ID. May not be constant over time.

plant_name_eia

Plant name.

utility_id_eia

The EIA Utility Identification number.

utility_id_pudl

A manually assigned PUDL utility ID. May not be stable over time.

utility_name_eia

The name of the utility.

state

Two letter US state abbreviation.

fuel_type_code_pudl

Simplified fuel type code used in PUDL

fuel_received_units

Quantity of fuel received in tons, barrels, or Mcf.

fuel_mmbtu_per_unit

Heat content of the fuel in millions of Btus per physical unit.

fuel_cost_per_mmbtu

Average fuel cost per mmBTU of heat content in nominal USD.

fuel_consumed_mmbtu

Total consumption of fuel in physical unit, year to date. Note: this is the total quantity consumed for both electricity and, in the case of combined heat and power plants, process steam production.

total_fuel_cost

Total annual reported fuel costs for the plant part. Includes costs from all fuels.

fuel_cost_per_mmbtu_source

Indicates the source of the values in the fuel_cost_per_mmbtu column. The fuel cost either comes directly from the EIA forms (original), was filled in from the EIA's API using state-level averages (eiaapi), was filled in using a rolling average (rolling_avg), or reflects multiple sources because records were aggregated together (mixed).

sulfur_content_pct

Sulfur content percentage by weight to the nearest 0.01 percent.

ash_content_pct

Ash content percentage by weight to the nearest 0.1 percent.

mercury_content_ppm

Mercury content in parts per million (ppm) to the nearest 0.001 ppm.

moisture_content_pct

For coal only: the moisture content of the fuel in terms of moisture percentage by weight. Reported to the nearest 0.01 percent.

chlorine_content_ppm

For coal only: the chlorine content in parts per million (ppm) to the nearest 0.001 ppm. If lab tests of the coal do not include the chlorine content, this field contains the amount specified in the contract with the supplier.

data_maturity

Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.

out_eia923__monthly_generation

package: pudl

Monthly time series of net electricity generated by power plant generators.

Processing:

Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.

Source:

EIA Form 923 -- Power Plant Operations Report (Schedule 3)

Primary key:

plant_id_eia, generator_id, report_date

Usage Warnings

  • Date column arbitrarily uses the first of the month.

  • Not all IDs are present.

  • Table has known low coverage - either geographic or temporal or otherwise.

  • A small number of respondents only report annual fuel consumption, and all of it is reported in December.

  • Data has been drawn from several EIA sources which are not always consistent with each other, and PUDL chooses the most consistent or relevant value to facilitate cross-referencing even if that means some values will differ from the raw sources. See Harvesting for details.

Additional Details

This is the most granular information we have about how much electricity individual generators are producing, but only about half of all the generation reported in the core_eia923__monthly_generation_fuel table appears in this table due to the different reporting requirements imposed on different types and sizes of generators.

Whenever possible, we use this generator-level net generation to estimate the heat rates of generation units and the marginal cost of electricity on a per-generator basis, but those calculations depend on plant-level fuel costs and sometimes uncertain or incomplete boiler-generator associations.

This table inherits canonicalized values for plants and utilities. EIA reports many attributes in many different tables across EIA-860 and EIA-923. In order to compile tidy, well-normalized database tables, PUDL collects all instances of these values and chooses a canonical value. By default, PUDL chooses the most consistently reported value of a given attribute as long as it accounts for at least 70% of the reported instances. If an attribute was reported inconsistently across the original EIA tables, then it will show up as a null value.

Columns
report_date

Date reported.

plant_id_eia

The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.

plant_id_pudl

A manually assigned PUDL plant ID. May not be constant over time.

plant_name_eia

Plant name.

utility_id_eia

The EIA Utility Identification number.

utility_id_pudl

A manually assigned PUDL utility ID. May not be stable over time.

utility_name_eia

The name of the utility.

generator_id

Generator ID is usually numeric, but sometimes includes letters. Make sure you treat it as a string!

unit_id_pudl

Dynamically assigned PUDL unit id. WARNING: This ID is not guaranteed to be static long term as the input data and algorithm may evolve over time.

net_generation_mwh

Net electricity generation for the specified period in megawatt-hours (MWh).

data_maturity

Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.

out_eia923__monthly_generation_fuel_by_generator

package: pudl

Monthly time series of estimated net generation and fuel consumption by generator.

Processing:

Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.

Source:

EIA Form 923 -- Power Plant Operations Report (Schedule 3)

Primary key:

report_date, plant_id_eia, generator_id

Usage Warnings

  • Contains estimated values.

  • The downscaling process used to create this table does not distinguish between primary and secondary energy sources for generators (see below for implications).

  • Date column arbitrarily uses the first of the month.

  • A small number of respondents only report annual fuel consumption, and all of it is reported in December.

  • Data has been drawn from several EIA sources which are not always consistent with each other, and PUDL chooses the most consistent or relevant value to facilitate cross-referencing even if that means some values will differ from the raw sources. See Harvesting for details.

  • Contains information from multiple raw inputs.

Additional Details

Based on allocating net electricity generation and fuel consumption reported in the EIA-923 generation and generation_fuel tables to individual generators.

The net generation and fuel consumption allocation method PUDL employs begins with the following context of the originally reported EIA-860 and EIA-923 data:

  • The core_eia923__monthly_generation_fuel table is the authoritative source of information about how much generation and fuel consumption is attributable to an entire plant. This table has the most complete data coverage, but it is not the most granular data reported.

  • The core_eia923__monthly_generation table contains the most granular net generation data. It is reported at the plant_id_eia, generator_id and report_date level. This table includes only ~40% of the total MWhs reported in the core_eia923__monthly_generation_fuel table.

  • The core_eia923__monthly_boiler_fuel table contains the most granular fuel consumption data. It is reported at the boiler/prime mover/energy source level. This table includes only ~40% of the total MMBTUs reported in the core_eia923__monthly_generation_fuel table.

  • The core_eia860__scd_generators table provides an exhaustive list of all generators whose generation is being reported in the core_eia923__monthly_generation_fuel table.

In this table, PUDL aggregates the net generation and fuel consumption that has been allocated to the generator_id/energy_source_code/prime_mover_code level in the out_eia923__monthly_generation_fuel_by_generator_energy_source table up to the generator level. This process does not distinguish between primary and secondary energy sources for generators. Net generation is allocated equally between energy source codes, so if a plant has multiple generators with the same prime_mover_code but different energy source codes, the core_eia923__monthly_generation_fuel records will be associated similarly between these two generators. Allocated net generation will still be proportional to each generator's net generation or capacity.
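The aggregation step described above amounts to summing the allocated values over energy source and prime mover for each generator and month. A minimal sketch on invented rows follows; the aggregate_to_generator function is hypothetical and only two value columns are shown.

```python
from collections import defaultdict

VALUE_COLS = ("net_generation_mwh", "fuel_consumed_mmbtu")

# Invented rows at the generator / prime mover / energy source level.
allocated = [
    {"report_date": "2022-01-01", "plant_id_eia": 1, "generator_id": "1",
     "prime_mover_code": "ST", "energy_source_code": "BIT",
     "net_generation_mwh": 900.0, "fuel_consumed_mmbtu": 9000.0},
    {"report_date": "2022-01-01", "plant_id_eia": 1, "generator_id": "1",
     "prime_mover_code": "ST", "energy_source_code": "DFO",
     "net_generation_mwh": 100.0, "fuel_consumed_mmbtu": 1000.0},
    {"report_date": "2022-01-01", "plant_id_eia": 1, "generator_id": "2",
     "prime_mover_code": "GT", "energy_source_code": "NG",
     "net_generation_mwh": 500.0, "fuel_consumed_mmbtu": 5500.0},
]


def aggregate_to_generator(rows):
    """Sum value columns per (report_date, plant_id_eia, generator_id)."""
    totals = defaultdict(lambda: dict.fromkeys(VALUE_COLS, 0.0))
    for row in rows:
        key = (row["report_date"], row["plant_id_eia"], row["generator_id"])
        for col in VALUE_COLS:
            totals[key][col] += row[col]
    return dict(totals)


by_generator = aggregate_to_generator(allocated)
```

The grouping key matches this table's primary key, so each generator appears once per month in the result.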

This table inherits canonicalized values for generators, plants, and utilities. EIA reports many attributes in many different tables across EIA-860 and EIA-923. In order to compile tidy, well-normalized database tables, PUDL collects all instances of these values and chooses a canonical value. By default, PUDL chooses the most consistently reported value of a given attribute as long as it accounts for at least 70% of the reported instances. If an attribute was reported inconsistently across the original EIA tables, then it will show up as a null value.

Columns
report_date

Date reported.

plant_id_eia

The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.

plant_id_pudl

A manually assigned PUDL plant ID. May not be constant over time.

plant_name_eia

Plant name.

utility_id_eia

The EIA Utility Identification number.

utility_id_pudl

A manually assigned PUDL utility ID. May not be stable over time.

utility_name_eia

The name of the utility.

generator_id

Generator ID is usually numeric, but sometimes includes letters. Make sure you treat it as a string!

unit_id_pudl

Dynamically assigned PUDL unit id. WARNING: This ID is not guaranteed to be static long term as the input data and algorithm may evolve over time.

fuel_consumed_for_electricity_mmbtu

Total consumption of fuel to produce electricity, in physical unit, year to date.

fuel_consumed_mmbtu

Total consumption of fuel in physical unit, year to date. Note: this is the total quantity consumed for both electricity and, in the case of combined heat and power plants, process steam production.

net_generation_mwh

Net electricity generation for the specified period in megawatt-hours (MWh).

out_eia923__monthly_generation_fuel_by_generator_energy_source

package: pudl

Monthly time series of estimated net generation and fuel consumption associated with each combination of generator, energy source, and prime mover.

Processing:

Data is ready for use in analyses, but for practical reasons has not been denormalized and remains in narrow format.

Source:

EIA Form 923 -- Power Plant Operations Report (Schedule 3)

Primary key:

report_date, plant_id_eia, generator_id, prime_mover_code, energy_source_code

Usage Warnings

  • Contains estimated values.

  • The downscaling process used to create this table does not distinguish between primary and secondary energy sources for generators (see below for implications).

  • Date column arbitrarily uses the first of the month.

  • A small number of respondents only report annual fuel consumption, and all of it is reported in December.

  • Contains information from multiple raw inputs.

Additional Details

The net generation and fuel consumption allocation method PUDL employs begins with the following context of the originally reported EIA-860 and EIA-923 data:

  • The core_eia923__monthly_generation_fuel table is the authoritative source of information about how much generation and fuel consumption is attributable to an entire plant. This table has the most complete data coverage, but it is not the most granular data reported.

  • The core_eia923__monthly_generation table contains the most granular net generation data. It is reported at the plant_id_eia, generator_id and report_date level. This table includes only ~40% of the total MWhs reported in the core_eia923__monthly_generation_fuel table.

  • The core_eia923__monthly_boiler_fuel table contains the most granular fuel consumption data. It is reported at the boiler/prime mover/energy source level. This table includes only ~40% of the total MMBTUs reported in the core_eia923__monthly_generation_fuel table.

  • The core_eia860__scd_generators table provides an exhaustive list of all generators whose generation is being reported in the core_eia923__monthly_generation_fuel table.

In this table, PUDL has allocated the net electricity generation and fuel consumption from core_eia923__monthly_generation_fuel to the generator_id/energy_source_code/prime_mover_code level.

The allocation process entails generating a fraction for each record based on the net generation in the core_eia923__monthly_generation table and the capacity from the core_eia860__scd_generators table. This process does not distinguish between primary and secondary energy_sources for generators. Net generation is allocated equally between energy source codes, so if a plant has multiple generators with the same prime_mover_code but different energy source codes the core_eia923__monthly_generation_fuel records will be associated similarly between these two generators. Allocated net generation will still be proportional to each generator's net generation or capacity.
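The idea of allocating by a per-record fraction can be sketched as follows. This is a heavily simplified illustration, not PUDL's actual algorithm: it assumes a single plant-level total, weights generators by reported net generation when every generator has it and by capacity otherwise, and uses a hypothetical allocate_to_generators function with invented values.

```python
def allocate_to_generators(total_mwh, generators):
    """Split a plant-level total across generators in proportion to a weight.

    Simplifying assumption: use reported net generation as the weight if all
    generators have it, otherwise fall back to capacity.
    """
    has_reported = all(g["net_generation_mwh"] is not None for g in generators)
    weight_col = "net_generation_mwh" if has_reported else "capacity_mw"
    total_weight = sum(g[weight_col] for g in generators)
    if not total_weight:
        raise ValueError("cannot allocate: all weights are zero")
    return {
        g["generator_id"]: total_mwh * g[weight_col] / total_weight
        for g in generators
    }


# No generator-level reporting: allocate by capacity (100 MW vs 300 MW).
by_capacity = allocate_to_generators(1000.0, [
    {"generator_id": "1", "net_generation_mwh": None, "capacity_mw": 100.0},
    {"generator_id": "2", "net_generation_mwh": None, "capacity_mw": 300.0},
])

# Generator-level reporting available: allocate by reported net generation.
by_generation = allocate_to_generators(1000.0, [
    {"generator_id": "1", "net_generation_mwh": 600.0, "capacity_mw": 100.0},
    {"generator_id": "2", "net_generation_mwh": 200.0, "capacity_mw": 300.0},
])
```

In both cases the allocated values sum back to the plant-level total, which is the property the allocation is meant to preserve.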

Columns
report_date

Date reported.

plant_id_eia

The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.

generator_id

Generator ID is usually numeric, but sometimes includes letters. Make sure you treat it as a string!

prime_mover_code

Code for the type of prime mover (e.g. CT, CG)

energy_source_code

A 2-3 letter code indicating the energy source (e.g. fuel type) associated with the record.

energy_source_code_num

Name of the energy_source_code_N column that this energy source code was reported in for the generator referenced in the same record.

net_generation_mwh

Net electricity generation for the specified period in megawatt-hours (MWh).

fuel_consumed_mmbtu

Total consumption of fuel in physical unit, year to date. Note: this is the total quantity consumed for both electricity and, in the case of combined heat and power plants, process steam production.

fuel_consumed_for_electricity_mmbtu

Total consumption of fuel to produce electricity, in physical unit, year to date.

out_eia923__monthly_generation_fuel_combined

package: pudl

Monthly time series of generation and fuel consumption of all generation units.

Processing:

Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.

Source:

EIA Form 923 -- Power Plant Operations Report (Schedule 3)

Primary key:

plant_id_eia, report_date, prime_mover_code, energy_source_code

Usage Warnings

  • Date column arbitrarily uses the first of the month.

  • A small number of respondents only report annual fuel consumption, and all of it is reported in December.

  • Data has been drawn from several EIA sources which are not always consistent with each other, and PUDL chooses the most consistent or relevant value to facilitate cross-referencing even if that means some values will differ from the raw sources. See Harvesting for details.

Additional Details

Denormalized, combined data from the core_eia923__monthly_generation_fuel and core_eia923__monthly_generation_fuel_nuclear with nuclear generation aggregated from the nuclear generation unit level up to the plant prime mover level, so as to be compatible with fossil fuel generation data.

This data can't be easily linked to individual boilers, generators, and generation units, but it provides the most complete coverage of fuel consumption and electricity generation for the entire generation fleet. When fuel consumption and generation aren't directly reported in the core_eia923__monthly_generation and core_eia923__monthly_boiler_fuel tables, we attribute them using the primary fuels and prime movers reported for each generator, along with their capacities, so that we can calculate capacity factors, heat rates, and the marginal cost of electricity.

The table makes a distinction between all fuel consumed and fuel consumed for electricity generation because some units are also combined heat and power (CHP) units, and also provide high temperature process heat at the expense of net electricity generation.
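One way to use the two fuel columns together is to compute the share of fuel that went to electricity generation. The helper below is an illustrative sketch; the function name is hypothetical, and treating a zero or null total as "no share available" is an assumption.

```python
def electricity_fuel_share(fuel_consumed_mmbtu, fuel_consumed_for_electricity_mmbtu):
    """Fraction of total fuel heat input used for electricity (hypothetical helper)."""
    if not fuel_consumed_mmbtu or fuel_consumed_for_electricity_mmbtu is None:
        # Null or zero total fuel: the share is undefined
        return None
    return fuel_consumed_for_electricity_mmbtu / fuel_consumed_mmbtu


# Illustrative values: a CHP unit using a quarter of its fuel for process heat
share = electricity_fuel_share(1000.0, 750.0)
```

A share well below 1.0 suggests a CHP unit, so a heat rate computed from total fuel would overstate the fuel actually used to generate electricity.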

This table inherits canonicalized values for plants and utilities. EIA reports many attributes in many different tables across EIA-860 and EIA-923. In order to compile tidy, well-normalized database tables, PUDL collects all instances of these values and chooses a canonical value. By default, PUDL chooses the most consistently reported value of a given attribute as long as it accounts for at least 70% of the instances reported. If an attribute was reported inconsistently across the original EIA tables, then it will show up as a null value.
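The 70% rule can be illustrated with a short sketch. This is not PUDL's harvesting code: the function name canonical_value is hypothetical, and ignoring null reports when counting instances is an assumption.

```python
from collections import Counter


def canonical_value(reported, threshold=0.7):
    """Return the most common reported value if it meets the threshold, else None."""
    values = [v for v in reported if v is not None]  # assumption: nulls are ignored
    if not values:
        return None
    value, count = Counter(values).most_common(1)[0]
    return value if count / len(values) >= threshold else None


# Illustrative plant names, not real EIA records
consistent = canonical_value(["Barry"] * 8 + ["Barry Plant"] * 2)
inconsistent = canonical_value(["Barry", "Barry", "Barry Plant", "Barry Plant"])
```

In the second call no value reaches 70% of the reported instances, so the result is null, matching the behavior described above.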

Columns
report_date

Date reported.

plant_id_eia

The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.

plant_id_pudl

A manually assigned PUDL plant ID. May not be constant over time.

plant_name_eia

Plant name.

utility_id_eia

The EIA Utility Identification number.

utility_id_pudl

A manually assigned PUDL utility ID. May not be stable over time.

utility_name_eia

The name of the utility.

energy_source_code

A 2-3 letter code indicating the energy source (e.g. fuel type) associated with the record.

fuel_type_code_pudl

Simplified fuel type code used in PUDL

prime_mover_code

Code for the type of prime mover (e.g. CT, CG)

fuel_consumed_units

Consumption of the fuel type in physical unit. Note: this is the total quantity consumed for both electricity and, in the case of combined heat and power plants, process steam production.

fuel_consumed_for_electricity_units

Consumption for electric generation of the fuel type in physical unit.

fuel_mmbtu_per_unit

Heat content of the fuel in millions of Btus per physical unit.

fuel_consumed_mmbtu

Total consumption of fuel in physical unit, year to date. Note: this is the total quantity consumed for both electricity and, in the case of combined heat and power plants, process steam production.

fuel_consumed_for_electricity_mmbtu

Total consumption of fuel to produce electricity, in physical unit, year to date.

net_generation_mwh

Net electricity generation for the specified period in megawatt-hours (MWh).

data_maturity

Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.

out_eia923__yearly_boiler_fuel

package: pudl

Annual time series of boiler fuel consumption and emissions.

Processing:

Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.

Source:

EIA Form 923 -- Power Plant Operations Report (Schedule 3)

Primary key:

plant_id_eia, boiler_id, energy_source_code, prime_mover_code, report_date

Usage Warnings

  • Date column arbitrarily uses the first of the month.

  • A small number of respondents only report annual fuel consumption, and all of it is reported in December.

  • Data has been drawn from several EIA sources which are not always consistent with each other, and PUDL chooses the most consistent or relevant value to facilitate cross-referencing even if that means some values will differ from the raw sources. See Harvesting for details.

Additional Details

Reports the quantity of each type of fuel consumed by each boiler on a monthly basis, as well as the sulfur and ash content of those fuels. Fuel quantity is reported in standard EIA fuel units (tons, barrels, Mcf). Heat content per unit of fuel is also reported, making this table useful for calculating the thermal efficiency (heat rate) of various generation units.

This table provides better coverage of the entire fleet of generators than the core_eia923__monthly_generation_fuel table, but the fuel consumption reported here is not directly associated with a generator. This complicates the heat rate calculation, since the associations between individual boilers and generators are incomplete and can be complex.
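For reference, the arithmetic behind a heat rate is simple once fuel and generation have been matched up. The sketch below assumes that matching has already been done, which is the hard part noted above; both function names are hypothetical.

```python
def fuel_heat_mmbtu(fuel_consumed_units, fuel_mmbtu_per_unit):
    """Heat input in MMBtu: physical units times heat content per unit."""
    return fuel_consumed_units * fuel_mmbtu_per_unit


def heat_rate_mmbtu_per_mwh(fuel_mmbtu, net_generation_mwh):
    """Heat rate in MMBtu per MWh, or None when generation is zero or null."""
    if not net_generation_mwh:
        return None
    return fuel_mmbtu / net_generation_mwh


# Illustrative values: 1,000 tons of coal at 20 MMBtu/ton, 2,000 MWh generated
heat_input = fuel_heat_mmbtu(1000.0, 20.0)
heat_rate = heat_rate_mmbtu_per_mwh(heat_input, 2000.0)
```

A lower heat rate means less fuel per unit of electricity, i.e. a more efficient unit.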

This table inherits canonicalized values for plants and utilities. EIA reports many attributes in many different tables across EIA-860 and EIA-923. In order to compile tidy, well-normalized database tables, PUDL collects all instances of these values and chooses a canonical value. By default, PUDL chooses the most consistently reported value of a given attribute as long as it accounts for at least 70% of the instances reported. If an attribute was reported inconsistently across the original EIA tables, then it will show up as a null value.

Columns
report_date

Date reported.

plant_id_eia

The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.

plant_id_pudl

A manually assigned PUDL plant ID. May not be constant over time.

plant_name_eia

Plant name.

utility_id_eia

The EIA Utility Identification number.

utility_id_pudl

A manually assigned PUDL utility ID. May not be stable over time.

utility_name_eia

The name of the utility.

boiler_id

Alphanumeric boiler ID.

unit_id_pudl

Dynamically assigned PUDL unit id. WARNING: This ID is not guaranteed to be static long term as the input data and algorithm may evolve over time.

energy_source_code

A 2-3 letter code indicating the energy source (e.g. fuel type) associated with the record.

prime_mover_code

Code for the type of prime mover (e.g. CT, CG)

fuel_type_code_pudl

Simplified fuel type code used in PUDL

fuel_consumed_units

Consumption of the fuel type in physical unit. Note: this is the total quantity consumed for both electricity and, in the case of combined heat and power plants, process steam production.

fuel_mmbtu_per_unit

Heat content of the fuel in millions of Btus per physical unit.

fuel_consumed_mmbtu

Total consumption of fuel in physical unit, year to date. Note: this is the total quantity consumed for both electricity and, in the case of combined heat and power plants, process steam production.

sulfur_content_pct

Sulfur content percentage by weight to the nearest 0.01 percent.

ash_content_pct

Ash content percentage by weight to the nearest 0.1 percent.

data_maturity

Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.

out_eia923__yearly_fuel_receipts_costs

package: pudl

Annual time series of fuel deliveries to power plants.

Processing:

Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.

Source:

EIA Form 923 -- Power Plant Operations Report (Schedule 2 - Part A)

Primary key:

This table has no primary key. Each record describes an individual fuel delivery. There can be multiple deliveries of the same type of fuel from the same supplier to the same plant in a single month, so the table has no natural primary key.

Usage Warnings

  • Date column arbitrarily uses the first of the month.

  • Some values have been redacted.

  • Time of fuel deliveries is not necessarily connected with time of fuel consumption.

  • Data has been drawn from several EIA sources which are not always consistent with each other, and PUDL chooses the most consistent or relevant value to facilitate cross-referencing even if that means some values will differ from the raw sources. See Harvesting for details.

Additional Details

There can be a significant delay between the receipt of fuel and its consumption, so using this table to infer monthly attributes associated with power generation may not be entirely accurate. However, this is the most granular data we have describing fuel costs, and we use it in calculating the marginal cost of electricity for individual generation units.

Under some circumstances utilities are allowed to treat the price of fuel as proprietary business data, meaning it is redacted from the publicly available spreadsheets. It's still reported to EIA and influences the aggregated (state, region, annual, etc.) fuel prices they publish. From 2009-2021 about 1/3 of all prices are redacted. The missing data is not randomly distributed. Deregulated markets dominated by merchant generators (independent power producers) redact much more data, and natural gas is by far the most likely fuel to have its price redacted. This means, for instance, that the entire Northeastern US reports essentially no fine-grained data about its natural gas prices.
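Because the missing prices are not randomly distributed, it is worth measuring the redaction share for the fuels you care about before averaging prices. The sketch below assumes redacted prices appear as nulls (None) in fuel_cost_per_mmbtu; the function name and the sample records are made up for illustration.

```python
from collections import defaultdict


def redacted_share_by_fuel(deliveries):
    """Fraction of delivery records with a null fuel price, per fuel type."""
    totals = defaultdict(int)
    missing = defaultdict(int)
    for delivery in deliveries:
        fuel = delivery["fuel_type_code_pudl"]
        totals[fuel] += 1
        if delivery["fuel_cost_per_mmbtu"] is None:
            missing[fuel] += 1
    return {fuel: missing[fuel] / totals[fuel] for fuel in totals}


# Illustrative records only, not real EIA data
sample = [
    {"fuel_type_code_pudl": "gas", "fuel_cost_per_mmbtu": None},
    {"fuel_type_code_pudl": "gas", "fuel_cost_per_mmbtu": 3.1},
    {"fuel_type_code_pudl": "gas", "fuel_cost_per_mmbtu": None},
    {"fuel_type_code_pudl": "gas", "fuel_cost_per_mmbtu": 2.9},
    {"fuel_type_code_pudl": "coal", "fuel_cost_per_mmbtu": 2.0},
    {"fuel_type_code_pudl": "coal", "fuel_cost_per_mmbtu": 2.2},
]
shares = redacted_share_by_fuel(sample)
```

The same count could be grouped by state or year to see where the gaps are concentrated.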

Additional data which we haven't yet integrated is available in a similar format from 2002-2008 via the EIA-423, and going back as far as 1972 from the FERC-423.

This table inherits canonicalized values for plants and utilities. EIA reports many attributes in many different tables across EIA-860 and EIA-923. In order to compile tidy, well-normalized database tables, PUDL collects all instances of these values and chooses a canonical value. By default, PUDL chooses the most consistently reported value of a given attribute as long as it accounts for at least 70% of the instances reported. If an attribute was reported inconsistently across the original EIA tables, then it will show up as a null value.

Columns
report_date

Date reported.

plant_id_eia

The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.

plant_id_pudl

A manually assigned PUDL plant ID. May not be constant over time.

plant_name_eia

Plant name.

utility_id_eia

The EIA Utility Identification number.

utility_id_pudl

A manually assigned PUDL utility ID. May not be stable over time.

utility_name_eia

The name of the utility.

state

Two letter US state abbreviation.

fuel_type_code_pudl

Simplified fuel type code used in PUDL

fuel_received_units

Quantity of fuel received in tons, barrels, or Mcf.

fuel_mmbtu_per_unit

Heat content of the fuel in millions of Btus per physical unit.

fuel_cost_per_mmbtu

Average fuel cost per mmBTU of heat content in nominal USD.

fuel_consumed_mmbtu

Total consumption of fuel in physical unit, year to date. Note: this is the total quantity consumed for both electricity and, in the case of combined heat and power plants, process steam production.

total_fuel_cost

Total annual reported fuel costs for the plant part. Includes costs from all fuels.

fuel_cost_per_mmbtu_source

Indicates the source of the values in the fuel_cost_per_mmbtu column. The fuel cost either comes directly from the EIA forms (original), was filled in from the EIA's API using state-level averages (eiaapi), was filled in using a rolling average (rolling_avg), or combines multiple sources because records were aggregated together (mixed).

sulfur_content_pct

Sulfur content percentage by weight to the nearest 0.01 percent.

ash_content_pct

Ash content percentage by weight to the nearest 0.1 percent.

mercury_content_ppm

Mercury content in parts per million (ppm) to the nearest 0.001 ppm.

moisture_content_pct

For coal only: the moisture content of the fuel in terms of moisture percentage by weight. Reported to the nearest 0.01 percent.

chlorine_content_ppm

For coal only: the chlorine content in parts per million (ppm) to the nearest 0.001 ppm. If lab tests of the coal do not include the chlorine content, this field contains the amount specified in the contract with the supplier.

data_maturity

Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.

out_eia923__yearly_generation

package: pudl

Annual time series of net electricity generated by power plant generators.

Processing:

Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.

Source:

EIA Form 923 -- Power Plant Operations Report (Schedule 3)

Primary key:

plant_id_eia, generator_id, report_date

Usage Warnings

  • Date column arbitrarily uses the first of the month.

  • Not all IDs are present.

  • Table has known low coverage - either geographic or temporal or otherwise.

  • A small number of respondents only report annual fuel consumption, and all of it is reported in December.

  • Data has been drawn from several EIA sources which are not always consistent with each other, and PUDL chooses the most consistent or relevant value to facilitate cross-referencing even if that means some values will differ from the raw sources. See Harvesting for details.

Additional Details

This is the most granular information we have about how much electricity individual generators are producing, but only about half of all the generation reported in the core_eia923__monthly_generation_fuel appears in this table due to the different reporting requirements imposed on different types and sizes of generators.

Whenever possible, we use this generator-level net generation to estimate the heat rates of generation units and the marginal cost of electricity on a per-generator basis, but those calculations depend on plant-level fuel costs and sometimes uncertain or incomplete boiler-generator associations.

This table inherits canonicalized values for plants and utilities. EIA reports many attributes in many different tables across EIA-860 and EIA-923. In order to compile tidy, well-normalized database tables, PUDL collects all instances of these values and chooses a canonical value. By default, PUDL chooses the most consistently reported value of a given attribute as long as it accounts for at least 70% of the instances reported. If an attribute was reported inconsistently across the original EIA tables, then it will show up as a null value.

Columns
report_date

Date reported.

plant_id_eia

The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.

plant_id_pudl

A manually assigned PUDL plant ID. May not be constant over time.

plant_name_eia

Plant name.

utility_id_eia

The EIA Utility Identification number.

utility_id_pudl

A manually assigned PUDL utility ID. May not be stable over time.

utility_name_eia

The name of the utility.

generator_id

Generator ID is usually numeric, but sometimes includes letters. Make sure you treat it as a string!

unit_id_pudl

Dynamically assigned PUDL unit id. WARNING: This ID is not guaranteed to be static long term as the input data and algorithm may evolve over time.

net_generation_mwh

Net electricity generation for the specified period in megawatt-hours (MWh).

data_maturity

Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.

out_eia923__yearly_generation_fuel_by_generator

package: pudl

Annual time series of estimated net generation and fuel consumption by generator.

Processing:

Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.

Source:

EIA Form 923 -- Power Plant Operations Report (Schedule 3)

Primary key:

report_date, plant_id_eia, generator_id

Usage Warnings

  • Contains estimated values.

  • The downscaling process used to create this table does not distinguish between primary and secondary energy_sources for generators (see below for implications).

  • Date column arbitrarily uses the first of the month.

  • A small number of respondents only report annual fuel consumption, and all of it is reported in December.

  • Data has been drawn from several EIA sources which are not always consistent with each other, and PUDL chooses the most consistent or relevant value to facilitate cross-referencing even if that means some values will differ from the raw sources. See Harvesting for details.

  • Contains information from multiple raw inputs.

Additional Details

Based on allocating net electricity generation and fuel consumption reported in the EIA-923 generation and generation_fuel tables to individual generators.

The net generation and fuel consumption allocation method PUDL employs begins with the following context of the originally reported EIA-860 and EIA-923 data:

  • The core_eia923__monthly_generation_fuel table is the authoritative source of information about how much generation and fuel consumption is attributable to an entire plant. This table has the most complete data coverage, but it is not the most granular data reported.

  • The core_eia923__monthly_generation table contains the most granular net generation data. It is reported at the plant_id_eia, generator_id and report_date level. This table includes only ~40% of the total MWhs reported in the core_eia923__monthly_generation_fuel table.

  • The core_eia923__monthly_boiler_fuel table contains the most granular fuel consumption data. It is reported at the boiler/prime mover/energy source level. This table includes only ~40% of the total MMBTUs reported in the core_eia923__monthly_generation_fuel table.

  • The core_eia860__scd_generators table provides an exhaustive list of all generators whose generation is being reported in the core_eia923__monthly_generation_fuel table.

In this table, PUDL aggregates the net generation and fuel consumption that has been allocated to the generator_id/energy_source_code/prime_mover_code level in the out_eia923__yearly_generation_fuel_by_generator_energy_source table to the generator level. This process does not distinguish between primary and secondary energy_sources for generators. Net generation is allocated equally between energy source codes, so if a plant has multiple generators with the same prime_mover_code but different energy source codes the core_eia923__monthly_generation_fuel records will be associated similarly between these two generators. Allocated net generation will still be proportional to each generator's net generation or capacity.
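The aggregation step amounts to summing the allocated values over energy source and prime mover for each generator and date. The sketch below illustrates this under the assumption that the value columns contain no nulls; the function name is hypothetical and the records are invented.

```python
from collections import defaultdict

KEY_COLUMNS = ("report_date", "plant_id_eia", "generator_id")
VALUE_COLUMNS = (
    "net_generation_mwh",
    "fuel_consumed_mmbtu",
    "fuel_consumed_for_electricity_mmbtu",
)


def aggregate_to_generator(records):
    """Sum generator/energy-source/prime-mover records up to the generator level."""
    totals = defaultdict(lambda: dict.fromkeys(VALUE_COLUMNS, 0.0))
    for record in records:
        key = tuple(record[col] for col in KEY_COLUMNS)
        for col in VALUE_COLUMNS:
            totals[key][col] += record[col]
    return dict(totals)


# Illustrative records: one generator burning two fuels in the same year
records = [
    {"report_date": "2020-01-01", "plant_id_eia": 3, "generator_id": "1",
     "prime_mover_code": "ST", "energy_source_code": "BIT",
     "net_generation_mwh": 100.0, "fuel_consumed_mmbtu": 1000.0,
     "fuel_consumed_for_electricity_mmbtu": 1000.0},
    {"report_date": "2020-01-01", "plant_id_eia": 3, "generator_id": "1",
     "prime_mover_code": "ST", "energy_source_code": "NG",
     "net_generation_mwh": 50.0, "fuel_consumed_mmbtu": 600.0,
     "fuel_consumed_for_electricity_mmbtu": 500.0},
]
by_generator = aggregate_to_generator(records)
```

The result has one entry per report_date, plant_id_eia and generator_id, which matches this table's primary key.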

This table inherits canonicalized values for generators, plants, and utilities. EIA reports many attributes in many different tables across EIA-860 and EIA-923. In order to compile tidy, well-normalized database tables, PUDL collects all instances of these values and chooses a canonical value. By default, PUDL chooses the most consistently reported value of a given attribute as long as it accounts for at least 70% of the instances reported. If an attribute was reported inconsistently across the original EIA tables, then it will show up as a null value.

Columns
report_date

Date reported.

plant_id_eia

The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.

plant_id_pudl

A manually assigned PUDL plant ID. May not be constant over time.

plant_name_eia

Plant name.

utility_id_eia

The EIA Utility Identification number.

utility_id_pudl

A manually assigned PUDL utility ID. May not be stable over time.

utility_name_eia

The name of the utility.

generator_id

Generator ID is usually numeric, but sometimes includes letters. Make sure you treat it as a string!

unit_id_pudl

Dynamically assigned PUDL unit id. WARNING: This ID is not guaranteed to be static long term as the input data and algorithm may evolve over time.

fuel_consumed_for_electricity_mmbtu

Total consumption of fuel to produce electricity, in physical unit, year to date.

fuel_consumed_mmbtu

Total consumption of fuel in physical unit, year to date. Note: this is the total quantity consumed for both electricity and, in the case of combined heat and power plants, process steam production.

net_generation_mwh

Net electricity generation for the specified period in megawatt-hours (MWh).

out_eia923__yearly_generation_fuel_by_generator_energy_source

package: pudl

Annual time series of estimated net generation and fuel consumption associated with each combination of generator, energy source, and prime mover.

Processing:

Data is ready for use in analyses, but for practical reasons has not been denormalized and remains in narrow format.

Source:

EIA Form 923 -- Power Plant Operations Report (Schedule 3)

Primary key:

report_date, plant_id_eia, generator_id, prime_mover_code, energy_source_code

Usage Warnings

  • Contains estimated values.

  • The downscaling process used to create this table does not distinguish between primary and secondary energy_sources for generators (see below for implications).

  • Date column arbitrarily uses the first of the month.

  • A small number of respondents only report annual fuel consumption, and all of it is reported in December.

  • Contains information from multiple raw inputs.

Additional Details

The net generation and fuel consumption allocation method PUDL employs begins with the following context of the originally reported EIA-860 and EIA-923 data:

  • The core_eia923__monthly_generation_fuel table is the authoritative source of information about how much generation and fuel consumption is attributable to an entire plant. This table has the most complete data coverage, but it is not the most granular data reported.

  • The core_eia923__monthly_generation table contains the most granular net generation data. It is reported at the plant_id_eia, generator_id and report_date level. This table includes only ~40% of the total MWhs reported in the core_eia923__monthly_generation_fuel table.

  • The core_eia923__monthly_boiler_fuel table contains the most granular fuel consumption data. It is reported at the boiler/prime mover/energy source level. This table includes only ~40% of the total MMBTUs reported in the core_eia923__monthly_generation_fuel table.

  • The core_eia860__scd_generators table provides an exhaustive list of all generators whose generation is being reported in the core_eia923__monthly_generation_fuel table.

In this table, PUDL has allocated the net electricity generation and fuel consumption from core_eia923__monthly_generation_fuel to the generator_id/energy_source_code/prime_mover_code level.

The allocation process entails generating a fraction for each record based on the net generation in the core_eia923__monthly_generation table and the capacity from the core_eia860__scd_generators table. This process does not distinguish between primary and secondary energy_sources for generators. Net generation is allocated equally between energy source codes, so if a plant has multiple generators with the same prime_mover_code but different energy source codes the core_eia923__monthly_generation_fuel records will be associated similarly between these two generators. Allocated net generation will still be proportional to each generator's net generation or capacity.

Columns
report_date

Date reported.

plant_id_eia

The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.

generator_id

Generator ID is usually numeric, but sometimes includes letters. Make sure you treat it as a string!

prime_mover_code

Code for the type of prime mover (e.g. CT, CG)

energy_source_code

A 2-3 letter code indicating the energy source (e.g. fuel type) associated with the record.

energy_source_code_num

Name of the energy_source_code_N column that this energy source code was reported in for the generator referenced in the same record.

net_generation_mwh

Net electricity generation for the specified period in megawatt-hours (MWh).

fuel_consumed_mmbtu

Total consumption of fuel in physical unit, year to date. Note: this is the total quantity consumed for both electricity and, in the case of combined heat and power plants, process steam production.

fuel_consumed_for_electricity_mmbtu

Total consumption of fuel to produce electricity, in physical unit, year to date.

out_eia923__yearly_generation_fuel_by_generator_energy_source_owner

package: pudl

Annual time series of estimated net generation and fuel consumption associated with each combination of generator, energy source, prime mover, and owner.

Processing:

Data is ready for use in analyses, but for practical reasons has not been denormalized and remains in narrow format.

Source:

EIA Form 923 -- Power Plant Operations Report (Schedule 3)

Primary key:

report_date, plant_id_eia, generator_id, prime_mover_code, energy_source_code, utility_id_eia, ownership_record_type

Usage Warnings

  • Contains estimated values.

  • The downscaling process used to create this table does not distinguish between primary and secondary energy_sources for generators (see below for implications).

  • Date column arbitrarily uses the first of the month.

  • Data has been drawn from several EIA sources which are not always consistent with each other, and PUDL chooses the most consistent or relevant value to facilitate cross-referencing even if that means some values will differ from the raw sources. See Harvesting for details.

  • Contains information from multiple raw inputs.

Additional Details

The net generation and fuel consumption allocation method PUDL employs begins with the following context of the originally reported EIA-860 and EIA-923 data:

  • The core_eia923__monthly_generation_fuel table is the authoritative source of information about how much generation and fuel consumption is attributable to an entire plant. This table has the most complete data coverage, but it is not the most granular data reported.

  • The core_eia923__monthly_generation table contains the most granular net generation data. It is reported at the plant_id_eia, generator_id and report_date level. This table includes only ~40% of the total MWhs reported in the core_eia923__monthly_generation_fuel table.

  • The core_eia923__monthly_boiler_fuel table contains the most granular fuel consumption data. It is reported at the boiler/prime mover/energy source level. This table includes only ~40% of the total MMBTUs reported in the core_eia923__monthly_generation_fuel table.

  • The core_eia860__scd_generators table provides an exhaustive list of all generators whose generation is being reported in the core_eia923__monthly_generation_fuel table.

In this table, PUDL has scaled the net electricity generation and fuel consumption from core_eia923__monthly_generation_fuel by ownership from out_eia860__yearly_ownership. Note that the utility_id_eia in this table refers to the OWNER of the generator, not the operator.
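Scaling by ownership can be pictured as multiplying each allocated value by the owner's fraction_owned. The sketch below is illustrative only: the function name is hypothetical, it covers only per-owner rows (not the records that represent a total of all ownerships), and the utility IDs are invented.

```python
VALUE_COLUMNS = (
    "net_generation_mwh",
    "fuel_consumed_mmbtu",
    "fuel_consumed_for_electricity_mmbtu",
)


def scale_by_ownership(generator_record, owners):
    """Return one scaled copy of generator_record per owner."""
    scaled = []
    for owner in owners:
        row = dict(generator_record)
        # utility_id_eia identifies the OWNER here, not the operator
        row["utility_id_eia"] = owner["utility_id_eia"]
        row["fraction_owned"] = owner["fraction_owned"]
        for col in VALUE_COLUMNS:
            row[col] = generator_record[col] * owner["fraction_owned"]
        scaled.append(row)
    return scaled


# Illustrative values only, not real EIA data
record = {
    "report_date": "2020-01-01", "plant_id_eia": 3, "generator_id": "1",
    "net_generation_mwh": 1000.0, "fuel_consumed_mmbtu": 8000.0,
    "fuel_consumed_for_electricity_mmbtu": 8000.0,
}
owners = [
    {"utility_id_eia": 111, "fraction_owned": 0.75},
    {"utility_id_eia": 222, "fraction_owned": 0.25},
]
owner_rows = scale_by_ownership(record, owners)
```

When the ownership fractions sum to 1.0, the per-owner values sum back to the generator-level values.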

This process does not distinguish between primary and secondary energy_sources for generators. Net generation is allocated equally between energy source codes, so if a plant has multiple generators with the same prime_mover_code but different energy source codes the core_eia923__monthly_generation_fuel records will be associated similarly between these two generators. Allocated net generation will still be proportional to each generator's net generation or capacity.

This table inherits canonicalized values for generators and plants. EIA reports many attributes in many different tables across EIA-860 and EIA-923. In order to compile tidy, well-normalized database tables, PUDL collects all instances of these values and chooses a canonical value. By default, PUDL chooses the most consistently reported value of a given attribute as long as it accounts for at least 70% of the instances reported. If an attribute was reported inconsistently across the original EIA tables, then it will show up as a null value.

Columns
report_date

Date reported.

plant_id_eia

The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.

generator_id

Generator ID is usually numeric, but sometimes includes letters. Make sure you treat it as a string!

prime_mover_code

Code for the type of prime mover (e.g. CT, CG)

energy_source_code

A 2-3 letter code indicating the energy source (e.g. fuel type) associated with the record.

utility_id_eia

The EIA Utility Identification number.

ownership_record_type

Whether each generator record is for one owner or represents a total of all ownerships.

fraction_owned

Proportion of generator ownership attributable to this utility.

capacity_mw

Total installed (nameplate) capacity, in megawatts.

energy_source_code_num

Name of the energy_source_code_N column that this energy source code was reported in for the generator referenced in the same record.

net_generation_mwh

Net electricity generation for the specified period in megawatt-hours (MWh).

fuel_consumed_mmbtu

Total consumption of fuel in physical unit, year to date. Note: this is the total quantity consumed for both electricity and, in the case of combined heat and power plants, process steam production.

fuel_consumed_for_electricity_mmbtu

Total consumption of fuel to produce electricity, in physical unit, year to date.

out_eia923__yearly_generation_fuel_combined

package: pudl

Annual time series of generation and fuel consumption of all generation units.

Processing:

Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.

Source:

EIA Form 923 -- Power Plant Operations Report (Schedule 3)

Primary key:

plant_id_eia, report_date, prime_mover_code, energy_source_code

Usage Warnings

  • Date column arbitrarily uses the first of the month.

  • A small number of respondents only report annual fuel consumption, and all of it is reported in December.

  • Data has been drawn from several EIA sources which are not always consistent with each other, and PUDL chooses the most consistent or relevant value to facilitate cross-referencing even if that means some values will differ from the raw sources. See Harvesting for details.

Additional Details

Denormalized, combined data from the core_eia923__monthly_generation_fuel and core_eia923__monthly_generation_fuel_nuclear tables, with nuclear generation aggregated from the nuclear generation unit level up to the plant prime mover level, so as to be compatible with fossil fuel generation data.

This data can't be easily linked to individual boilers, generators, and generation units, but it provides the most complete coverage of fuel consumption and electricity generation for the entire generation fleet. We use the primary fuels and prime movers reported for each generator along with their capacities to attribute fuel consumption and generation when it isn't directly reported in the core_eia923__monthly_generation and core_eia923__monthly_boiler_fuel tables in order to calculate capacity factors, heat rates, and the marginal cost of electricity.

The table makes a distinction between all fuel consumed and fuel consumed for electricity generation because some units are also combined heat and power (CHP) units, and also provide high temperature process heat at the expense of net electricity generation.
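To see why the distinction matters, consider a rough heat rate calculation for a CHP unit. The sketch below is illustrative only: the helper name and the record values are hypothetical, and it simply divides fuel heat input by net generation rather than reproducing any PUDL calculation.

```python
def heat_rate_mmbtu_per_mwh(fuel_mmbtu, net_generation_mwh):
    """Fuel heat input per unit of net generation; None if undefined."""
    if fuel_mmbtu is None or not net_generation_mwh:
        return None
    return fuel_mmbtu / net_generation_mwh


# Hypothetical CHP record: part of the fuel goes to process heat.
record = {
    "fuel_consumed_mmbtu": 12_000.0,
    "fuel_consumed_for_electricity_mmbtu": 9_000.0,
    "net_generation_mwh": 1_000.0,
}

electric_only = heat_rate_mmbtu_per_mwh(
    record["fuel_consumed_for_electricity_mmbtu"], record["net_generation_mwh"]
)
all_fuel = heat_rate_mmbtu_per_mwh(
    record["fuel_consumed_mmbtu"], record["net_generation_mwh"]
)

print(electric_only, all_fuel)
```

Using total fuel for a CHP unit overstates the heat rate of its electricity production, because some of that fuel was never intended to produce electricity.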

This table inherits canonicalized values for plants and utilities. EIA reports many attributes in many different tables across EIA-860 and EIA-923. In order to compile tidy, well-normalized database tables, PUDL collects all instances of these values and chooses a canonical value. By default, PUDL chooses the most consistently reported value of a given attribute as long as it accounts for at least 70% of the reported instances. If an attribute was reported inconsistently across the original EIA tables, then it will show up as a null value.

Columns
report_date

Date reported.

plant_id_eia

The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.

plant_id_pudl

A manually assigned PUDL plant ID. May not be constant over time.

plant_name_eia

Plant name.

utility_id_eia

The EIA Utility Identification number.

utility_id_pudl

A manually assigned PUDL utility ID. May not be stable over time.

utility_name_eia

The name of the utility.

energy_source_code

A 2-3 letter code indicating the energy source (e.g. fuel type) associated with the record.

fuel_type_code_pudl

Simplified fuel type code used in PUDL

prime_mover_code

Code for the type of prime mover (e.g. CT, CG)

fuel_consumed_units

Consumption of the fuel type in physical unit. Note: this is the total quantity consumed for both electricity and, in the case of combined heat and power plants, process steam production.

fuel_consumed_for_electricity_units

Consumption for electric generation of the fuel type in physical unit.

fuel_mmbtu_per_unit

Heat content of the fuel in millions of Btus per physical unit.

fuel_consumed_mmbtu

Total consumption of fuel in physical unit, year to date. Note: this is the total quantity consumed for both electricity and, in the case of combined heat and power plants, process steam production.

fuel_consumed_for_electricity_mmbtu

Total consumption of fuel to produce electricity, in physical unit, year to date.

net_generation_mwh

Net electricity generation for the specified period in megawatt-hours (MWh).

data_maturity

Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.

out_eia930__hourly_aggregated_demand

package: pudl

Hourly time series of aggregated balancing authority demand by EIA region, interconnect, and continental US.

Processing:

Data is ready for use in analyses, but for practical reasons has not been denormalized and remains in narrow format.

Source:

EIA Form 930 -- Hourly and Daily Balancing Authority Operations Report

Primary key:

datetime_utc, aggregation_level, aggregation_group

Usage Warnings

  • Some columns contain subtotals; use caution when choosing columns to aggregate.

  • This table is experimental and/or a work in progress and may change in the future.

Additional Details

The spatial granularity of each record is indicated by aggregation_level.
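Because regions, interconnects and the continental US overlap, summing demand across every row counts the same load more than once. The sketch below filters to a single aggregation_level before summing. The level and group labels ("region", "interconnect", "A", "B", "X") are hypothetical placeholders, not the values used in the table.

```python
from collections import defaultdict

# Hypothetical rows; the level and group labels are illustrative only.
rows = [
    {"datetime_utc": "2024-01-01T00:00", "aggregation_level": "region",
     "aggregation_group": "A", "demand_imputed_pudl_mwh": 40.0},
    {"datetime_utc": "2024-01-01T00:00", "aggregation_level": "region",
     "aggregation_group": "B", "demand_imputed_pudl_mwh": 60.0},
    {"datetime_utc": "2024-01-01T00:00", "aggregation_level": "interconnect",
     "aggregation_group": "X", "demand_imputed_pudl_mwh": 100.0},
]


def total_by_hour(rows, level):
    """Sum demand per hour using only rows at one aggregation level."""
    totals = defaultdict(float)
    for row in rows:
        if row["aggregation_level"] == level:
            totals[row["datetime_utc"]] += row["demand_imputed_pudl_mwh"]
    return dict(totals)


# Summing every row mixes levels and double counts the same demand.
naive_total = sum(row["demand_imputed_pudl_mwh"] for row in rows)
region_total = total_by_hour(rows, "region")

print(naive_total, region_total)
```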

Columns
datetime_utc

Date and time converted to Coordinated Universal Time (UTC).

aggregation_level

Indicates the spatial granularity of the aggregated value.

aggregation_group

Label identifying a group of balancing authorities to be used in aggregating demand, e.g. a region of the US or a whole interconnect.

demand_imputed_pudl_mwh

Electricity demand calculated by subtracting BA interchange from net generation, with outliers and missing values imputed in PUDL.

out_eia930__hourly_operations

package: pudl

Hourly time series of balancing authority net generation, interchange, and demand with imputed demand.

Processing:

Data is ready for use in analyses, but for practical reasons has not been denormalized and remains in narrow format.

Source:

EIA Form 930 -- Hourly and Daily Balancing Authority Operations Report

Primary key:

datetime_utc, balancing_authority_code_eia

Usage Warnings

  • Contains rows where missing values were imputed.

  • This table is experimental and/or a work in progress and may change in the future.

Additional Details

This table is based on core_eia930__hourly_operations, but adds imputed demand where the original data was missing or anomalous. Codes explaining why values have been imputed can be found in the core_pudl__codes_imputation_reasons table.

Columns
datetime_utc

Date and time converted to Coordinated Universal Time (UTC).

balancing_authority_code_eia

EIA short code identifying a balancing authority. May include Canadian and Mexican BAs.

net_generation_reported_mwh

Unaltered originally reported net generation for the specified period.

net_generation_adjusted_mwh

Reported net generation adjusted by EIA to reflect non-physical commercial transfers through pseudo-ties and dynamic scheduling.

net_generation_imputed_eia_mwh

Reported net generation with outlying values removed and missing values imputed by EIA.

interchange_reported_mwh

Original reported energy interchange between adjacent balancing authorities.

interchange_adjusted_mwh

Energy interchange between adjacent balancing authorities, adjusted by EIA to reflect non-physical commercial transfers through pseudo-ties and dynamic scheduling.

interchange_imputed_eia_mwh

Energy interchange between adjacent balancing authorities, with outliers and missing values imputed by EIA.

demand_reported_mwh

Originally reported electricity demand, calculated by taking the net generation within the BA and subtracting the interchange with adjacent BAs.

demand_adjusted_mwh

Electricity demand adjusted by EIA to reflect non-physical commercial transfers through pseudo-ties and dynamic scheduling.

demand_imputed_pudl_mwh

Electricity demand calculated by subtracting BA interchange from net generation, with outliers and missing values imputed in PUDL.

demand_imputed_pudl_mwh_imputation_code

Code describing why a demand value was flagged for imputation.

demand_imputed_eia_mwh

Electricity demand calculated by subtracting BA interchange from net generation, with outliers and missing values imputed by EIA.

demand_forecast_mwh

Day ahead demand forecast.
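Several of the demand columns above are described as net generation minus interchange. A minimal sketch of that arithmetic follows. The function name and the numbers are hypothetical, and the sign convention (positive interchange meaning a net outflow from the balancing authority) is an assumption you should confirm against the EIA-930 documentation.

```python
def demand_from_components(net_generation_mwh, interchange_mwh):
    """Demand as net generation minus interchange; None if either is missing.

    Assumption: positive interchange is a net outflow (export) from the BA.
    """
    if net_generation_mwh is None or interchange_mwh is None:
        return None
    return net_generation_mwh - interchange_mwh


# Hypothetical hours for one balancing authority.
exporting_hour = demand_from_components(500.0, 50.0)   # exports reduce local demand
importing_hour = demand_from_components(500.0, -30.0)  # imports add to local demand
missing_hour = demand_from_components(None, 10.0)

print(exporting_hour, importing_hour, missing_hour)
```

Hours where a component is missing yield no value in this sketch. Those are the kinds of gaps the imputed columns are meant to fill.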

out_eia930__hourly_subregion_demand

package: pudl

Hourly time series of balancing authority subregion demand with imputed demand.

Processing:

Data is ready for use in analyses, but for practical reasons has not been denormalized and remains in narrow format.

Source:

EIA Form 930 -- Hourly and Daily Balancing Authority Operations Report

Primary key:

datetime_utc, balancing_authority_code_eia, balancing_authority_subregion_code_eia

Usage Warnings

  • Contains rows where missing values were imputed.

  • This table is experimental and/or a work in progress and may change in the future.

Additional Details

This table is based on core_eia930__hourly_subregion_demand, but adds imputed demand where the original data was missing or anomalous. Codes explaining why values have been imputed can be found in the core_pudl__codes_imputation_reasons table.

Columns
datetime_utc

Date and time converted to Coordinated Universal Time (UTC).

balancing_authority_code_eia

EIA short code identifying a balancing authority. May include Canadian and Mexican BAs.

balancing_authority_subregion_code_eia

Code identifying subregions of larger balancing authorities.

demand_reported_mwh

Originally reported electricity demand, calculated by taking the net generation within the BA and subtracting the interchange with adjacent BAs.

demand_imputed_pudl_mwh

Electricity demand calculated by subtracting BA interchange from net generation, with outliers and missing values imputed in PUDL.

demand_imputed_pudl_mwh_imputation_code

Code describing why a demand value was flagged for imputation.

out_eia__yearly_assn_plant_parts_plant_gen

package: pudl

Annual Association table providing connections between plant parts (identified as record_id_eia) and 'plant_gen' records (identified as record_id_eia_plant_gen) from the out_eia__yearly_plant_parts table.

Processing:

Data is ready for use in analyses, but for practical reasons has not been denormalized and remains in narrow format.

Source:

EIA -- Mix of multiple EIA Forms

Primary key:

record_id_eia, record_id_eia_plant_gen

Usage Warnings

  • Some columns contain subtotals; use caution when choosing columns to aggregate.

  • Contains information from multiple raw inputs.

Additional Details

The purpose of this table is to help determine which generator records are associated with each plant part record. The plant part records have a one-to-many relationship to the associated 'plant_gen' records. The 'plant_gen' records are also included in the plant part records, so for these records there is a one-to-one relationship and the record_id_eia and the record_id_eia_plant_gen are the same.

All of the columns in this table that have a suffix of '_plant_gen' are attributes of the record_id_eia_plant_gen record.
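The relationship described above can be sketched as a simple grouping. The record IDs below are made up and do not follow the real record_id_eia format. The point is only to show a larger plant part mapping to several 'plant_gen' records, while each 'plant_gen' record maps to itself.

```python
from collections import defaultdict

# Hypothetical (record_id_eia, record_id_eia_plant_gen) pairs.
assn = [
    ("plant_1_unit_A", "plant_1_gen_1"),
    ("plant_1_unit_A", "plant_1_gen_2"),
    ("plant_1_gen_1", "plant_1_gen_1"),
    ("plant_1_gen_2", "plant_1_gen_2"),
]

# Collect the generator-level records associated with each plant part.
gens_by_part = defaultdict(list)
for record_id_eia, record_id_eia_plant_gen in assn:
    gens_by_part[record_id_eia].append(record_id_eia_plant_gen)

# Counting them gives a value analogous to the generators_number column.
generators_number = {part: len(gens) for part, gens in gens_by_part.items()}

print(generators_number)
```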

Columns
record_id_eia

Identifier for EIA plant parts analysis records.

record_id_eia_plant_gen

Identifier for EIA plant parts analysis records which is at the plant_part level of plant_gen - meaning each record pertains to one generator.

report_date

Date reported.

plant_id_eia

The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.

utility_id_eia

The EIA Utility Identification number.

ownership_record_type

Whether each generator record is for one owner or represents a total of all ownerships.

generator_id_plant_gen

Generator ID of the record_id_eia_plant_gen record. This is usually numeric, but sometimes includes letters. Make sure you treat it as a string!

energy_source_code_1_plant_gen

Code representing the most predominant type of energy that fuels the record_id_eia_plant_gen's generator.

prime_mover_code_plant_gen

Code for the type of prime mover (e.g. CT, CG) associated with the record_id_eia_plant_gen.

unit_id_pudl_plant_gen

Dynamically assigned PUDL unit id of the record_id_eia_plant_gen. WARNING: This ID is not guaranteed to be static long term as the input data and algorithm may evolve over time.

technology_description_plant_gen

High level description of the technology used by the record_id_eia_plant_gen's generator to produce electricity.

ferc_acct_name_plant_gen

Name of FERC account, derived from technology description and prime mover code. This name is associated with the record_id_eia_plant_gen record.

ferc1_generator_agg_id_plant_gen

ID dynamically assigned by PUDL to EIA records with multiple matches to a single FERC ID in the FERC-EIA manual matching process. This ID is associated with the record_id_eia_plant_gen record. It depends on ferc1_generator_agg_id, which has not been updated since 2020, but only affects a couple hundred records total across all years.

generator_operating_year_plant_gen

The year an associated plant_gen's generator went into service.

operational_status_pudl_plant_gen

The operating status of the asset using PUDL categories of the record_id_eia_plant_gen record.

generators_number

The number of generators associated with each record_id_eia.

out_eia__yearly_boilers

package: pudl

Annual time series of all EIA boiler attributes.

Processing:

Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.

Source:

EIA -- Mix of multiple EIA Forms

Primary key:

plant_id_eia, boiler_id, report_date

Usage Warnings

  • Data has been drawn from several EIA sources which are not always consistent with each other, and PUDL chooses the most consistent or relevant value to facilitate cross-referencing even if that means some values will differ from the raw sources. See Harvesting for details.

  • Contains information from multiple raw inputs.

Additional Details

This table inherits canonicalized values for boilers, plants, and utilities. EIA reports many attributes in many different tables across EIA-860 and EIA-923. In order to compile tidy, well-normalized database tables, PUDL collects all instances of these values and chooses a canonical value. By default, PUDL chooses the most consistently reported value of a given attribute as long as it accounts for at least 70% of the reported instances. If an attribute was reported inconsistently across the original EIA tables, then it will show up as a null value.

Columns
report_date

Date reported.

plant_id_eia

The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.

plant_id_pudl

A manually assigned PUDL plant ID. May not be constant over time.

plant_name_eia

Plant name.

utility_id_eia

The EIA Utility Identification number.

utility_id_pudl

A manually assigned PUDL utility ID. May not be stable over time.

utility_name_eia

The name of the utility.

boiler_id

Alphanumeric boiler ID.

air_flow_100pct_load_cubic_feet_per_minute

Total air flow including excess air at 100 percent load, reported at standard temperature and pressure (i.e. 68 F and one atmosphere pressure).

boiler_fuel_code_1

The code representing the most predominant type of energy that fuels the boiler.

boiler_fuel_code_2

The code representing the second most predominant type of energy that fuels the boiler.

boiler_fuel_code_3

The code representing the third most predominant type of energy that fuels the boiler.

boiler_fuel_code_4

The code representing the fourth most predominant type of energy that fuels the boiler.

boiler_manufacturer

Name of boiler manufacturer.

boiler_manufacturer_code

EIA short code for boiler manufacturer.

boiler_operating_date

Date the boiler began or is planned to begin commercial operation.

boiler_retirement_date

Date of the scheduled or effected retirement of the boiler.

boiler_status

EIA short code identifying boiler operational status.

boiler_type

EIA short code indicating the standards under which the boiler is operating as described in the U.S. EPA regulation under 40 CFR.

city

Name of the city.

compliance_year_mercury

Year boiler was or is expected to be in compliance with federal, state and/or local regulations for mercury emissions.

compliance_year_nox

Year boiler was or is expected to be in compliance with federal, state and/or local regulations for nitrogen oxide emissions.

compliance_year_particulate

Year boiler was or is expected to be in compliance with federal, state and/or local regulations for particulate matter emissions.

compliance_year_so2

Year boiler was or is expected to be in compliance with federal, state and/or local regulations for sulfur dioxide emissions.

county

County name.

data_maturity

Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.

efficiency_100pct_load

Boiler efficiency percentage when burning at 100 percent load to the nearest 0.1 percent.

efficiency_50pct_load

Boiler efficiency percentage when burning at 50 percent load to the nearest 0.1 percent.

firing_rate_using_coal_tons_per_hour

Design firing rate at maximum continuous steam flow for coal to the nearest 0.1 ton per hour.

firing_rate_using_gas_mcf_per_hour

Design firing rate at maximum continuous steam flow for gas to the nearest 0.1 thousand cubic feet per hour.

firing_rate_using_oil_bbls_per_hour

Design firing rate at maximum continuous steam flow for petroleum to the nearest 0.1 barrels per hour.

firing_rate_using_other_fuels

Design firing rate at maximum continuous steam flow for energy sources other than coal, petroleum, or natural gas.

firing_type_1

EIA short code indicating the type of firing used by this boiler.

firing_type_2

EIA short code indicating the type of firing used by this boiler.

firing_type_3

EIA short code indicating the type of firing used by this boiler.

fly_ash_reinjection

Indicates whether the boiler is capable of re-injecting fly ash.

hrsg

Indicates whether the boiler is a heat recovery steam generator (HRSG).

latitude

Latitude of the plant's location, in degrees.

longitude

Longitude of the plant's location, in degrees.

max_steam_flow_lbs_per_hour

Maximum continuous steam flow at 100 percent load.

mercury_control_existing_strategy_1

Existing strategy to comply with the most stringent mercury regulation.

mercury_control_existing_strategy_2

Existing strategy to comply with the most stringent mercury regulation.

mercury_control_existing_strategy_3

Existing strategy to comply with the most stringent mercury regulation.

mercury_control_existing_strategy_4

Existing strategy to comply with the most stringent mercury regulation.

mercury_control_existing_strategy_5

Existing strategy to comply with the most stringent mercury regulation.

mercury_control_existing_strategy_6

Existing strategy to comply with the most stringent mercury regulation.

mercury_control_proposed_strategy_1

Proposed strategy to comply with the most stringent mercury regulation.

mercury_control_proposed_strategy_2

Proposed strategy to comply with the most stringent mercury regulation.

mercury_control_proposed_strategy_3

Proposed strategy to comply with the most stringent mercury regulation.

new_source_review

Indicates whether the boiler is subject to New Source Review requirements.

new_source_review_date

Month of issued New Source Review permit.

new_source_review_permit

New Source Review permit number.

nox_control_existing_caaa_compliance_strategy_1

Existing strategies to meet the nitrogen oxide requirements of Title IV of the Clean Air Act Amendment of 1990.

nox_control_existing_caaa_compliance_strategy_2

Existing strategies to meet the nitrogen oxide requirements of Title IV of the Clean Air Act Amendment of 1990.

nox_control_existing_caaa_compliance_strategy_3

Existing strategies to meet the nitrogen oxide requirements of Title IV of the Clean Air Act Amendment of 1990.

nox_control_existing_strategy_1

Existing strategy to comply with the most stringent nitrogen oxide regulation.

nox_control_existing_strategy_2

Existing strategy to comply with the most stringent nitrogen oxide regulation.

nox_control_existing_strategy_3

Existing strategy to comply with the most stringent nitrogen oxide regulation.

nox_control_manufacturer

Name of nitrogen oxide control manufacturer.

nox_control_manufacturer_code

Code indicating the nitrogen oxide control burner manufacturer.

nox_control_out_of_compliance_strategy_1

If boiler is not in compliance with nitrogen oxide regulations, strategy for compliance.

nox_control_out_of_compliance_strategy_2

If boiler is not in compliance with nitrogen oxide regulations, strategy for compliance.

nox_control_out_of_compliance_strategy_3

If boiler is not in compliance with nitrogen oxide regulations, strategy for compliance.

nox_control_planned_caaa_compliance_strategy_1

Planned strategies to meet the nitrogen oxide requirements of Title IV of the Clean Air Act Amendment of 1990.

nox_control_planned_caaa_compliance_strategy_2

Planned strategies to meet the nitrogen oxide requirements of Title IV of the Clean Air Act Amendment of 1990.

nox_control_planned_caaa_compliance_strategy_3

Planned strategies to meet the nitrogen oxide requirements of Title IV of the Clean Air Act Amendment of 1990.

nox_control_proposed_strategy_1

Proposed strategy to comply with the most stringent nitrogen oxide regulation.

nox_control_proposed_strategy_2

Proposed strategy to comply with the most stringent nitrogen oxide regulation.

nox_control_proposed_strategy_3

Proposed strategy to comply with the most stringent nitrogen oxide regulation.

nox_control_status_code

Nitrogen oxide control status code.

particulate_control_out_of_compliance_strategy_1

If boiler is not in compliance with particulate matter regulations, strategy for compliance.

particulate_control_out_of_compliance_strategy_2

If boiler is not in compliance with particulate matter regulations, strategy for compliance.

particulate_control_out_of_compliance_strategy_3

If boiler is not in compliance with particulate matter regulations, strategy for compliance.

regulation_mercury

Most stringent type of statute or regulation code under which the boiler is operating for mercury control standards.

regulation_nox

EIA short code for most stringent type of statute or regulation code under which the boiler is operating for nitrogen oxide control standards.

regulation_particulate

EIA short code for most stringent type of statute or regulation code under which the boiler is operating for particulate matter control standards.

regulation_so2

EIA short code for most stringent type of statute or regulation code under which the boiler is operating for sulfur dioxide control standards.

so2_control_existing_caaa_compliance_strategy_1

Existing strategies to meet the sulfur dioxide requirements of Title IV of the Clean Air Act Amendment of 1990.

so2_control_existing_caaa_compliance_strategy_2

Existing strategies to meet the sulfur dioxide requirements of Title IV of the Clean Air Act Amendment of 1990.

so2_control_existing_caaa_compliance_strategy_3

Existing strategies to meet the sulfur dioxide requirements of Title IV of the Clean Air Act Amendment of 1990.

so2_control_existing_strategy_1

Existing strategy to comply with the most stringent sulfur dioxide regulation.

so2_control_existing_strategy_2

Existing strategy to comply with the most stringent sulfur dioxide regulation.

so2_control_existing_strategy_3

Existing strategy to comply with the most stringent sulfur dioxide regulation.

so2_control_out_of_compliance_strategy_1

If boiler is not in compliance with sulfur dioxide regulations, strategy for compliance.

so2_control_out_of_compliance_strategy_2

If boiler is not in compliance with sulfur dioxide regulations, strategy for compliance.

so2_control_out_of_compliance_strategy_3

If boiler is not in compliance with sulfur dioxide regulations, strategy for compliance.

so2_control_planned_caaa_compliance_strategy_1

Planned strategies to meet the sulfur dioxide requirements of Title IV of the Clean Air Act Amendment of 1990.

so2_control_planned_caaa_compliance_strategy_2

Planned strategies to meet the sulfur dioxide requirements of Title IV of the Clean Air Act Amendment of 1990.

so2_control_planned_caaa_compliance_strategy_3

Planned strategies to meet the sulfur dioxide requirements of Title IV of the Clean Air Act Amendment of 1990.

so2_control_proposed_strategy_1

Proposed strategy to comply with the most stringent sulfur dioxide regulation.

so2_control_proposed_strategy_2

Proposed strategy to comply with the most stringent sulfur dioxide regulation.

so2_control_proposed_strategy_3

Proposed strategy to comply with the most stringent sulfur dioxide regulation.

standard_nox_rate

Numeric value for the unit of measurement specified for nitrogen oxide.

standard_particulate_rate

Numeric value for the unit of measurement specified for particulate matter.

standard_so2_percent_scrubbed

The percent of sulfur dioxide to be scrubbed specified by the most stringent sulfur dioxide regulation.

standard_so2_rate

Numeric value for the unit of measurement specified for sulfur dioxide.

state

Two letter US state abbreviation.

street_address

Physical street address.

timezone

IANA timezone name

turndown_ratio

The turndown ratio for the boiler.

unit_id_pudl

Dynamically assigned PUDL unit id. WARNING: This ID is not guaranteed to be static long term as the input data and algorithm may evolve over time.

unit_nox

Numeric value for the unit of measurement specified for nitrogen oxide.

unit_particulate

Numeric value for the unit of measurement specified for particulate matter.

unit_so2

Numeric value for the unit of measurement specified for sulfur dioxide.

waste_heat_input_mmbtu_per_hour

Design waste-heat input rate at maximum continuous steam flow (MMBtu per hour). A waste-heat boiler is a boiler that receives all or a substantial portion of its energy input from the noncombustible exhaust gases of a separate fuel-burning process.

wet_dry_bottom

Wet or Dry Bottom where Wet Bottom is defined as slag tanks that are installed at furnace throat to contain and remove molten ash from the furnace, and Dry Bottom is defined as having no slag tanks at furnace throat area, throat area is clear, and bottom ash drops through throat to bottom ash water hoppers.

zip_code

Five digit US Zip Code.

out_eia__yearly_generators_by_ownership

package: pudl

Annual time series of all EIA generators with ownership integrated.

Processing:

Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.

Source:

EIA -- Mix of multiple EIA Forms

Primary key:

This table has no primary key. This table would have a primary key of the following columns, except there are ~200 records without utility IDs: ['plant_id_eia', 'generator_id', 'report_date', 'utility_id_eia', 'ownership_record_type'].

Usage Warnings

  • Some columns contain subtotals; use caution when choosing columns to aggregate.

  • Data has been drawn from several EIA sources which are not always consistent with each other, and PUDL chooses the most consistent or relevant value to facilitate cross-referencing even if that means some values will differ from the raw sources. See Harvesting for details.

  • Contains information from multiple raw inputs.

Additional Details

Generators that report to EIA can have multiple utility owners. EIA reports the ownership fraction for each of the owners of generators (see core_eia860__scd_ownership). This table includes two records for each generator's owner: one record represents a total ownership stake and the other record represents a proportional ownership stake (the ownership_record_type column labeled as "total" and "owned" respectively). All of the values reported for these generator-owner records are scaled to the ownership stake (represented by fraction_owned).

This table was created to represent every possible version of ownership slices of the EIA generators. It is primarily used as an input to out_eia__yearly_plant_parts.
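The "total" and "owned" records can be pictured with a small sketch. It is not the PUDL implementation: the function name, utility IDs and values are hypothetical, and it assumes the "total" record carries a fraction_owned of 1.0 and that only capacity and net generation are scaled.

```python
def ownership_slices(generator, owners):
    """Return a 'total' and an 'owned' record for each (utility, fraction) pair."""
    records = []
    for utility_id_eia, fraction_owned in owners:
        # Assumption: the 'total' record represents the whole generator (1.0).
        for record_type, fraction in (("total", 1.0), ("owned", fraction_owned)):
            records.append({
                "generator_id": generator["generator_id"],
                "utility_id_eia": utility_id_eia,
                "ownership_record_type": record_type,
                "fraction_owned": fraction,
                "capacity_mw": generator["capacity_mw"] * fraction,
                "net_generation_mwh": generator["net_generation_mwh"] * fraction,
            })
    return records


# Hypothetical generator with two owners.
generator = {"generator_id": "1", "capacity_mw": 100.0, "net_generation_mwh": 400_000.0}
owners = [(111, 0.75), (222, 0.25)]
slices = ownership_slices(generator, owners)

owned_capacity = sum(
    r["capacity_mw"] for r in slices if r["ownership_record_type"] == "owned"
)
all_capacity = sum(r["capacity_mw"] for r in slices)

print(len(slices), owned_capacity, all_capacity)
```

In this sketch, summing capacity over only the "owned" records recovers the generator's capacity, while summing over every record triples it. Filter on ownership_record_type before aggregating.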

This table inherits canonicalized values for generators, plants, and utilities. EIA reports many attributes in many different tables across EIA-860 and EIA-923. In order to compile tidy, well-normalized database tables, PUDL collects all instances of these values and chooses a canonical value. By default, PUDL chooses the most consistently reported value of a given attribute as long as it accounts for at least 70% of the reported instances. If an attribute was reported inconsistently across the original EIA tables, then it will show up as a null value.

Columns
plant_id_eia

The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.

generator_id

Generator ID is usually numeric, but sometimes includes letters. Make sure you treat it as a string!

report_date

Date reported.

unit_id_pudl

Dynamically assigned PUDL unit id. WARNING: This ID is not guaranteed to be static long term as the input data and algorithm may evolve over time.

plant_id_pudl

A manually assigned PUDL plant ID. May not be constant over time.

plant_name_eia

Plant name.

utility_id_eia

The EIA Utility Identification number.

utility_id_pudl

A manually assigned PUDL utility ID. May not be stable over time.

utility_name_eia

The name of the utility.

technology_description

High level description of the technology used by the generator to produce electricity.

energy_source_code_1

The code representing the most predominant type of energy that fuels the generator.

prime_mover_code

Code for the type of prime mover (e.g. CT, CG)

generator_operating_date

Date the generator began commercial operation. If harvested values are inconsistent, we default to using the most recently reported date.

generator_retirement_date

Date of the scheduled or effected retirement of the generator.

operational_status

The operating status of the asset. For generators this is based on the EIA 860 tab in which the generator was listed.

capacity_mw

Total installed (nameplate) capacity, in megawatts.

fuel_type_code_pudl

Simplified fuel type code used in PUDL

planned_generator_retirement_date

Planned effective date of the scheduled retirement of the generator.

capacity_factor

Fraction of potential generation that was actually reported for a plant part.

fuel_cost_per_mmbtu_source

Indicates the source of the values in the fuel_cost_per_mmbtu column. The fuel cost either comes directly from the EIA forms (original), was filled in from the EIA's API using state-level averages (eiaapi), was filled in using a rolling average (rolling_avg), or combines multiple sources because the records were aggregated together (mixed).

fuel_cost_per_mmbtu

Average fuel cost per mmBTU of heat content in nominal USD.

fuel_cost_per_mwh

Derived from MCOE, a unit level value. Average fuel cost per MWh of heat content in nominal USD.

unit_heat_rate_mmbtu_per_mwh

Fuel content per unit of electricity generated. Coming from MCOE calculation.

net_generation_mwh

Net electricity generation for the specified period in megawatt-hours (MWh).

total_fuel_cost

Total annual reported fuel costs for the plant part. Includes costs from all fuels.

total_mmbtu

Total annual heat content of fuel consumed by a plant part record in the plant parts list.

ferc_acct_name

Name of FERC account, derived from technology description and prime mover code.

generator_operating_year

Year a generator went into service.

operational_status_pudl

The operating status of the asset using PUDL categories.

capacity_eoy_mw

Total end of year installed (nameplate) capacity for a plant part, in megawatts.

fraction_owned

Proportion of generator ownership attributable to this utility.

ownership_record_type

Whether each generator record is for one owner or represents a total of all ownerships.

out_eia__yearly_plant_parts

package: pudl

Annual time series of all EIA plant parts, for use in matching to FERC 1.

Processing:

Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.

Source:

EIA -- Mix of multiple EIA Forms

Primary key:

record_id_eia

Usage Warnings

  • Some columns contain subtotals; use caution when choosing columns to aggregate.

  • Data has been drawn from several EIA sources which are not always consistent with each other, and PUDL chooses the most consistent or relevant value to facilitate cross-referencing even if that means some values will differ from the raw sources. See Harvesting for details.

  • Contains information from multiple raw inputs.

Additional Details

Practically speaking, a plant is a collection of generator(s). Generators have many attributes (e.g. prime mover, primary fuel source, technology type). We can use these generator attributes to group generator records into larger aggregate records which we call "plant parts". A plant part is a record which corresponds to a particular collection of generators that all share an identical attribute and utility owner, e.g. all of the generators with unit_id=2, or all of the generators with coal as their primary fuel source.
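The grouping described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not PUDL's implementation: the function make_plant_parts and the sample generator records are hypothetical, and only two aggregated columns (capacity_mw and record_count) are shown.

```python
from collections import defaultdict

# Hypothetical generator records; the values are made up for illustration.
generators = [
    {"plant_id_eia": 1, "generator_id": "1", "utility_id_eia": 100,
     "unit_id_pudl": 1, "energy_source_code_1": "BIT", "capacity_mw": 500.0},
    {"plant_id_eia": 1, "generator_id": "2", "utility_id_eia": 100,
     "unit_id_pudl": 2, "energy_source_code_1": "NG", "capacity_mw": 150.0},
    {"plant_id_eia": 1, "generator_id": "3A", "utility_id_eia": 100,
     "unit_id_pudl": 2, "energy_source_code_1": "NG", "capacity_mw": 100.0},
]


def make_plant_parts(generators, part_col):
    """Group generators that share a plant, a utility owner and one attribute."""
    groups = defaultdict(list)
    for gen in generators:
        key = (gen["plant_id_eia"], gen["utility_id_eia"], gen[part_col])
        groups[key].append(gen)
    # Aggregate each group into a single plant part record.
    return {
        key: {
            "capacity_mw": sum(g["capacity_mw"] for g in members),
            "record_count": len({g["generator_id"] for g in members}),
        }
        for key, members in groups.items()
    }


# The same generators yield different plant parts depending on the attribute.
by_unit = make_plant_parts(generators, "unit_id_pudl")
by_fuel = make_plant_parts(generators, "energy_source_code_1")
```

Because the same generators appear in several plant parts, summing a column across different plant_part values would count capacity more than once.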

The EIA data about power plants (from EIA 923 and 860) is reported in tables with records that correspond to mostly generators and plants. Other datasets (cough cough FERC1) are less well organized and include plants, generators and other plant parts all in the same table without any clear labels. This plant part table is an attempt to create records corresponding to many different plant parts in order to connect specific slices of EIA plants to other datasets.

Because generators are often owned by multiple utilities, another dimension of this plant part table involves generating two records for each owner: one for the portion of the plant part they own and one for the plant part as a whole. The portion records are labeled in the ownership_record_type column as owned and the total records are labeled as total.
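As a rough sketch of the two records per owner, assume that an owned record scales capacity by the owner's share and that a total record carries the full plant part with a fraction_owned of 1.0. Both are assumptions made for illustration; the function ownership_records and the utility IDs are hypothetical.

```python
def ownership_records(part_capacity_mw, fractions_owned):
    """Return an 'owned' and a 'total' record for each owner of a plant part.

    fractions_owned maps a hypothetical utility_id_eia to its ownership share.
    """
    records = []
    for utility_id, fraction in fractions_owned.items():
        # Portion of the plant part attributable to this owner (assumed scaling).
        records.append({
            "utility_id_eia": utility_id,
            "ownership_record_type": "owned",
            "fraction_owned": fraction,
            "capacity_mw": part_capacity_mw * fraction,
        })
        # The plant part as a whole, repeated once per owner.
        records.append({
            "utility_id_eia": utility_id,
            "ownership_record_type": "total",
            "fraction_owned": 1.0,
            "capacity_mw": part_capacity_mw,
        })
    return records


records = ownership_records(400.0, {100: 0.75, 200: 0.25})
owned = [r for r in records if r["ownership_record_type"] == "owned"]
```

Filtering on ownership_record_type before aggregating avoids mixing the owned and total records for the same plant part.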

This table includes A LOT of duplicative information about EIA plants. It is primarily meant for use as an input into the record linkage between FERC1 plants and EIA.

This table inherits canonicalized values for generators, plants, and utilities. EIA reports many attributes in many different tables across EIA-860 and EIA-923. In order to compile tidy, well-normalized database tables, PUDL collects all instances of these values and chooses a canonical value. By default, PUDL chooses the most consistently reported value of a given attribute as long as it accounts for at least 70% of the reported instances. If an attribute was reported inconsistently across the original EIA tables, then it will show up as a null value.
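The 70% rule can be illustrated with a short sketch. It is not PUDL's harvesting code: the function canonical_value is hypothetical, and it assumes the share is computed over non-null reported instances only.

```python
from collections import Counter


def canonical_value(reported, threshold=0.7):
    """Return the most common non-null value if its share meets the threshold.

    Assumption: null reports (None) are ignored when computing the share.
    """
    values = [v for v in reported if v is not None]
    if not values:
        return None
    value, count = Counter(values).most_common(1)[0]
    # Inconsistently reported attributes fall below the threshold and become null.
    return value if count / len(values) >= threshold else None


consistent = canonical_value(["TX"] * 8 + ["OK"] * 2)
inconsistent = canonical_value(["TX", "TX", "OK", "OK"])
```

Here the first call returns the value reported in 8 of 10 instances, while the second returns None because no value reaches the threshold.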

Columns
record_id_eia

Identifier for EIA plant parts analysis records.

plant_id_eia

The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.

report_date

Date reported.

plant_part

The part of the plant a record corresponds to.

generator_id

Generator ID is usually numeric, but sometimes includes letters. Make sure you treat it as a string!

unit_id_pudl

Dynamically assigned PUDL unit id. WARNING: This ID is not guaranteed to be static long term as the input data and algorithm may evolve over time.

prime_mover_code

Code for the type of prime mover (e.g. CT, CG)

energy_source_code_1

The code representing the most predominant type of energy that fuels the generator.

technology_description

High level description of the technology used by the generator to produce electricity.

ferc_acct_name

Name of FERC account, derived from technology description and prime mover code.

utility_id_eia

The EIA Utility Identification number.

true_gran

Indicates whether a plant part list record is associated with the highest priority plant part for all identical records.

appro_part_label

Plant part of the associated true granularity record.

appro_record_id_eia

EIA record ID of the associated true granularity record.

ferc1_generator_agg_id

ID dynamically assigned by PUDL to EIA records with multiple matches to a single FERC ID in the FERC-EIA manual matching process. The ID is manually assigned and has not been updated since 2020, but only affects a couple hundred records total across all years.

capacity_eoy_mw

Total end of year installed (nameplate) capacity for a plant part, in megawatts.

capacity_factor

Fraction of potential generation that was actually reported for a plant part.

capacity_mw

Total installed (nameplate) capacity, in megawatts.

construction_year

Year the plant's oldest still operational unit was built.

fraction_owned

Proportion of generator ownership attributable to this utility.

fuel_cost_per_mmbtu

Average fuel cost per mmBTU of heat content in nominal USD.

fuel_cost_per_mwh

Derived from MCOE, a unit level value. Average fuel cost per MWh of heat content in nominal USD.

fuel_type_code_pudl

Simplified fuel type code used in PUDL

generator_retirement_date

Date of the scheduled or effected retirement of the generator.

unit_heat_rate_mmbtu_per_mwh

Fuel content per unit of electricity generated. Coming from MCOE calculation.

installation_year

Year the plant's most recently built unit was installed.

net_generation_mwh

Net electricity generation for the specified period in megawatt-hours (MWh).

generator_operating_year

Year a generator went into service.

operational_status

The operating status of the asset. For generators this is based on which tab of EIA 860 the generator was listed in.

operational_status_pudl

The operating status of the asset using PUDL categories.

ownership_record_type

Whether each generator record is for one owner or represents a total of all ownerships.

ownership_dupe

Whether a plant part record has a duplicate record with different ownership status.

planned_generator_retirement_date

Planned effective date of the scheduled retirement of the generator.

plant_id_pudl

A manually assigned PUDL plant ID. May not be constant over time.

plant_name_eia

Plant name.

plant_name_ppe

Derived plant name that includes EIA plant name and other strings associated with ID and PK columns of the plant part.

plant_part_id_eia

Contains EIA plant ID, plant part, ownership, and EIA utility id

record_count

Number of distinct generator IDs that participated in the aggregation for a plant part list record.

total_fuel_cost

Total annual reported fuel costs for the plant part. Includes costs from all fuels.

total_mmbtu

Total annual heat content of fuel consumed by a plant part record in the plant parts list.

utility_id_pudl

A manually assigned PUDL utility ID. May not be stable over time.

utility_name_eia

The name of the utility.

report_year

Four-digit year in which the data was reported.

plant_id_report_year

PUDL plant ID and report year of the record.

out_eia__yearly_plants

package: pudl

Annual time series of all EIA plant attributes.

Processing:

Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.

Source:

EIA -- Mix of multiple EIA Forms

Primary key:

plant_id_eia, report_date

Usage Warnings

  • Data has been drawn from several EIA sources which are not always consistent with each other, and PUDL chooses the most consistent or relevant value to facilitate cross-referencing even if that means some values will differ from the raw sources. See Harvesting for details.

  • Contains information from multiple raw inputs.

Additional Details

This table inherits canonicalized values for plants. EIA reports many attributes in many different tables across EIA-860 and EIA-923. In order to compile tidy, well-normalized database tables, PUDL collects all instances of these values and chooses a canonical value. By default, PUDL chooses the most consistently reported value of a given attribute as long as it accounts for at least 70% of the reported instances. If an attribute was reported inconsistently across the original EIA tables, then it will show up as a null value.

Columns
plant_id_eia

The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.

plant_name_eia

Plant name.

city

Name of the city.

county

County name.

latitude

Latitude of the plant's location, in degrees.

longitude

Longitude of the plant's location, in degrees.

state

Two letter US state abbreviation.

street_address

Physical street address.

zip_code

Five digit US Zip Code.

timezone

IANA timezone name

report_date

Date reported.

ash_impoundment

Is there an ash impoundment (e.g. pond, reservoir) at the plant?

ash_impoundment_lined

If there is an ash impoundment at the plant, is the impoundment lined?

ash_impoundment_status

If there is an ash impoundment at the plant, the ash impoundment status as of December 31 of the reporting year.

balancing_authority_code_eia

EIA short code identifying a balancing authority. May include Canadian and Mexican BAs.

balancing_authority_name_eia

Name of the balancing authority.

datum

Geodetic coordinate system identifier (e.g. NAD27, NAD83, or WGS84).

energy_storage

Indicates if the facility has energy storage capabilities.

ferc_cogen_docket_no

The docket number relating to the FERC cogenerator status. See FERC Form 556.

ferc_cogen_status

Indicates whether the plant has FERC qualifying facility cogenerator status. See FERC Form 556.

ferc_exempt_wholesale_generator_docket_no

The docket number relating to the FERC qualifying facility exempt wholesale generator status.

ferc_exempt_wholesale_generator

Indicates whether the plant has FERC qualifying facility exempt wholesale generator status

ferc_small_power_producer_docket_no

The docket number relating to the FERC qualifying facility small power producer status. See FERC Form 556.

ferc_small_power_producer

Indicates whether the plant has FERC qualifying facility small power producer status. See FERC Form 556.

ferc_qualifying_facility_docket_no

The docket number relating to the FERC qualifying facility cogenerator status. See FERC Form 556.

grid_voltage_1_kv

Plant's grid voltage at point of interconnection to transmission or distribution facilities

grid_voltage_2_kv

Plant's grid voltage at point of interconnection to transmission or distribution facilities

grid_voltage_3_kv

Plant's grid voltage at point of interconnection to transmission or distribution facilities

iso_rto_code

The code of the plant's ISO or RTO. NA if not reported in that year.

liquefied_natural_gas_storage

Indicates if the facility has the capability to store natural gas in the form of liquefied natural gas.

natural_gas_local_distribution_company

Names of Local Distribution Company (LDC), connected to natural gas burning power plants.

natural_gas_storage

Indicates if the facility has on-site storage of natural gas.

natural_gas_pipeline_name_1

The name of the owner or operator of natural gas pipeline that connects directly to this facility or that connects to a lateral pipeline owned by this facility.

natural_gas_pipeline_name_2

The name of the owner or operator of natural gas pipeline that connects directly to this facility or that connects to a lateral pipeline owned by this facility.

natural_gas_pipeline_name_3

The name of the owner or operator of natural gas pipeline that connects directly to this facility or that connects to a lateral pipeline owned by this facility.

nerc_region

NERC region in which the plant is located

has_net_metering

Whether the plant has a net metering agreement in effect during the reporting year. (Only displayed for facilities that report the sun or wind as an energy source.) This field was only reported up until 2015.

pipeline_notes

Additional owner or operator of natural gas pipeline.

primary_purpose_id_naics

North American Industry Classification System (NAICS) code that best describes the primary purpose of the reporting plant

regulatory_status_code

Indicates whether the plant is regulated or non-regulated.

reporting_frequency_code

Code that specifies what time period data has to be reported (i.e. monthly data or annual totals) and how often the power plant reports this data to EIA. See reporting_frequencies_eia for more details.

sector_id_eia

EIA assigned sector ID, corresponding to high level NAICS sector, designated by the primary purpose, regulatory status and plant-level combined heat and power status

sector_name_eia

EIA assigned sector name, corresponding to high level NAICS sector, designated by the primary purpose, regulatory status and plant-level combined heat and power status

service_area

Service area in which plant is located; for unregulated companies, it's the electric utility with which plant is interconnected

transmission_distribution_owner_id

EIA-assigned code for owner of transmission/distribution system to which the plant is interconnected.

transmission_distribution_owner_name

Name of the owner of the transmission or distribution system to which the plant is interconnected.

transmission_distribution_owner_state

State location for owner of transmission/distribution system to which the plant is interconnected.

utility_id_eia

The EIA Utility Identification number.

water_source

Name of water source associated with the plant.

data_maturity

Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.

plant_id_pudl

A manually assigned PUDL plant ID. May not be constant over time.

utility_name_eia

The name of the utility.

utility_id_pudl

A manually assigned PUDL utility ID. May not be stable over time.

balancing_authority_code_eia_consistent_rate

Percentage consistency of balancing authority code across entity records.

out_eia__yearly_utilities

package: pudl

Annual time series of all EIA utility attributes.

Processing:

Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.

Source:

EIA -- Mix of multiple EIA Forms

Primary key:

utility_id_eia, report_date

Usage Warnings

  • Data has been drawn from several EIA sources which are not always consistent with each other, and PUDL chooses the most consistent or relevant value to facilitate cross-referencing even if that means some values will differ from the raw sources. See Harvesting for details.

  • Contains information from multiple raw inputs.

Additional Details

This table inherits canonicalized values for utilities. EIA reports many attributes in many different tables across EIA-860 and EIA-923. In order to compile tidy, well-normalized database tables, PUDL collects all instances of these values and chooses a canonical value. By default, PUDL chooses the most consistently reported value of a given attribute as long as it accounts for at least 70% of the reported instances. If an attribute was reported inconsistently across the original EIA tables, then it will show up as a null value.

Columns
utility_id_eia

The EIA Utility Identification number.

utility_id_pudl

A manually assigned PUDL utility ID. May not be stable over time.

utility_name_eia

The name of the utility.

report_date

Date reported.

street_address

Physical street address.

city

Name of the city.

state

Two letter US state abbreviation.

zip_code

Five digit US Zip Code.

plants_reported_owner

Is the reporting entity an owner of power plants reported on Schedule 2 of the form?

plants_reported_operator

Is the reporting entity an operator of power plants reported on Schedule 2 of the form?

plants_reported_asset_manager

Is the reporting entity an asset manager of power plants reported on Schedule 2 of the form?

plants_reported_other_relationship

Does the reporting entity have any other relationship to the power plants reported on Schedule 2 of the form?

entity_type

Entity type of principal owner.

attention_line

Mail attention name of the operator/owner.

address_2

Second line of the address.

zip_code_4

Four digit US Zip Code suffix.

contact_firstname

First name of utility contact 1.

contact_lastname

Last name of utility contact 1.

contact_title

Title of utility contact 1.

phone_number

Phone number for utility contact 1.

phone_extension

Phone extension for utility contact 1

contact_firstname_2

First name of utility contact 2.

contact_lastname_2

Last name of utility contact 2.

contact_title_2

Title of utility contact 2.

phone_number_2

Phone number for utility contact 2.

phone_extension_2

Phone extension for utility contact 2

data_maturity

Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.

out_ferc1__yearly_balance_sheet_assets_sched110

package: pudl

Annual time series of utility assets and other debits.

Processing:

Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.

Source:

FERC Form 1 -- Annual Report of Major Electric Utilities (Schedule 110)

Primary key:

utility_id_ferc1, report_year, asset_type

Usage Warnings

  • Some columns contain subtotals; use caution when choosing columns to aggregate.

  • FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.

Columns
report_year

Four-digit year in which the data was reported.

utility_id_ferc1

PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.

utility_id_ferc1_dbf

FERC-assigned respondent_id from DBF reporting years, identifying the reporting entity. Stable from year to year.

utility_id_ferc1_xbrl

FERC-assigned entity_id from XBRL reporting years, identifying the reporting entity. Stable from year to year.

utility_id_pudl

A manually assigned PUDL utility ID. May not be stable over time.

utility_name_ferc1

Name of the responding utility, as it is reported in FERC Form 1. For human readability only.

record_id

Identifier indicating original FERC Form 1 source record. format: {table_name}_{report_year}_{report_prd}_{respondent_id}_{spplmnt_num}_{row_number}. Unique within FERC Form 1 DB tables which are not row-mapped.

asset_type

Type of asset being reported to the core_ferc1__yearly_balance_sheet_assets_sched110 table.

balance

Indication of whether a column is a credit or debit, as reported in the XBRL taxonomy.

ending_balance

Account balance at end of year.

ferc_account

Actual FERC Account number (e.g. '359.1') if available, or a PUDL assigned ID when FERC accounts have been split or combined in reporting.

row_type_xbrl

Indicates whether the value reported in the row is calculated, or uniquely reported within the table.

starting_balance

Account balance at beginning of year.

utility_type

Listing of utility plant types. Examples include Electric Utility, Gas Utility, and Other Utility.

out_ferc1__yearly_balance_sheet_liabilities_sched110

package: pudl

Annual time series of utility liabilities and other credits.

Processing:

Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.

Source:

FERC Form 1 -- Annual Report of Major Electric Utilities (Schedule 110)

Primary key:

utility_id_ferc1, report_year, liability_type

Usage Warnings

  • Some columns contain subtotals; use caution when choosing columns to aggregate.

  • FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.

Columns
record_id

Identifier indicating original FERC Form 1 source record. format: {table_name}_{report_year}_{report_prd}_{respondent_id}_{spplmnt_num}_{row_number}. Unique within FERC Form 1 DB tables which are not row-mapped.

report_year

Four-digit year in which the data was reported.

utility_id_ferc1

PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.

utility_id_ferc1_dbf

FERC-assigned respondent_id from DBF reporting years, identifying the reporting entity. Stable from year to year.

utility_id_ferc1_xbrl

FERC-assigned entity_id from XBRL reporting years, identifying the reporting entity. Stable from year to year.

utility_id_pudl

A manually assigned PUDL utility ID. May not be stable over time.

utility_name_ferc1

Name of the responding utility, as it is reported in FERC Form 1. For human readability only.

starting_balance

Account balance at beginning of year.

ending_balance

Account balance at end of year.

liability_type

Type of liability being reported to the core_ferc1__yearly_balance_sheet_liabilities_sched110 table.

balance

Indication of whether a column is a credit or debit, as reported in the XBRL taxonomy.

ferc_account

Actual FERC Account number (e.g. '359.1') if available, or a PUDL assigned ID when FERC accounts have been split or combined in reporting.

row_type_xbrl

Indicates whether the value reported in the row is calculated, or uniquely reported within the table.

utility_type

Listing of utility plant types. Examples include Electric Utility, Gas Utility, and Other Utility.

out_ferc1__yearly_cash_flows_sched120

package: pudl

Annual time series of utility cash flow.

Processing:

Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.

Source:

FERC Form 1 -- Annual Report of Major Electric Utilities (Schedule 120)

Primary key:

utility_id_ferc1, report_year, amount_type

Usage Warnings

  • Some columns contain subtotals; use caution when choosing columns to aggregate.

  • FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.

Columns
record_id

Identifier indicating original FERC Form 1 source record. format: {table_name}_{report_year}_{report_prd}_{respondent_id}_{spplmnt_num}_{row_number}. Unique within FERC Form 1 DB tables which are not row-mapped.

report_year

Four-digit year in which the data was reported.

utility_id_ferc1

PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.

utility_id_ferc1_dbf

FERC-assigned respondent_id from DBF reporting years, identifying the reporting entity. Stable from year to year.

utility_id_ferc1_xbrl

FERC-assigned entity_id from XBRL reporting years, identifying the reporting entity. Stable from year to year.

utility_id_pudl

A manually assigned PUDL utility ID. May not be stable over time.

utility_name_ferc1

Name of the responding utility, as it is reported in FERC Form 1. For human readability only.

amount_type

Label describing the type of amount being reported. This could be a balance or a change in value.

amount

Reported amount of dollars. This could be a balance or a change in value.

balance

Indication of whether a column is a credit or debit, as reported in the XBRL taxonomy.

row_type_xbrl

Indicates whether the value reported in the row is calculated, or uniquely reported within the table.

out_ferc1__yearly_depreciation_by_function_sched219

package: pudl

Annual time series of ending balances in accumulated provision for depreciation of electric utility plant.

Processing:

Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.

Source:

FERC Form 1 -- Annual Report of Major Electric Utilities (Schedule 219 - Section B)

Primary key:

utility_id_ferc1, report_year, depreciation_type, plant_function, plant_status, utility_type

Usage Warnings

  • Some columns contain subtotals; use caution when choosing columns to aggregate.

  • FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.

Additional Details

Electric utility plant refers to FERC Account 108. Section B includes the Balances at End of Year According to Functional Classification (plant_function).

Columns
report_year

Four-digit year in which the data was reported.

utility_id_ferc1

PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.

utility_id_ferc1_dbf

FERC-assigned respondent_id from DBF reporting years, identifying the reporting entity. Stable from year to year.

utility_id_ferc1_xbrl

FERC-assigned entity_id from XBRL reporting years, identifying the reporting entity. Stable from year to year.

utility_id_pudl

A manually assigned PUDL utility ID. May not be stable over time.

utility_name_ferc1

Name of the responding utility, as it is reported in FERC Form 1. For human readability only.

depreciation_type

Type of depreciation provision within FERC Account 108, including cost of removal, depreciation expenses, salvage, cost of retired plant, etc.

plant_function

Functional role played by utility plant (steam production, nuclear production, distribution, transmission, etc.).

plant_status

Utility plant financial status (in service, future, leased, total).

utility_type

Listing of utility plant types. Examples include Electric Utility, Gas Utility, and Other Utility.

ending_balance

Account balance at end of year.

record_id

Identifier indicating original FERC Form 1 source record. format: {table_name}_{report_year}_{report_prd}_{respondent_id}_{spplmnt_num}_{row_number}. Unique within FERC Form 1 DB tables which are not row-mapped.

balance

Indication of whether a column is a credit or debit, as reported in the XBRL taxonomy.

row_type_xbrl

Indicates whether the value reported in the row is calculated, or uniquely reported within the table.

out_ferc1__yearly_depreciation_changes_sched219

package: pudl

Annual time series of changes in accumulated provision for depreciation of electric utility plant.

Processing:

Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.

Source:

FERC Form 1 -- Annual Report of Major Electric Utilities (Schedule 219 - Section A)

Primary key:

utility_id_ferc1, report_year, depreciation_type, plant_status, utility_type

Usage Warnings

  • Some columns contain subtotals; use caution when choosing columns to aggregate.

  • FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.

Additional Details

Electric utility plant refers to FERC Account 108.

Columns
report_year

Four-digit year in which the data was reported.

utility_id_ferc1

PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.

utility_id_ferc1_dbf

FERC-assigned respondent_id from DBF reporting years, identifying the reporting entity. Stable from year to year.

utility_id_ferc1_xbrl

FERC-assigned entity_id from XBRL reporting years, identifying the reporting entity. Stable from year to year.

utility_id_pudl

A manually assigned PUDL utility ID. May not be stable over time.

utility_name_ferc1

Name of the responding utility, as it is reported in FERC Form 1. For human readability only.

depreciation_type

Type of depreciation provision within FERC Account 108, including cost of removal, depreciation expenses, salvage, cost of retired plant, etc.

plant_status

Utility plant financial status (in service, future, leased, total).

utility_type

Listing of utility plant types. Examples include Electric Utility, Gas Utility, and Other Utility.

dollar_value

Dollar value of reported income, expense, asset, or liability.

record_id

Identifier indicating original FERC Form 1 source record. format: {table_name}_{report_year}_{report_prd}_{respondent_id}_{spplmnt_num}_{row_number}. Unique within FERC Form 1 DB tables which are not row-mapped.

balance

Indication of whether a column is a credit or debit, as reported in the XBRL taxonomy.

ferc_account

Actual FERC Account number (e.g. '359.1') if available, or a PUDL assigned ID when FERC accounts have been split or combined in reporting.

row_type_xbrl

Indicates whether the value reported in the row is calculated, or uniquely reported within the table.

out_ferc1__yearly_depreciation_summary_sched336

package: pudl

Annual time series of depreciation and amortization of electric plant.

Processing:

Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.

Source:

FERC Form 1 -- Annual Report of Major Electric Utilities (Schedule 336 - Section A)

Primary key:

utility_id_ferc1, report_year, plant_function, ferc_account_label

Usage Warnings

  • Some columns contain subtotals; use caution when choosing columns to aggregate.

  • FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.

Additional Details

Electric Plant refers to FERC Accounts 403, 404, and 405. This table only contains information from Section A: Summary of depreciation and amortization changes.

Columns
record_id

Identifier indicating original FERC Form 1 source record. format: {table_name}_{report_year}_{report_prd}_{respondent_id}_{spplmnt_num}_{row_number}. Unique within FERC Form 1 DB tables which are not row-mapped.

report_year

Four-digit year in which the data was reported.

utility_id_ferc1

PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.

utility_id_ferc1_dbf

FERC-assigned respondent_id from DBF reporting years, identifying the reporting entity. Stable from year to year.

utility_id_ferc1_xbrl

FERC-assigned entity_id from XBRL reporting years, identifying the reporting entity. Stable from year to year.

utility_id_pudl

A manually assigned PUDL utility ID. May not be stable over time.

utility_name_ferc1

Name of the responding utility, as it is reported in FERC Form 1. For human readability only.

plant_function

Functional role played by utility plant (steam production, nuclear production, distribution, transmission, etc.).

ferc_account_label

Long FERC account identifier derived from values reported in the XBRL taxonomies. May also refer to aggregations of individual FERC accounts.

ferc_account

Actual FERC Account number (e.g. '359.1') if available, or a PUDL assigned ID when FERC accounts have been split or combined in reporting.

utility_type

Listing of utility plant types. Examples include Electric Utility, Gas Utility, and Other Utility.

dollar_value

Dollar value of reported income, expense, asset, or liability.

out_ferc1__yearly_detailed_balance_sheet_assets

package: pudl

Annual time series of granular, de-duplicated accounting data of utilities' balance sheet assets.

Processing:

Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.

Source:

FERC Form 1 -- Annual Report of Major Electric Utilities (Schedules 110, 200, 204 and 219)

Primary key:

This table has no primary key.

Usage Warnings

  • The data from these xbrl_factoid tables contains nested totals and subtotals - making aggregations difficult. We used FERC 1's reported calculations to determine the most granular, non-duplicative records. The reported calculations are not always perfect - we've corrected many of them, but errors could still exist.

  • FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.

Additional Details

This table is derived from four FERC Form 1 accounting tables with nested calculations:

  • core_ferc1__yearly_balance_sheet_assets_sched110

  • core_ferc1__yearly_utility_plant_summary_sched200

  • core_ferc1__yearly_plant_in_service_sched204

  • core_ferc1__yearly_depreciation_by_function_sched219

We reconciled the nested calculations within these tables and then identified the most granular data across the tables. We applied slight modifications to three columns (utility_type, plant_function & plant_status) as compared to the originally reported values in our core tables. The modifications were applied to either provide more specificity (e.g. we converted some total utility_type values into electric) or to condense similar categories for easier analysis (e.g. creating a hydraulic_production plant_function by combining hydraulic_production_conventional and hydraulic_production_pumped_storage). See pudl.output.ferc1.Exploder for more details. This table was made entirely with support and direction from RMI.
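
The category condensing described above can be pictured with a minimal sketch. It is not the pudl.output.ferc1.Exploder implementation: the mapping dictionary and the helper name are hypothetical, and only the two hydraulic categories named in the text are included.

```python
# Hypothetical mapping covering only the two categories named in the text.
CONDENSED_PLANT_FUNCTION = {
    "hydraulic_production_conventional": "hydraulic_production",
    "hydraulic_production_pumped_storage": "hydraulic_production",
}


def condense_plant_function(value):
    """Return the condensed category; other values (and None) pass through."""
    return CONDENSED_PLANT_FUNCTION.get(value, value)


# Illustrative plant_function values, not taken from real filings.
reported = [
    "hydraulic_production_conventional",
    "hydraulic_production_pumped_storage",
    "steam_production",
    None,
]
condensed = [condense_plant_function(v) for v in reported]
```

After condensing, records that shared a utility and year but differed only in the hydraulic sub-category carry the same plant_function, so they may need to be summed before analysis.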

Columns
report_year

Four-digit year in which the data was reported.

utility_id_ferc1

PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.

utility_id_ferc1_dbf

FERC-assigned respondent_id from DBF reporting years, identifying the reporting entity. Stable from year to year.

utility_id_ferc1_xbrl

FERC-assigned entity_id from XBRL reporting years, identifying the reporting entity. Stable from year to year.

utility_id_pudl

A manually assigned PUDL utility ID. May not be stable over time.

utility_name_ferc1

Name of the responding utility, as it is reported in FERC Form 1. For human readability only.

xbrl_factoid

The name of the type of value, derived from the XBRL fact name.

utility_type

Listing of utility plant types. Examples include Electric Utility, Gas Utility, and Other Utility.

plant_function

Functional role played by utility plant (steam production, nuclear production, distribution, transmission, etc.).

plant_status

Utility plant financial status (in service, future, leased, total).

ending_balance

Account balance at end of year.

utility_type_other

Freeform description of the type of utility reported in one of the three other utility_type sections in the core_ferc1__yearly_utility_plant_summary_sched200 table. This field is reported only in the DBF reporting years (1994-2020).

in_rate_base

Whether or not a record from the detailed FERC1 accounting tables should be considered allowable in a utility's rate base based on utility accounting standards. This flag was manually compiled by RMI utility accounting experts based on the xbrl_factoid and sometimes varies based on the utility_type, plant_status or plant_function.

rate_base_category

A category of asset or liability that RMI compiled to use as a shorthand for various types of utility assets. These tags were compiled manually based on the xbrl_factoid and sometimes vary based on the utility_type, plant_function or plant_status as well.

table_name

The name of the PUDL database table where a given record originated from.

out_ferc1__yearly_detailed_balance_sheet_liabilities

package: pudl

Annual time series of granular, de-duplicated accounting data of utilities' balance sheet liabilities.

Processing:

Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.

Source:

FERC Form 1 -- Annual Report of Major Electric Utilities (Schedule 110 and 118)

Primary key:

This table has no primary key.

Usage Warnings

  • The data from these xbrl_factoid tables contains nested totals and subtotals - making aggregations difficult. We used FERC 1's reported calculations to determine the most granular, non-duplicative records. The reported calculations are not always perfect - we've corrected many of them, but errors could still exist.

  • FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.

Additional Details

This table is derived from two FERC Form 1 accounting tables with nested calculations:

  • core_ferc1__yearly_balance_sheet_liabilities_sched110

  • core_ferc1__yearly_retained_earnings_sched118

We reconciled the nested calculations within these tables and then identified the most granular data across the tables. We applied slight modifications to three columns (utility_type, plant_function & plant_status) as compared to the originally reported values in our core tables. The modifications were applied to either provide more specificity (e.g. we converted some total utility_type values into electric) or to condense similar categories for easier analysis (e.g. creating a hydraulic_production plant_function by combining hydraulic_production_conventional and hydraulic_production_pumped_storage). See pudl.output.ferc1.Exploder for more details. This table was made entirely with support and direction from RMI.

Columns
report_year

Four-digit year in which the data was reported.

utility_id_ferc1

PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.

utility_id_ferc1_dbf

FERC-assigned respondent_id from DBF reporting years, identifying the reporting entity. Stable from year to year.

utility_id_ferc1_xbrl

FERC-assigned entity_id from XBRL reporting years, identifying the reporting entity. Stable from year to year.

utility_id_pudl

A manually assigned PUDL utility ID. May not be stable over time.

utility_name_ferc1

Name of the responding utility, as it is reported in FERC Form 1. For human readability only.

xbrl_factoid

The name of the type of value, derived from the XBRL fact name.

utility_type

Listing of utility plant types. Examples include Electric Utility, Gas Utility, and Other Utility.

ending_balance

Account balance at end of year.

in_rate_base

Whether or not a record from the detailed FERC1 accounting tables should be considered allowable in a utility's rate base based on utility accounting standards. This flag was manually compiled by RMI utility accounting experts based on the xbrl_factoid and sometimes varies based on the utility_type, plant_status or plant_function.

rate_base_category

A category of asset or liability that RMI compiled to use as a shorthand for various types of utility assets. These tags were compiled manually based on the xbrl_factoid and sometimes vary based on the utility_type, plant_function or plant_status as well.

table_name

The name of the PUDL database table where a given record originated from.

out_ferc1__yearly_detailed_income_statements

package: pudl

Annual time series of granular, de-duplicated accounting data of utilities' income statements.

Processing:

Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.

Source:

FERC Form 1 -- Annual Report of Major Electric Utilities (Schedules 114, 300, 320 and 336)

Primary key:

This table has no primary key.

Usage Warnings

  • The data from these xbrl_factoid tables contains nested totals and subtotals - making aggregations difficult. We used FERC 1's reported calculations to determine the most granular, non-duplicative records. The reported calculations are not always perfect - we've corrected many of them, but errors could still exist.

  • FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.

Additional Details

This table is derived from four FERC Form 1 accounting tables with nested calculations:

  • core_ferc1__yearly_income_statements_sched114

  • core_ferc1__yearly_depreciation_summary_sched336

  • core_ferc1__yearly_operating_expenses_sched320

  • core_ferc1__yearly_operating_revenues_sched300

We reconciled the nested calculations within these tables and then identified the most granular data across the tables. We applied slight modifications to two columns (utility_type & plant_function) as compared to the originally reported values in our core tables. The modifications were applied to either provide more specificity (e.g. we converted some total utility_type values into electric) or to condense similar categories for easier analysis (e.g. creating a hydraulic_production plant_function by combining hydraulic_production_conventional and hydraulic_production_pumped_storage). See pudl.output.ferc1.Exploder for more details. This table was made entirely with support and direction from RMI.

Columns
report_year

Four-digit year in which the data was reported.

utility_id_ferc1

PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.

utility_id_ferc1_dbf

FERC-assigned respondent_id from DBF reporting years, identifying the reporting entity. Stable from year to year.

utility_id_ferc1_xbrl

FERC-assigned entity_id from XBRL reporting years, identifying the reporting entity. Stable from year to year.

utility_id_pudl

A manually assigned PUDL utility ID. May not be stable over time.

utility_name_ferc1

Name of the responding utility, as it is reported in FERC Form 1. For human readability only.

xbrl_factoid

The name of the type of value, derived from the XBRL fact name.

utility_type

Listing of utility plant types. Examples include Electric Utility, Gas Utility, and Other Utility.

plant_function

Functional role played by utility plant (steam production, nuclear production, distribution, transmission, etc.).

revenue_requirement_technology

The technology type associated with components of a utility's revenue requirement. These categories were manually compiled by RMI utility accounting experts based on the xbrl_factoid and sometimes vary based on the utility_type or plant_function as well. This column is intended to be used to aggregate this table.

dollar_value

Dollar value of reported income, expense, asset, or liability.

in_revenue_requirement

Whether or not a record from the detailed income statement data is typically included in a utility's revenue requirement. This flag was manually compiled by RMI utility accounting experts based on the xbrl_factoid and sometimes varies based on the utility_type or plant_function.

revenue_requirement_category

The category of revenue requirement associated with each component of a utility's income statements. These categories were manually compiled by RMI utility accounting experts based on the xbrl_factoid and sometimes vary based on the utility_type or plant_function. This column is intended to be used to aggregate this table.

table_name

The name of the PUDL database table where a given record originated from.
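
Since revenue_requirement_category is intended for aggregating this table, the following sketch shows one way that could look. It assumes rows are held as plain dictionaries keyed by the column names above; the category labels and dollar values are made up for illustration, and the helper name is hypothetical.

```python
from collections import defaultdict

# Made-up rows: column names follow the table above, values are illustrative.
rows = [
    {"utility_id_ferc1": 1, "report_year": 2021, "revenue_requirement_category": "category_a",
     "in_revenue_requirement": True, "dollar_value": 100.0},
    {"utility_id_ferc1": 1, "report_year": 2021, "revenue_requirement_category": "category_a",
     "in_revenue_requirement": True, "dollar_value": 50.0},
    {"utility_id_ferc1": 1, "report_year": 2021, "revenue_requirement_category": "category_b",
     "in_revenue_requirement": True, "dollar_value": 25.0},
    {"utility_id_ferc1": 1, "report_year": 2021, "revenue_requirement_category": "category_b",
     "in_revenue_requirement": False, "dollar_value": 999.0},
    {"utility_id_ferc1": 1, "report_year": 2021, "revenue_requirement_category": "category_a",
     "in_revenue_requirement": True, "dollar_value": None},
]


def total_by_category(records):
    """Sum dollar_value per (utility, year, category) for flagged, non-null rows."""
    totals = defaultdict(float)
    for r in records:
        # Skip rows not flagged as in the revenue requirement, and null values.
        if r["in_revenue_requirement"] is not True or r["dollar_value"] is None:
            continue
        key = (r["utility_id_ferc1"], r["report_year"], r["revenue_requirement_category"])
        totals[key] += r["dollar_value"]
    return dict(totals)


totals = total_by_category(rows)
```

The same grouping could use revenue_requirement_technology instead of the category column.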

out_ferc1__yearly_energy_dispositions_sched401

package: pudl

Annual time series of dispositions of electric energy sold, exchanged, or stored.

Processing:

Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.

Source:

FERC Form 1 -- Annual Report of Major Electric Utilities (Schedule 401a)

Primary key:

utility_id_ferc1, report_year, energy_disposition_type

Usage Warnings

  • Some columns contain subtotals; use caution when choosing columns to aggregate.

  • FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.

Additional Details

Electric Energy Account, dispositions only (Schedule 401a). Covers electricity that utilities delivered to end users, internal losses, etc.

Columns
report_year

Four-digit year in which the data was reported.

utility_id_ferc1

PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.

utility_id_ferc1_dbf

FERC-assigned respondent_id from DBF reporting years, identifying the reporting entity. Stable from year to year.

utility_id_ferc1_xbrl

FERC-assigned entity_id from XBRL reporting years, identifying the reporting entity. Stable from year to year.

utility_id_pudl

A manually assigned PUDL utility ID. May not be stable over time.

utility_name_ferc1

Name of the responding utility, as it is reported in FERC Form 1. For human readability only.

energy_disposition_type

Type of energy disposition reported in the core_ferc1__yearly_energy_dispositions_sched401 table. Dispositions include sales to customers, re-sales of energy, energy used internally, losses, etc.

row_type_xbrl

Indicates whether the value reported in the row is calculated, or uniquely reported within the table.

energy_mwh

Sources and uses of energy in MWh.

record_id

Identifier indicating original FERC Form 1 source record. format: {table_name}_{report_year}_{report_prd}_{respondent_id}_{spplmnt_num}_{row_number}. Unique within FERC Form 1 DB tables which are not row-mapped.

out_ferc1__yearly_energy_sources_sched401

package: pudl

Annual time series of sources of electric energy generated or purchased, exchanged and wheeled.

Processing:

Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.

Source:

FERC Form 1 -- Annual Report of Major Electric Utilities (Schedule 401a)

Primary key:

utility_id_ferc1, report_year, energy_source_type

Usage Warnings

  • Some columns contain subtotals; use caution when choosing columns to aggregate.

  • FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.

Additional Details

Electric Energy Account, sources only. Schedule 401a. Amount of electricity the utility obtained from each of several sources.

Columns
report_year

Four-digit year in which the data was reported.

utility_id_ferc1

PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.

utility_id_ferc1_dbf

FERC-assigned respondent_id from DBF reporting years, identifying the reporting entity. Stable from year to year.

utility_id_ferc1_xbrl

FERC-assigned entity_id from XBRL reporting years, identifying the reporting entity. Stable from year to year.

utility_id_pudl

A manually assigned PUDL utility ID. May not be stable over time.

utility_name_ferc1

Name of the responding utility, as it is reported in FERC Form 1. For human readability only.

energy_source_type

Type of energy source reported in the core_ferc1__yearly_energy_sources_sched401 table. There are three groups of energy sources: generation, power exchanges and transmission.

row_type_xbrl

Indicates whether the value reported in the row is calculated, or uniquely reported within the table.

energy_mwh

Sources and uses of energy in MWh.

record_id

Identifier indicating original FERC Form 1 source record. format: {table_name}_{report_year}_{report_prd}_{respondent_id}_{spplmnt_num}_{row_number}. Unique within FERC Form 1 DB tables which are not row-mapped.
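
Because this table mixes reported values with calculated subtotals, summing energy_mwh across all rows can double count. A minimal sketch of using row_type_xbrl to avoid that follows. The label strings "reported_value" and "calculated_value" are assumptions about how that column is encoded, and the energy_source_type names and amounts are invented; check the actual values in the data before relying on this.

```python
# Invented rows; the row_type_xbrl labels below are assumed, not confirmed.
rows = [
    {"energy_source_type": "source_a", "row_type_xbrl": "reported_value", "energy_mwh": 700.0},
    {"energy_source_type": "source_b", "row_type_xbrl": "reported_value", "energy_mwh": 300.0},
    {"energy_source_type": "total_of_a_and_b", "row_type_xbrl": "calculated_value", "energy_mwh": 1000.0},
]


def sum_reported_energy(records, reported_label="reported_value"):
    """Sum energy_mwh over rows flagged as uniquely reported, skipping nulls."""
    return sum(
        r["energy_mwh"]
        for r in records
        if r["row_type_xbrl"] == reported_label and r["energy_mwh"] is not None
    )


naive_total = sum(r["energy_mwh"] for r in rows)  # counts the subtotal row too
reported_total = sum_reported_energy(rows)
```

In this toy data the naive sum is twice the reported-only sum, because the calculated row repeats what the two reported rows already contain.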

out_ferc1__yearly_hydroelectric_plants_sched406

package: pudl

Annual time series of plant statistics for large hydroelectric generating plants.

Processing:

Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.

Source:

FERC Form 1 -- Annual Report of Major Electric Utilities (Schedule 406)

Primary key:

record_id

The best approximation of a primary key for this table would be: report_year, utility_id_ferc1, plant_name_ferc1. FERC does not publish plant IDs. The main identifying column is plant_name_ferc1, but it is a free-form string field and there are duplicate records.
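
To see how far the approximate key falls short of uniqueness, duplicates can be counted directly. This is a sketch assuming rows are plain dictionaries; the helper name, record_id strings, and plant names are hypothetical.

```python
from collections import Counter

# The approximate key named in the text above.
APPROX_KEY = ("report_year", "utility_id_ferc1", "plant_name_ferc1")


def duplicated_keys(records, key_cols=APPROX_KEY):
    """Return {key tuple: count} for key values that appear more than once."""
    counts = Counter(tuple(r[c] for c in key_cols) for r in records)
    return {key: n for key, n in counts.items() if n > 1}


# Invented rows for illustration only.
rows = [
    {"record_id": "rec_1", "report_year": 2020, "utility_id_ferc1": 7, "plant_name_ferc1": "example dam"},
    {"record_id": "rec_2", "report_year": 2020, "utility_id_ferc1": 7, "plant_name_ferc1": "example dam"},
    {"record_id": "rec_3", "report_year": 2020, "utility_id_ferc1": 7, "plant_name_ferc1": "other dam"},
]
dupes = duplicated_keys(rows)
```

Any key returned here identifies records that need manual review before plant-level aggregation.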

Usage Warnings

  • The plant_type and construction_type are standardized into categorical values from free-form strings.

  • FERC does not require respondents to report unique, non-duplicative plant records. There are sporadic instances of respondents reporting portions of plants and then the total plant (e.g. unit 1, unit 2 and total). Use caution when aggregating.

  • FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.

Additional Details

Large plants have an installed nameplate capacity of more than 10 MW.

Columns
report_year

Four-digit year in which the data was reported.

utility_id_ferc1

PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.

utility_id_ferc1_dbf

FERC-assigned respondent_id from DBF reporting years, identifying the reporting entity. Stable from year to year.

utility_id_ferc1_xbrl

FERC-assigned entity_id from XBRL reporting years, identifying the reporting entity. Stable from year to year.

utility_id_pudl

A manually assigned PUDL utility ID. May not be stable over time.

utility_name_ferc1

Name of the responding utility, as it is reported in FERC Form 1. For human readability only.

plant_name_ferc1

Name of the plant, as reported to FERC. This is a freeform string, not guaranteed to be consistent across references to the same plant.

record_id

Identifier indicating original FERC Form 1 source record. format: {table_name}_{report_year}_{report_prd}_{respondent_id}_{spplmnt_num}_{row_number}. Unique within FERC Form 1 DB tables which are not row-mapped.

asset_retirement_cost

Asset retirement cost (USD).

avg_num_employees

The average number of employees assigned to each plant.

capacity_factor

Fraction of potential generation that was actually reported for a plant part.

capacity_mw

Total installed (nameplate) capacity, in megawatts.

capex_equipment

Cost of plant: equipment (USD).

capex_facilities

Cost of plant: reservoirs, dams, and waterways (USD).

capex_land

Cost of plant: land and land rights (USD).

capex_per_mw

Cost of plant per megawatt of installed (nameplate) capacity. Nominal USD.

capex_roads

Cost of plant: roads, railroads, and bridges (USD).

capex_structures

Cost of plant: structures and improvements (USD).

capex_total

Total cost of plant (USD).

construction_type

Type of plant construction ('outdoor', 'semioutdoor', or 'conventional'). Categorized by PUDL based on our best guess of intended value in FERC1 freeform strings.

construction_year

Year the plant's oldest still operational unit was built.

installation_year

Year the plant's most recently built unit was installed.

net_capacity_adverse_conditions_mw

Net plant capability under the least favorable operating conditions, in megawatts.

net_capacity_favorable_conditions_mw

Net plant capability under the most favorable operating conditions, in megawatts.

net_generation_mwh

Net electricity generation for the specified period in megawatt-hours (MWh).

opex_dams

Production expenses: maintenance of reservoirs, dams, and waterways (USD).

opex_electric

Production expenses: electric expenses (USD).

opex_engineering

Production expenses: maintenance, supervision, and engineering (USD).

opex_generation_misc

Production expenses: miscellaneous power generation expenses (USD).

opex_hydraulic

Production expenses: hydraulic expenses (USD).

opex_misc_plant

Production expenses: maintenance of miscellaneous hydraulic plant (USD).

opex_operations

Production expenses: operations, supervision, and engineering (USD).

opex_per_mwh

Total production expenses (USD per MWh generated).

opex_plant

Production expenses: maintenance of electric plant (USD).

opex_rents

Production expenses: rents (USD).

opex_structures

Production expenses: maintenance of structures (USD).

opex_total

Total production expenses, excluding fuel (USD).

opex_total_nonfuel

Total production expenses, excluding fuel (USD).

opex_water_for_power

Production expenses: water for power (USD).

peak_demand_mw

Net peak demand for 60 minutes. Note: in some cases peak demand for other time periods may have been reported instead, if hourly peak demand was unavailable.

plant_hours_connected_while_generating

Hours the plant was connected to load while generating in the report year.

plant_id_pudl

A manually assigned PUDL plant ID. May not be constant over time.

plant_type

Type of plant.

project_num

FERC Licensed Project Number.
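
The ratio columns above (capacity_factor, capex_per_mw, opex_per_mwh) can be related to the other columns with a short sketch. The formulas are inferred from the column descriptions, not taken from PUDL's code, and the 8760-hour year is an assumption that ignores leap years. The helper names and example values are hypothetical.

```python
HOURS_PER_YEAR = 8760  # assumption: non-leap year


def safe_ratio(numerator, denominator):
    """Divide, returning None when an input is null or the denominator is zero."""
    if numerator is None or denominator is None or denominator == 0:
        return None
    return numerator / denominator


def derived_metrics(plant):
    """Recompute the ratio columns from one plant record held as a dict."""
    capacity = plant["capacity_mw"]
    potential_mwh = None if capacity is None else capacity * HOURS_PER_YEAR
    return {
        "capacity_factor": safe_ratio(plant["net_generation_mwh"], potential_mwh),
        "capex_per_mw": safe_ratio(plant["capex_total"], capacity),
        "opex_per_mwh": safe_ratio(plant["opex_total"], plant["net_generation_mwh"]),
    }


# Invented plant record for illustration.
example = {
    "capacity_mw": 100.0,
    "net_generation_mwh": 438000.0,
    "capex_total": 250_000_000.0,
    "opex_total": 2_190_000.0,
}
metrics = derived_metrics(example)
```

Recomputing these ratios and comparing them with the published columns is one way to spot records where capacity or generation was reported for only part of a plant.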

out_ferc1__yearly_income_statements_sched114

package: pudl

Annual time series of utility income statements.

Processing:

Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.

Source:

FERC Form 1 -- Annual Report of Major Electric Utilities (Schedule 114)

Primary key:

utility_id_ferc1, report_year, utility_type, income_type

Usage Warnings

  • Some columns contain subtotals; use caution when choosing columns to aggregate.

  • FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.

Columns
record_id

Identifier indicating original FERC Form 1 source record. format: {table_name}_{report_year}_{report_prd}_{respondent_id}_{spplmnt_num}_{row_number}. Unique within FERC Form 1 DB tables which are not row-mapped.

report_year

Four-digit year in which the data was reported.

utility_id_ferc1

PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.

utility_id_ferc1_dbf

FERC-assigned respondent_id from DBF reporting years, identifying the reporting entity. Stable from year to year.

utility_id_ferc1_xbrl

FERC-assigned entity_id from XBRL reporting years, identifying the reporting entity. Stable from year to year.

utility_id_pudl

A manually assigned PUDL utility ID. May not be stable over time.

utility_name_ferc1

Name of the responding utility, as it is reported in FERC Form 1. For human readability only.

utility_type

Listing of utility plant types. Examples include Electric Utility, Gas Utility, and Other Utility.

income_type

Type of income reported in core_ferc1__yearly_income_statements_sched114 table.

dollar_value

Dollar value of reported income, expense, asset, or liability.

balance

Indication of whether a column is a credit or debit, as reported in the XBRL taxonomy.

ferc_account

Actual FERC Account number (e.g. '359.1') if available, or a PUDL assigned ID when FERC accounts have been split or combined in reporting.

row_type_xbrl

Indicates whether the value reported in the row is calculated, or uniquely reported within the table.

out_ferc1__yearly_operating_expenses_sched320

package: pudl

Annual time series of operating and maintenance costs associated with producing electricity.

Processing:

Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.

Source:

FERC Form 1 -- Annual Report of Major Electric Utilities (Schedule 320)

Primary key:

utility_id_ferc1, report_year, expense_type

Usage Warnings

  • Some columns contain subtotals; use caution when choosing columns to aggregate.

  • FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.

Columns
report_year

Four-digit year in which the data was reported.

utility_id_ferc1

PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.

utility_id_ferc1_dbf

FERC-assigned respondent_id from DBF reporting years, identifying the reporting entity. Stable from year to year.

utility_id_ferc1_xbrl

FERC-assigned entity_id from XBRL reporting years, identifying the reporting entity. Stable from year to year.

utility_id_pudl

A manually assigned PUDL utility ID. May not be stable over time.

utility_name_ferc1

Name of the responding utility, as it is reported in FERC Form 1. For human readability only.

dollar_value

Dollar value of reported income, expense, asset, or liability.

expense_type

The type of expense.

utility_type

Listing of utility plant types. Examples include Electric Utility, Gas Utility, and Other Utility.

record_id

Identifier indicating original FERC Form 1 source record. format: {table_name}_{report_year}_{report_prd}_{respondent_id}_{spplmnt_num}_{row_number}. Unique within FERC Form 1 DB tables which are not row-mapped.

ferc_account

Actual FERC Account number (e.g. '359.1') if available, or a PUDL assigned ID when FERC accounts have been split or combined in reporting.

row_type_xbrl

Indicates whether the value reported in the row is calculated, or uniquely reported within the table.

out_ferc1__yearly_operating_revenues_sched300

package: pudl

Annual time series of utilities' electric operating revenues.

Processing:

Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.

Source:

FERC Form 1 -- Annual Report of Major Electric Utilities (Schedule 300)

Primary key:

utility_id_ferc1, report_year, revenue_type

Usage Warnings

  • Some columns contain subtotals; use caution when choosing columns to aggregate.

  • FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.

Additional Details

This table includes only the structured part of Schedule 300. A number of revenue_type values do not have sales_mwh or avg_customers_per_month provided, in which case these columns will be NULL.
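
Any calculation that divides by sales_mwh or avg_customers_per_month therefore needs to handle NULLs. A minimal sketch, with rows as plain dictionaries and invented revenue_type labels and values; the helper name is hypothetical:

```python
def revenue_per_mwh(row):
    """USD per MWh, or None when dollar_value is NULL or sales_mwh is NULL or zero."""
    if row["dollar_value"] is None or not row["sales_mwh"]:
        return None
    return row["dollar_value"] / row["sales_mwh"]


# Invented rows; None stands in for a NULL in the database.
rows = [
    {"revenue_type": "type_with_sales", "dollar_value": 1200.0, "sales_mwh": 10.0},
    {"revenue_type": "type_without_sales", "dollar_value": 500.0, "sales_mwh": None},
]
rates = {r["revenue_type"]: revenue_per_mwh(r) for r in rows}
```

Returning None rather than zero keeps missing sales data distinguishable from a genuine zero rate.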

Columns
report_year

Four-digit year in which the data was reported.

utility_id_ferc1

PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.

utility_id_ferc1_dbf

FERC-assigned respondent_id from DBF reporting years, identifying the reporting entity. Stable from year to year.

utility_id_ferc1_xbrl

FERC-assigned entity_id from XBRL reporting years, identifying the reporting entity. Stable from year to year.

utility_id_pudl

A manually assigned PUDL utility ID. May not be stable over time.

utility_name_ferc1

Name of the responding utility, as it is reported in FERC Form 1. For human readability only.

record_id

Identifier indicating original FERC Form 1 source record. format: {table_name}_{report_year}_{report_prd}_{respondent_id}_{spplmnt_num}_{row_number}. Unique within FERC Form 1 DB tables which are not row-mapped.

revenue_type

Label describing types of revenues.

dollar_value

Dollar value of reported income, expense, asset, or liability.

sales_mwh

Quantity of electricity sold in MWh.

avg_customers_per_month

Average number of customers per month.

ferc_account

Actual FERC Account number (e.g. '359.1') if available, or a PUDL assigned ID when FERC accounts have been split or combined in reporting.

utility_type

Listing of utility plant types. Examples include Electric Utility, Gas Utility, and Other Utility.

row_type_xbrl

Indicates whether the value reported in the row is calculated, or uniquely reported within the table.

out_ferc1__yearly_other_regulatory_liabilities_sched278

package: pudl

Annual time series of utilities' other regulatory liabilities, including rate order docket number.

Processing:

Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.

Source:

FERC Form 1 -- Annual Report of Major Electric Utilities (Schedule 278)

Primary key:

This table has no primary key. Respondents are able to enter any number of liabilities across many rows. There are no IDs or set fields enforced in the original table.

Usage Warnings

  • The description column is a free-form string.

  • FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.

Columns
report_year

Four-digit year in which the data was reported.

utility_id_ferc1

PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.

utility_id_ferc1_dbf

FERC-assigned respondent_id from DBF reporting years, identifying the reporting entity. Stable from year to year.

utility_id_ferc1_xbrl

FERC-assigned entity_id from XBRL reporting years, identifying the reporting entity. Stable from year to year.

utility_id_pudl

A manually assigned PUDL utility ID. May not be stable over time.

utility_name_ferc1

Name of the responding utility, as it is reported in FERC Form 1. For human readability only.

description

Long human-readable description of the meaning of a code/label.

ending_balance

Account balance at end of year.

starting_balance

Account balance at beginning of year.

increase_in_other_regulatory_liabilities

The increase during the reporting period of other regulatory liabilities.

account_detail

Description of the account number credited from making debit adjustment to other regulatory liabilities.

decrease_in_other_regulatory_liabilities

The decrease during the reporting period of other regulatory liabilities.

out_ferc1__yearly_plant_in_service_sched204

package: pudl

Annual time series of utilities' balances and changes to FERC Electric Plant in Service accounts.

Processing:

Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.

Source:

FERC Form 1 -- Annual Report of Major Electric Utilities (Schedule 204)

Primary key:

utility_id_ferc1, report_year, ferc_account_label

Usage Warnings

  • Some columns contain subtotals; use caution when choosing columns to aggregate.

  • FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.

Additional Details

Account numbers correspond to the FERC Uniform System of Accounts for Electric Plant, which is defined in Code of Federal Regulations (CFR) Title 18, Chapter I, Subchapter C, Part 101. (See e.g. https://www.law.cornell.edu/cfr/text/18/part-101). Each FERC respondent reports starting and ending balances for each account annually. Balances are organization-wide, and are not broken down on a per-plant basis. End-of-year balance should equal beginning-of-year balance plus the sum of additions, retirements, adjustments, and transfers.
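
The balance identity stated above can be checked row by row. The sketch below takes the sentence literally and adds all four change columns, which assumes retirements are stored as signed values (negative when they reduce the balance). If they are stored as positive magnitudes, the sign would need to be flipped. The helper name and example figures are hypothetical.

```python
# Change columns named in the identity above.
CHANGE_COLUMNS = ("additions", "retirements", "adjustments", "transfers")


def balance_gap(row):
    """Return ending_balance minus (starting_balance + changes), or None if a balance is null.

    Null change columns are treated as zero.
    """
    if row["starting_balance"] is None or row["ending_balance"] is None:
        return None
    changes = sum(row[c] or 0.0 for c in CHANGE_COLUMNS)
    return row["ending_balance"] - (row["starting_balance"] + changes)


# Invented account rows; retirements carry a negative sign by assumption.
consistent = {
    "starting_balance": 1000.0, "additions": 200.0, "retirements": -50.0,
    "adjustments": 0.0, "transfers": None, "ending_balance": 1150.0,
}
inconsistent = dict(consistent, ending_balance=1200.0)

gap_ok = balance_gap(consistent)
gap_bad = balance_gap(inconsistent)
```

A nonzero gap flags a row whose reported balances and changes do not reconcile; with real dollar amounts a small tolerance is usually more practical than exact equality.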

Columns
report_year

Four-digit year in which the data was reported.

utility_id_ferc1

PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.

utility_id_ferc1_dbf

FERC-assigned respondent_id from DBF reporting years, identifying the reporting entity. Stable from year to year.

utility_id_ferc1_xbrl

FERC-assigned entity_id from XBRL reporting years, identifying the reporting entity. Stable from year to year.

utility_id_pudl

A manually assigned PUDL utility ID. May not be stable over time.

utility_name_ferc1

Name of the responding utility, as it is reported in FERC Form 1. For human readability only.

utility_type

Listing of utility plant types. Examples include Electric Utility, Gas Utility, and Other Utility.

plant_status

Utility plant financial status (in service, future, leased, total).

record_id

Identifier indicating original FERC Form 1 source record. format: {table_name}_{report_year}_{report_prd}_{respondent_id}_{spplmnt_num}_{row_number}. Unique within FERC Form 1 DB tables which are not row-mapped.
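
Because the table name can itself contain underscores, the documented record_id pattern is easiest to take apart from the right. This is a rough sketch, assuming the five trailing fields never contain underscores; the helper name split_record_id and the example identifier are hypothetical, and identifiers that do not follow the documented pattern will either raise an error or be split incorrectly.

```python
def split_record_id(record_id):
    """Split a record_id into its six documented components.

    The split is done from the right because the leading table_name field
    may contain underscores, while the five trailing fields are assumed
    to be underscore-free.
    """
    parts = record_id.rsplit("_", 5)
    if len(parts) != 6:
        raise ValueError(f"unexpected record_id format: {record_id!r}")
    keys = (
        "table_name",
        "report_year",
        "report_prd",
        "respondent_id",
        "spplmnt_num",
        "row_number",
    )
    return dict(zip(keys, parts))


# Hypothetical identifier, for illustration only.
parsed = split_record_id("f1_plant_in_srvce_2019_12_42_0_7")
```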

additions

Cost of acquisition of items classified within the account.

adjustments

Cost of adjustments to the account.

ending_balance

Account balance at end of year.

ferc_account

Actual FERC Account number (e.g. '359.1') if available, or a PUDL assigned ID when FERC accounts have been split or combined in reporting.

ferc_account_label

Long FERC account identifier derived from values reported in the XBRL taxonomies. May also refer to aggregations of individual FERC accounts.

retirements

Cost of disposal of items classified within the account.

row_type_xbrl

Indicates whether the value reported in the row is calculated, or uniquely reported within the table.

starting_balance

Account balance at beginning of year.

transfers

Cost of transfers into (out of) the account.

out_ferc1__yearly_pumped_storage_plants_sched408

package: pudl

Annual time series of plant statistics for hydroelectric pumped storage plants with an installed nameplate capacity of 10+ MW.

Processing:

Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.

Source:

FERC Form 1 -- Annual Report of Major Electric Utilities (Schedule 408)

Primary key:

record_id

The best approximation of a natural primary key for this table would be: report_year, utility_id_ferc1, plant_name_ferc1. FERC does not publish plant IDs. The main identifying column is plant_name_ferc1, but that is a free-form string field and there are duplicate records.

Usage Warnings

  • FERC does not require respondents to report unique, non-duplicative plant records. Respondents sporadically report portions of a plant and then the plant total (e.g. unit 1, unit 2 and total). Use caution when aggregating.

  • FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.
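
One rough way to act on the first warning is to flag plant names that look like totals before summing anything. This is only a heuristic sketch: the pattern, the helper name flag_possible_totals and the sample names are all hypothetical, and a keyword match will miss totals that are not labeled as such.

```python
import re

# Hypothetical heuristic: treat any name containing the word "total" as suspect.
TOTAL_PATTERN = re.compile(r"\btotal\b", re.IGNORECASE)


def flag_possible_totals(plant_names):
    """Return one boolean per name; True where the name mentions 'total'."""
    return [bool(TOTAL_PATTERN.search(name or "")) for name in plant_names]


# Made-up plant names, including a null.
names = ["bear creek unit 1", "bear creek unit 2", "Bear Creek Total", None]
flags = flag_possible_totals(names)
```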

Additional Details

As reported in Schedule 408 of FERC Form 1.

Columns
report_year

Four-digit year in which the data was reported.

utility_id_ferc1

PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.

utility_id_ferc1_dbf

FERC-assigned respondent_id from DBF reporting years, identifying the reporting entity. Stable from year to year.

utility_id_ferc1_xbrl

FERC-assigned entity_id from XBRL reporting years, identifying the reporting entity. Stable from year to year.

utility_id_pudl

A manually assigned PUDL utility ID. May not be stable over time.

utility_name_ferc1

Name of the responding utility, as it is reported in FERC Form 1. For human readability only.

plant_name_ferc1

Name of the plant, as reported to FERC. This is a freeform string, not guaranteed to be consistent across references to the same plant.

record_id

Identifier indicating original FERC Form 1 source record. format: {table_name}_{report_year}_{report_prd}_{respondent_id}_{spplmnt_num}_{row_number}. Unique within FERC Form 1 DB tables which are not row-mapped.

asset_retirement_cost

Asset retirement cost (USD).

avg_num_employees

The average number of employees assigned to each plant.

capacity_factor

Fraction of potential generation that was actually reported for a plant part.
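
As a rough illustration of what "fraction of potential generation" means, the sketch below divides net generation by the energy a plant would produce running at nameplate capacity for every hour of the year. The function name is hypothetical, and both the choice of columns (net_generation_mwh and capacity_mw) and the leap-year handling are assumptions; the published values may be calculated differently.

```python
import calendar


def capacity_factor(net_generation_mwh, capacity_mw, report_year):
    """Net generation divided by potential generation at nameplate capacity.

    Returns None when generation is null or capacity is null or zero.
    """
    if net_generation_mwh is None or not capacity_mw:
        return None
    hours_in_year = 8784 if calendar.isleap(report_year) else 8760
    return net_generation_mwh / (capacity_mw * hours_in_year)


# Hypothetical plant: 100 MW generating 438,000 MWh in a non-leap year.
cf = capacity_factor(net_generation_mwh=438_000.0, capacity_mw=100.0, report_year=2019)
```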

capacity_mw

Total installed (nameplate) capacity, in megawatts.

capex_equipment_electric

Cost of plant: accessory electric equipment (USD).

capex_equipment_misc

Cost of plant: miscellaneous power plant equipment (USD).

capex_facilities

Cost of plant: reservoirs, dams, and waterways (USD).

capex_land

Cost of plant: land and land rights (USD).

capex_per_mw

Cost of plant per megawatt of installed (nameplate) capacity. Nominal USD.

capex_roads

Cost of plant: roads, railroads, and bridges (USD).

capex_structures

Cost of plant: structures and improvements (USD).

capex_total

Total cost of plant (USD).

capex_wheels_turbines_generators

Cost of plant: water wheels, turbines, and generators (USD).

construction_type

Type of plant construction ('outdoor', 'semioutdoor', or 'conventional'). Categorized by PUDL based on our best guess of intended value in FERC1 freeform strings.

construction_year

Year the plant's oldest still operational unit was built.

energy_used_for_pumping_mwh

Energy used for pumping, in megawatt-hours.

installation_year

Year the plant's most recently built unit was installed.

net_generation_mwh

Net electricity generation for the specified period in megawatt-hours (MWh).

net_load_mwh

Net output for load (net generation - energy used for pumping) in megawatt-hours.

opex_dams

Production expenses: maintenance of reservoirs, dams, and waterways (USD).

opex_electric

Production expenses: electric expenses (USD).

opex_engineering

Production expenses: maintenance, supervision, and engineering (USD).

opex_generation_misc

Production expenses: miscellaneous power generation expenses (USD).

opex_misc_plant

Production expenses: maintenance of miscellaneous hydraulic plant (USD).

opex_operations

Production expenses: operations, supervision, and engineering (USD).

opex_per_mwh

Total production expenses (USD per MWh generated).

opex_plant

Production expenses: maintenance of electric plant (USD).

opex_production_before_pumping

Total production expenses before pumping (USD).

opex_pumped_storage

Production expenses: pumped storage (USD).

opex_pumping

Production expenses: energy used for pumping (USD).

opex_rents

Production expenses: rents (USD).

opex_structures

Production expenses: maintenance of structures (USD).

opex_total

Total production expenses, excluding fuel (USD).

opex_total_nonfuel

Total production expenses, excluding fuel (USD).

opex_water_for_power

Production expenses: water for power (USD).

peak_demand_mw

Net peak demand for 60 minutes. Note: in some cases peak demand for other time periods may have been reported instead, if hourly peak demand was unavailable.

plant_capability_mw

Net plant capability in megawatts.

plant_hours_connected_while_generating

Hours the plant was connected to load while generating in the report year.

plant_id_pudl

A manually assigned PUDL plant ID. May not be constant over time.

project_num

FERC Licensed Project Number.

out_ferc1__yearly_purchased_power_and_exchanges_sched326

package: pudl

Annual time series of purchased power (Account 555) including power exchanges (transactions involving a balancing of debits and credits for energy, capacity, etc.) and any settlements for imbalanced exchanges.

Processing:

Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.

Source:

FERC Form 1 -- Annual Report of Major Electric Utilities (Schedule 326)

Primary key:

This table has no primary key. The primary key would be record_id; however, a small number of these values are duplicated in 2021, 2022, and 2024.
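
Before using record_id as a join key, it is worth listing the duplicated values. A minimal sketch, assuming the record_id column has been loaded into a plain Python list; the helper name and the sample identifiers are hypothetical.

```python
from collections import Counter


def duplicated_record_ids(record_ids):
    """Return a mapping of record_id -> count for IDs that occur more than once."""
    counts = Counter(record_ids)
    return {rid: n for rid, n in counts.items() if n > 1}


# Made-up identifiers, for illustration only.
sample_ids = ["rec_a", "rec_b", "rec_a", "rec_c"]
dupes = duplicated_record_ids(sample_ids)
```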

Usage Warnings

  • Contains columns which may appear categorical, but are actually free text.

  • FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.

Additional Details

This table has data about inter-utility power purchases. This includes how much electricity was purchased, how much it cost, and who it was purchased from. Unfortunately the field describing which other utility the power was being bought from (seller_name) is poorly standardized, making it difficult to correlate with other data.

Purchased Power is considered FERC Account 555 according to FERC's Uniform System of Accounts. Reported on pages 326-327 of FERC Form 1.

Columns
report_year

Four-digit year in which the data was reported.

utility_id_ferc1

PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.

utility_id_ferc1_dbf

FERC-assigned respondent_id from DBF reporting years, identifying the reporting entity. Stable from year to year.

utility_id_ferc1_xbrl

FERC-assigned entity_id from XBRL reporting years, identifying the reporting entity. Stable from year to year.

utility_id_pudl

A manually assigned PUDL utility ID. May not be stable over time.

utility_name_ferc1

Name of the responding utility, as it is reported in FERC Form 1. For human readability only.

seller_name

Name of the seller, or the other party in an exchange transaction.

record_id

Identifier indicating original FERC Form 1 source record. format: {table_name}_{report_year}_{report_prd}_{respondent_id}_{spplmnt_num}_{row_number}. Unique within FERC Form 1 DB tables which are not row-mapped.

billing_demand_mw

Monthly average billing demand (for requirements purchases, and any transactions involving demand charges). In megawatts.

coincident_peak_demand_mw

Average monthly coincident peak (CP) demand (for requirements purchases, and any transactions involving demand charges). Monthly CP demand is the metered demand during the hour (60-minute integration) in which the supplier's system reaches its monthly peak. In megawatts.

delivered_mwh

Gross megawatt-hours delivered in power exchanges and used as the basis for settlement.

demand_charges

Demand charges (USD).

energy_charges

Energy charges (USD).

non_coincident_peak_demand_mw

Average monthly non-coincident peak (NCP) demand (for requirements purchases, and any transactions involving demand charges). Monthly NCP demand is the maximum metered hourly (60-minute integration) demand in a month. In megawatts.

other_charges

Other charges, including out-of-period adjustments (USD).

purchase_type_code

Categorization based on the original contractual terms and conditions of the service. Must be one of 'requirements', 'long_firm', 'intermediate_firm', 'short_firm', 'long_unit', 'intermediate_unit', 'electricity_exchange', 'other_service', or 'adjustment'. Requirements service is ongoing high reliability service, with load integrated into system resource planning. 'Long term' means 5+ years. 'Intermediate term' is 1-5 years. 'Short term' is less than 1 year. 'Firm' means not interruptible for economic reasons. 'unit' indicates service from a particular designated generating unit. 'exchange' is an in-kind transaction.
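
The term lengths in this description can be written as a small lookup. This is a sketch under stated assumptions: the function name term_category is hypothetical, and because the description lists "5+ years" as long and "1-5 years" as intermediate, a contract of exactly 5 years is treated here as long.

```python
def term_category(contract_years):
    """Map a contract length in years to 'short', 'intermediate' or 'long'.

    Boundary handling is an assumption: exactly 5 years counts as long and
    exactly 1 year counts as intermediate.
    """
    if contract_years < 0:
        raise ValueError("contract length cannot be negative")
    if contract_years >= 5:
        return "long"
    if contract_years >= 1:
        return "intermediate"
    return "short"


categories = [term_category(years) for years in (0.5, 1, 3, 5, 20)]
```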

purchased_mwh

Megawatt-hours shown on bills rendered to the respondent. Includes both electricity purchased for storage and non-storage purposes, which were lumped together prior to 2021.

purchased_storage_mwh

Number of megawatt hours purchased during the period for energy storage.

purchased_other_than_storage_mwh

Number of megawatt hours purchased during the period for other than energy storage.

received_mwh

Gross megawatt-hours received in power exchanges and used as the basis for settlement.

tariff

FERC Rate Schedule Number or Tariff. (Note: may be incomplete if originally reported on multiple lines.)

total_settlement

Sum of demand, energy, and other charges (USD). For power exchanges, the settlement amount for the net receipt of energy. If more energy was delivered than received, this amount is negative.
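
For non-exchange transactions, the description above implies a simple consistency check against the three charge columns. A minimal sketch, assuming nulls should count as zero unless all three charges are null; the function name and sample values are hypothetical, and exchange settlements follow the separate net-receipt rule and are not covered here.

```python
def settlement_from_charges(demand_charges, energy_charges, other_charges):
    """Sum the three charge columns, treating individual nulls as zero.

    Returns None when every charge is null, so that missing data is not
    silently reported as a zero settlement.
    """
    charges = (demand_charges, energy_charges, other_charges)
    if all(c is None for c in charges):
        return None
    return sum(c or 0.0 for c in charges)


# Made-up charges in USD.
total = settlement_from_charges(1200.0, 3400.5, None)
```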

out_ferc1__yearly_rate_base

package: pudl

Annual time series of granular accounting data consisting of what utilities can typically include in their rate bases.

Processing:

Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.

Source:

FERC Form 1 -- Annual Report of Major Electric Utilities (Schedules 110, 118, 200, 204, 219 and 320)

Primary key:

This table has no primary key.

Usage Warnings

  • The data from these xbrl_factoid tables contains nested totals and subtotals - making aggregations difficult. We used FERC 1's reported calculations to determine the most granular, non-duplicative records. The reported calculations are not always perfect - we've corrected many of them, but errors could still exist.

  • FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.

Additional Details

This table is derived from seven FERC Form 1 accounting tables with nested calculations. We reconciled these nested calculations and then identified the most granular data across the tables. Here are the three direct upstream inputs - the two detailed tables have several core_ferc1 inputs each:

  • out_ferc1__yearly_detailed_balance_sheet_assets

  • out_ferc1__yearly_detailed_balance_sheet_liabilities

  • core_ferc1__yearly_operating_expenses_sched320

We applied slight modifications to three columns (utility_type, plant_function and plant_status) relative to the originally reported values in our core tables. The modifications either provide more specificity (e.g. we converted some total utility_type values into electric) or condense similar categories for easier analysis (e.g. creating a hydraulic_production plant_function by combining hydraulic_production_conventional and hydraulic_production_pumped_storage). See pudl.output.ferc1.Exploder for more details. This table was made entirely with support and direction from RMI.
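
The category condensing mentioned above amounts to a many-to-one mapping. The sketch below shows only the single hydraulic example named in the text; the dictionary and function names are hypothetical, and the actual logic in pudl.output.ferc1.Exploder may cover more categories or work differently.

```python
# Only the one mapping named in the text; other categories pass through unchanged.
PLANT_FUNCTION_CONDENSED = {
    "hydraulic_production_conventional": "hydraulic_production",
    "hydraulic_production_pumped_storage": "hydraulic_production",
}


def condense_plant_function(plant_function):
    """Return the condensed category, or the input if no mapping applies."""
    return PLANT_FUNCTION_CONDENSED.get(plant_function, plant_function)


condensed = [
    condense_plant_function(value)
    for value in (
        "hydraulic_production_conventional",
        "hydraulic_production_pumped_storage",
        "transmission",
    )
]
```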

Columns
report_year

Four-digit year in which the data was reported.

utility_id_ferc1

PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.

utility_id_ferc1_dbf

FERC-assigned respondent_id from DBF reporting years, identifying the reporting entity. Stable from year to year.

utility_id_ferc1_xbrl

FERC-assigned entity_id from XBRL reporting years, identifying the reporting entity. Stable from year to year.

utility_id_pudl

A manually assigned PUDL utility ID. May not be stable over time.

utility_name_ferc1

Name of the responding utility, as it is reported in FERC Form 1. For human readability only.

utility_type

Listing of utility plant types.

plant_function

Functional role played by utility plant (steam production, nuclear production, distribution, transmission, etc.).

plant_status

Utility plant financial status (in service, future, leased, total).

xbrl_factoid

The name of the type of value, derived from the XBRL fact name.

ending_balance

Account balance at end of year.

utility_type_other

Freeform description of the type of utility reported in one of the three other utility_type sections in the core_ferc1__yearly_utility_plant_summary_sched200 table. This field is reported only in the DBF reporting years (1994-2020).

rate_base_category

A category of asset or liability that RMI compiled to use as a shorthand for various types of utility assets. These tags were compiled manually based on the xbrl_factoid and sometimes vary based on the utility_type, plant_function or plant_status as well.

ferc_account

Actual FERC Account number (e.g. '359.1') if available, or a PUDL assigned ID when FERC accounts have been split or combined in reporting.

row_type_xbrl

Indicates whether the value reported in the row is calculated, or uniquely reported within the table.

record_id

Identifier indicating original FERC Form 1 source record. format: {table_name}_{report_year}_{report_prd}_{respondent_id}_{spplmnt_num}_{row_number}. Unique within FERC Form 1 DB tables which are not row-mapped.

is_disaggregated_utility_type

Indicates whether or not records with null or total values in the utility_type column were disaggregated. See documentation for process: pudl.output.ferc1.disaggregate_null_or_total_tag

is_disaggregated_in_rate_base

Indicates whether or not records with null values in the in_rate_base column were disaggregated. See documentation for process: pudl.output.ferc1.disaggregate_null_or_total_tag

table_name

The name of the PUDL database table from which a given record originated.

out_ferc1__yearly_retained_earnings_sched118

package: pudl

Annual time series of utilities' statements of retained earnings.

Processing:

Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.

Source:

FERC Form 1 -- Annual Report of Major Electric Utilities (Schedule 118)

Primary key:

utility_id_ferc1, report_year, earnings_type

Usage Warnings

  • Some columns contain subtotals; use caution when choosing columns to aggregate.

  • FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.

Columns
report_year

Four-digit year in which the data was reported.

utility_id_ferc1

PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.

utility_id_ferc1_dbf

FERC-assigned respondent_id from DBF reporting years, identifying the reporting entity. Stable from year to year.

utility_id_ferc1_xbrl

FERC-assigned entity_id from XBRL reporting years, identifying the reporting entity. Stable from year to year.

utility_id_pudl

A manually assigned PUDL utility ID. May not be stable over time.

utility_name_ferc1

Name of the responding utility, as it is reported in FERC Form 1. For human readability only.

record_id

Identifier indicating original FERC Form 1 source record. format: {table_name}_{report_year}_{report_prd}_{respondent_id}_{spplmnt_num}_{row_number}. Unique within FERC Form 1 DB tables which are not row-mapped.

earnings_type

Label describing types of earnings.

starting_balance

Account balance at beginning of year.

ending_balance

Account balance at end of year.

balance

Indication of whether a column is a credit or debit, as reported in the XBRL taxonomy.

ferc_account

Actual FERC Account number (e.g. '359.1') if available, or a PUDL assigned ID when FERC accounts have been split or combined in reporting.

row_type_xbrl

Indicates whether the value reported in the row is calculated, or uniquely reported within the table.

utility_type

Listing of utility plant types. Examples include Electric Utility, Gas Utility, and Other Utility.

out_ferc1__yearly_sales_by_rate_schedules_sched304

package: pudl

Annual time series of utilities' electric sales from all rate schedules in effect throughout the year.

Processing:

Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.

Source:

FERC Form 1 -- Annual Report of Major Electric Utilities (Schedule 304)

Primary key:

This table has no primary key.

Usage Warnings

  • Some columns contain subtotals; use caution when choosing columns to aggregate.

  • Values in rate_schedule_description are free-form strings.

  • Data prior to 2021 does not include information in columns: rate_schedule_type and billing_status.

  • Units of revenue_per_kwh are suspected to include a mix of dollars and possibly cents.

  • FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.

Additional Details

The pre-2021 data in this table (extracted from the FoxPro database rather than XBRL) is extremely unstructured. The post-2020 data (from XBRL) sorts records into rate schedule types (residential, industrial, commercial, public_lighting, public_authorities, railroads, interdepartmental, provision_for_rate_refund, commercial_and_industrial, total) and billing statuses (billed, unbilled, total), whereas the pre-2021 data stuffs all of that information (if you're lucky) into the rate_schedule_description column. There's no point trying to parse the pre-2021 rate_schedule_description column en masse because it's just too messy. Its contents often include numbers and acronyms that have little to no meaning out of context. The table is structured somewhat like the FERC1 small generators table, with headings about rate structure type also embedded in the rate_schedule_description column. To all who dare, beware.

This table is a combination of one pre-2021 (DBF) table and nine post-2020 (XBRL) tables--one for each rate schedule type plus totals--hence the increase in data clarity post-2020. The rate_schedule_type and billing_status columns are only relevant for post-2020 data, as they can be reliably parsed from each of the tables and incorporated into columns. The rate_schedule_description is supposed to contain sub-rate_schedule_type names for charges (e.g. Residential 1, Residential 2, etc.). However, the pre-2021 data contains a little bit of everything (or nothing) and the post-2020 data has some totals or wonky data thrown in. That is to say, even when working with post-2020 data, be wary of aggregating the data. That's what the "total" rows are for.

The values that come from the totals table are marked with the string "total" in the rate_schedule_description column. The totals table is a product of the transition to XBRL, so these distinguishable totals are only available for data post-2020 (otherwise you could try keyword searching for "total" in rate_schedule_description). The total table contains two types of totals: the utility totals across all rate schedules in a given year (marked with rate_schedule_description = "total" and rate_schedule_type = "total") and each of the utility's individual rate schedule totals in a given year (marked with rate_schedule_description = "total" and rate_schedule_type = "residential" or any other rate schedule type).
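
The two kinds of totals described above can be told apart with a small classifier. A minimal sketch, assuming post-2020 rows are available as dictionaries keyed by column name; the function name, the returned labels and the sample rows are hypothetical.

```python
def classify_total_row(row):
    """Label a post-2020 row as a utility total, a rate schedule total, or neither."""
    if row.get("rate_schedule_description") != "total":
        return "not_total"
    if row.get("rate_schedule_type") == "total":
        return "utility_total"
    return "rate_schedule_total"


# Made-up rows, for illustration only.
rows = [
    {"rate_schedule_type": "total", "rate_schedule_description": "total"},
    {"rate_schedule_type": "residential", "rate_schedule_description": "total"},
    {"rate_schedule_type": "residential", "rate_schedule_description": "Residential 1"},
]
labels = [classify_total_row(row) for row in rows]
```

Dropping every row that is not labeled "not_total" before summing is one way to avoid double counting, though it will not catch the unlabeled totals in pre-2021 data.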

The rate schedule based XBRL tables only report billed values, whereas the total tables report billed, unbilled, and total values. (See the billing_status column description for more info on the difference between billed and unbilled.) This is important to consider if you're endeavoring to compare the subtotal values with the total values. We have not attempted to fix or verify any subtotals or totals that don't add up.

Another important note is the possibility of unit discrepancies in certain columns. The revenue_per_kwh column does not specify reporting units, and closer inspection of the data reveals two clear peaks approximately two orders of magnitude apart. This indicates that values may be reported in both dollars and cents. However, because the price of energy per kWh varies so much regionally, we cannot guarantee which is which and have not put any cleaning mechanisms in place to account for this.
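
One way to see the two peaks for yourself is to bucket revenue_per_kwh by order of magnitude. This sketch only inspects the distribution and does not attempt any unit correction; the function name and the sample values are hypothetical.

```python
import math
from collections import Counter


def magnitude_histogram(values):
    """Count positive values by their order of magnitude, floor(log10(value)).

    Nulls, zeros and negative values are skipped.
    """
    counts = Counter()
    for v in values:
        if v is None or v <= 0:
            continue
        counts[math.floor(math.log10(v))] += 1
    return dict(sorted(counts.items()))


# Made-up values: some plausibly in dollars per kWh, some plausibly in cents.
sample = [0.08, 0.12, 0.095, 8.1, 11.5, 9.7, None, 0.0]
hist = magnitude_histogram(sample)
```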

Columns
report_year

Four-digit year in which the data was reported.

utility_id_ferc1

PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.

utility_id_ferc1_dbf

FERC-assigned respondent_id from DBF reporting years, identifying the reporting entity. Stable from year to year.

utility_id_ferc1_xbrl

FERC-assigned entity_id from XBRL reporting years, identifying the reporting entity. Stable from year to year.

utility_id_pudl

A manually assigned PUDL utility ID. May not be stable over time.

utility_name_ferc1

Name of the responding utility, as it is reported in FERC Form 1. For human readability only.

rate_schedule_type

Categorization of rate schedule type.

billing_status

Whether an amount is billed, unbilled, or both. Billed amounts pertain to the exchange of energy and unbilled amounts pertain to other sources of revenue such as contracts with peaker plants to keep them on standby or charging rent to host cell antennas on transmission towers.

rate_schedule_description

Free-form description of what the rate schedule name is. Not standardized. Often a sub-category of rate_schedule_type.

sales_mwh

Quantity of electricity sold in MWh.

dollar_value

Dollar value of reported income, expense, asset, or liability.

avg_customers_per_month

Average number of customers per month.

kwh_per_customer

kWh per customer.

revenue_per_kwh

The amount of revenue per kWh by rate schedule acquired in the given report year.

record_id

Identifier indicating original FERC Form 1 source record. format: {table_name}_{report_year}_{report_prd}_{respondent_id}_{spplmnt_num}_{row_number}. Unique within FERC Form 1 DB tables which are not row-mapped.

out_ferc1__yearly_small_plants_sched410

package: pudl

Annual time series of plant statistics for internal combustion plants, gas turbine-plants, conventional hydro plants, and pumped storage plants with less than 10 MW installed nameplate capacity and steam plants with less than 25 MW installed nameplate capacity.

Processing:

Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.

Source:

FERC Form 1 -- Annual Report of Major Electric Utilities (Schedule 410)

Primary key:

record_id

The best approximation of a natural primary key for this table would be: report_year, utility_id_ferc1, plant_name_ferc1. FERC does not publish plant IDs. The main identifying column is plant_name_ferc1, but that is a free-form string field and there are duplicate records.

Usage Warnings

  • FERC does not require respondents to report unique, non-duplicative plant records. Respondents sporadically report portions of a plant and then the plant total (e.g. unit 1, unit 2 and total). Use caution when aggregating.

  • FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.

Additional Details

As reported on FERC Form 1 Schedule 410 (pages 410-411) and extracted from the FERC Visual FoxPro and XBRL sources. See pudl.extract.ferc1.TABLE_NAME_MAP_FERC1 for links to the raw tables.

The raw version of this table is more like a digitized PDF than an actual data table. The rows contain lots of information in addition to what the columns might suggest. For instance, a single column may contain header rows, note rows, and total rows. This extraneous information is useful, but it prevents proper analysis when mixed in with the rest of the values in the column. We employ a couple of data transformations to extract these rows from the data and preserve some of the information they contain (fuel type, plant type, FERC license, or general notes about the plant) in separate columns.

Columns
report_year

Four-digit year in which the data was reported.

utility_id_ferc1

PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.

utility_id_ferc1_dbf

FERC-assigned respondent_id from DBF reporting years, identifying the reporting entity. Stable from year to year.

utility_id_ferc1_xbrl

FERC-assigned entity_id from XBRL reporting years, identifying the reporting entity. Stable from year to year.

utility_id_pudl

A manually assigned PUDL utility ID. May not be stable over time.

utility_name_ferc1

Name of the responding utility, as it is reported in FERC Form 1. For human readability only.

plant_id_pudl

A manually assigned PUDL plant ID. May not be constant over time.

plant_name_ferc1

Name of the plant, as reported to FERC. This is a freeform string, not guaranteed to be consistent across references to the same plant.

record_id

Identifier indicating original FERC Form 1 source record. format: {table_name}_{report_year}_{report_prd}_{respondent_id}_{spplmnt_num}_{row_number}. Unique within FERC Form 1 DB tables which are not row-mapped.

capacity_mw

Total installed (nameplate) capacity, in megawatts.

capex_per_mw

Cost of plant per megawatt of installed (nameplate) capacity. Nominal USD.

capex_total

Total cost of plant (USD).

construction_year

Year the plant's oldest still operational unit was built.

fuel_cost_per_mmbtu

Average fuel cost per mmBTU of heat content in nominal USD.

fuel_type

Type of fuel.

license_id_ferc1

FERC issued operating license ID for the facility, if available. This value is extracted from the original plant name where possible.

net_generation_mwh

Net electricity generation for the specified period in megawatt-hours (MWh).

opex_fuel

Production expenses: fuel (USD).

opex_maintenance

Production expenses: Maintenance (USD).

opex_operations

Production expenses: operations, supervision, and engineering (USD).

opex_total

Total production expenses, excluding fuel (USD).

opex_total_nonfuel

Total production expenses, excluding fuel (USD).

peak_demand_mw

Net peak demand for 60 minutes. Note: in some cases peak demand for other time periods may have been reported instead, if hourly peak demand was unavailable.

plant_type

Type of plant.

out_ferc1__yearly_steam_plants_fuel_by_plant_sched402

package: pudl

Annual time series of FERC fuel data by plant.

Processing:

Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.

Source:

FERC Form 1 -- Annual Report of Major Electric Utilities (Schedule 402)

Primary key:

report_year, utility_id_ferc1, plant_name_ferc1

Usage Warnings

  • Some columns contain subtotals; use caution when choosing columns to aggregate.

  • FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.

Columns
report_year

Four-digit year in which the data was reported.

utility_id_ferc1

PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.

utility_id_ferc1_dbf

FERC-assigned respondent_id from DBF reporting years, identifying the reporting entity. Stable from year to year.

utility_id_ferc1_xbrl

FERC-assigned entity_id from XBRL reporting years, identifying the reporting entity. Stable from year to year.

utility_id_pudl

A manually assigned PUDL utility ID. May not be stable over time.

utility_name_ferc1

Name of the responding utility, as it is reported in FERC Form 1. For human readability only.

plant_id_pudl

A manually assigned PUDL plant ID. May not be constant over time.

plant_name_ferc1

Name of the plant, as reported to FERC. This is a freeform string, not guaranteed to be consistent across references to the same plant.

coal_fraction_cost

Coal cost as a percentage of overall fuel cost.

coal_fraction_mmbtu

Coal heat content as a percentage of overall fuel heat content (MMBtu).

fuel_cost

Total fuel cost for plant (in USD).

fuel_mmbtu

Total heat content for plant (in MMBtu).

gas_fraction_cost

Natural gas cost as a percentage of overall fuel cost.

gas_fraction_mmbtu

Natural gas heat content as a percentage of overall fuel heat content (MMBtu).

nuclear_fraction_cost

Nuclear cost as a percentage of overall fuel cost.

nuclear_fraction_mmbtu

Nuclear heat content as a percentage of overall fuel heat content (MMBtu).

oil_fraction_cost

Oil cost as a percentage of overall fuel cost.

oil_fraction_mmbtu

Oil heat content as a percentage of overall fuel heat content (MMBtu).

primary_fuel_by_cost

Primary fuel for plant as a percentage of cost.

primary_fuel_by_mmbtu

Primary fuel for plant as a percentage of heat content.

waste_fraction_cost

Waste-heat cost as a percentage of overall fuel cost.

waste_fraction_mmbtu

Waste-heat heat content as a percentage of overall fuel heat content (MMBtu).
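
The fraction columns above can be read as each fuel's share of a plant-level total. The following is a minimal sketch of that arithmetic, assuming a hypothetical dictionary of per-fuel costs for one plant-year. The rule used here to pick a primary fuel (largest share) is an assumption for illustration and may not match how PUDL derives primary_fuel_by_cost.

```python
def fuel_fractions(cost_by_fuel):
    """Return each fuel's share of the total, or an empty dict if the total is zero."""
    total = sum(cost_by_fuel.values())
    if total == 0:
        return {}
    return {fuel: cost / total for fuel, cost in cost_by_fuel.items()}


def primary_fuel(fractions):
    """Illustrative rule only: the fuel with the largest share, or None if no shares."""
    if not fractions:
        return None
    return max(fractions, key=fractions.get)


# Hypothetical per-fuel costs (USD) for a single plant-year.
costs = {"coal": 750_000.0, "gas": 200_000.0, "oil": 50_000.0}
fractions = fuel_fractions(costs)
top_fuel = primary_fuel(fractions)
```

The same function applies to heat content by passing per-fuel MMBtu values instead of costs.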

out_ferc1__yearly_steam_plants_fuel_sched402

package: pudl

Annual time series of fuel cost and quantity for steam plants with a capacity of 25+ MW, internal combustion and gas-turbine plants of 10+ MW, and all nuclear plants.

Processing:

Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.

Source:

FERC Form 1 -- Annual Report of Major Electric Utilities (Schedule 402)

Primary key:

record_id

Usage Warnings

  • The fuel_type_code_pudl is inferred from a free-form string field.

  • FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.

Additional Details

This table is a subset of the steam plant table reported on page 402 of FERC Form 1.

Columns
report_year

Four-digit year in which the data was reported.

utility_id_ferc1

PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.

utility_id_ferc1_dbf

FERC-assigned respondent_id from DBF reporting years, identifying the reporting entity. Stable from year to year.

utility_id_ferc1_xbrl

FERC-assigned entity_id from XBRL reporting years, identifying the reporting entity. Stable from year to year.

utility_id_pudl

A manually assigned PUDL utility ID. May not be stable over time.

utility_name_ferc1

Name of the responding utility, as it is reported in FERC Form 1. For human readability only.

plant_id_pudl

A manually assigned PUDL plant ID. May not be constant over time.

plant_name_ferc1

Name of the plant, as reported to FERC. This is a freeform string, not guaranteed to be consistent across references to the same plant.

fuel_consumed_mmbtu

Total consumption of fuel in physical unit, year to date. Note: this is the total quantity consumed for both electricity and, in the case of combined heat and power plants, process steam production.

fuel_consumed_total_cost

Total cost of consumed fuel.

fuel_consumed_units

Consumption of the fuel type in physical unit. Note: this is the total quantity consumed for both electricity and, in the case of combined heat and power plants, process steam production.

fuel_cost_per_mmbtu

Average fuel cost per MMBtu of heat content in nominal USD.

fuel_cost_per_unit_burned

Average cost of fuel consumed in the report year per reported fuel unit (USD).

fuel_cost_per_unit_delivered

Average cost of fuel delivered in the report year per reported fuel unit (USD).

fuel_mmbtu_per_unit

Heat content of the fuel in millions of Btus per physical unit.

fuel_type_code_pudl

Simplified fuel type code used in PUDL

fuel_units

Reported unit of measure for fuel.

record_id

Identifier indicating original FERC Form 1 source record. format: {table_name}_{report_year}_{report_prd}_{respondent_id}_{spplmnt_num}_{row_number}. Unique within FERC Form 1 DB tables which are not row-mapped.
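
Because table names themselves contain underscores, a record_id in the format above is easiest to split from the right. The following is a minimal sketch, assuming the ID follows the stated six-part layout exactly. The example ID and the RecordId type are hypothetical, and IDs in some tables may not follow this layout.

```python
from typing import NamedTuple


class RecordId(NamedTuple):
    table_name: str
    report_year: int
    report_prd: str
    respondent_id: str
    spplmnt_num: str
    row_number: str


def parse_record_id(record_id: str) -> RecordId:
    """Split from the right so underscores inside table_name are preserved."""
    parts = record_id.rsplit("_", 5)
    if len(parts) != 6:
        raise ValueError(f"unexpected record_id layout: {record_id!r}")
    table_name, year, prd, respondent, spplmnt, row = parts
    return RecordId(table_name, int(year), prd, respondent, spplmnt, row)


# Hypothetical ID, for illustration only.
parsed = parse_record_id("f1_fuel_2005_12_210_0_3")
```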

out_ferc1__yearly_steam_plants_sched402

package: pudl

Annual time series of plant statistics for steam plants with a capacity of 25+ MW, internal combustion and gas-turbine plants of 10+ MW, and all nuclear plants.

Processing:

Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.

Source:

FERC Form 1 -- Annual Report of Major Electric Utilities (Schedule 402)

Primary key:

record_id

The best approximation of a natural primary key for this table would be: report_year, utility_id_ferc1, plant_name_ferc1. FERC does not publish plant IDs. The main identifying column is plant_name_ferc1, but that is a free-form string field and there are duplicate records.

Usage Warnings

  • FERC does not restrict respondents to report unique and non-duplicative plant records. There are sporadic instances of respondents reporting portions of plants and then the total plant (ex: unit 1, unit 2 and total). Use caution when aggregating.

  • FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.

Columns
report_year

Four-digit year in which the data was reported.

utility_id_ferc1

PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.

utility_id_ferc1_dbf

FERC-assigned respondent_id from DBF reporting years, identifying the reporting entity. Stable from year to year.

utility_id_ferc1_xbrl

FERC-assigned entity_id from XBRL reporting years, identifying the reporting entity. Stable from year to year.

utility_id_pudl

A manually assigned PUDL utility ID. May not be stable over time.

utility_name_ferc1

Name of the responding utility, as it is reported in FERC Form 1. For human readability only.

plant_id_pudl

A manually assigned PUDL plant ID. May not be constant over time.

plant_id_ferc1

Algorithmically assigned PUDL FERC Plant ID. WARNING: NOT STABLE BETWEEN PUDL DB INITIALIZATIONS.

plant_name_ferc1

Name of the plant, as reported to FERC. This is a freeform string, not guaranteed to be consistent across references to the same plant.

asset_retirement_cost

Asset retirement cost (USD).

avg_num_employees

The average number of employees assigned to each plant.

capacity_factor

Fraction of potential generation that was actually reported for a plant part.

capacity_mw

Total installed (nameplate) capacity, in megawatts.

capex_annual_addition

Annual capital addition into capex_total.

capex_annual_addition_rolling

Year-to-date capital addition into capex_total.

capex_annual_per_kw

Annual capital addition into capex_total per kW.

capex_annual_per_mw

Annual capital addition into capex_total per MW.

capex_annual_per_mw_rolling

Year-to-date capital addition into capex_total per MW.

capex_annual_per_mwh

Annual capital addition into capex_total per MWh.

capex_annual_per_mwh_rolling

Year-to-date capital addition into capex_total per MWh.

capex_equipment

Cost of plant: equipment (USD).

capex_land

Cost of plant: land and land rights (USD).

capex_per_mw

Cost of plant per megawatt of installed (nameplate) capacity. Nominal USD.

capex_structures

Cost of plant: structures and improvements (USD).

capex_total

Total cost of plant (USD).

capex_wo_retirement_total

Total cost of plant (USD) without retirements.

construction_type

Type of plant construction ('outdoor', 'semioutdoor', or 'conventional'). Categorized by PUDL based on our best guess of intended value in FERC1 freeform strings.

construction_year

Year the plant's oldest still operational unit was built.

installation_year

Year the plant's most recently built unit was installed.

net_generation_mwh

Net electricity generation for the specified period in megawatt-hours (MWh).

not_water_limited_capacity_mw

Plant capacity in MW when not limited by condenser water.

opex_allowances

Allowances.

opex_boiler

Maintenance of boiler (or reactor) plant.

opex_coolants

Cost of coolants and water (nuclear plants only)

opex_electric

Production expenses: electric expenses (USD).

opex_engineering

Production expenses: maintenance, supervision, and engineering (USD).

opex_fuel

Production expenses: fuel (USD).

opex_fuel_per_mwh

Production expenses: fuel (USD) per megawatt-hour (MWh).

opex_misc_power

Miscellaneous steam (or nuclear) expenses.

opex_misc_steam

Maintenance of miscellaneous steam (or nuclear) plant.

opex_nonfuel_per_mwh

Investments in non-fuel production expenses per MWh.

opex_operations

Production expenses: operations, supervision, and engineering (USD).

opex_per_mwh

Total production expenses (USD per MWh generated).

opex_plants

Maintenance of electrical plant.

opex_production_total

Total operating expenses.

opex_rents

Production expenses: rents (USD).

opex_steam

Steam expenses.

opex_steam_other

Steam from other sources.

opex_structures

Production expenses: maintenance of structures (USD).

opex_total_nonfuel

Total production expenses, excluding fuel (USD).

opex_transfer

Steam transferred (Credit).

peak_demand_mw

Net peak demand for 60 minutes. Note: in some cases peak demand for other time periods may have been reported instead, if hourly peak demand was unavailable.

plant_capability_mw

Net plant capability in megawatts.

plant_hours_connected_while_generating

Hours the plant was connected to load while generating in the report year.

plant_type

Type of plant.

record_id

Identifier indicating original FERC Form 1 source record. format: {table_name}_{report_year}_{report_prd}_{respondent_id}_{spplmnt_num}_{row_number}. Unique within FERC Form 1 DB tables which are not row-mapped.

water_limited_capacity_mw

Plant capacity in MW when limited by condenser water.
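
As a rough check on the capacity_factor column above, the usual definition divides net generation by the generation that would result from running at nameplate capacity for every hour of the year. The following is a minimal sketch under that assumption. The function name and sample values are hypothetical, and PUDL's own calculation may treat leap years or missing values differently.

```python
import calendar


def capacity_factor(net_generation_mwh, capacity_mw, report_year):
    """Net generation divided by nameplate capacity times hours in the report year."""
    if not capacity_mw:
        return None  # undefined when capacity is zero or missing
    hours = 8784 if calendar.isleap(report_year) else 8760
    return net_generation_mwh / (capacity_mw * hours)


# Hypothetical plant: 100 MW nameplate, 438,000 MWh generated in 2019.
cf = capacity_factor(net_generation_mwh=438_000.0, capacity_mw=100.0, report_year=2019)
```

Values well above 1 or below 0 usually point to a reporting problem, such as a total row being mixed with unit-level rows.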

out_ferc1__yearly_transmission_lines_sched422

package: pudl

Annual time series of statistics about transmission lines.

Processing:

Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.

Source:

FERC Form 1 -- Annual Report of Major Electric Utilities (Schedule 422)

Primary key:

This table has no primary key. Each record of this table is supposed to represent one stretch of a transmission line, but there are no IDs and many nulls in the fields which would nominally distinguish unique transmission lines.

Usage Warnings

  • Contains columns which may appear categorical, but are actually free text.

  • Some columns contain subtotals; use caution when choosing columns to aggregate.

  • FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.

Additional Details

Information describing transmission lines, the cost of lines, annual operating and capital expenses, etc. This table includes transmission lines having a nominal voltage of 132 kilovolts or greater. Transmission lines below this voltage are required to be reported in group totals only for each voltage.

Columns
record_id

Identifier indicating original FERC Form 1 source record. format: {table_name}_{report_year}_{report_prd}_{respondent_id}_{spplmnt_num}_{row_number}. Unique within FERC Form 1 DB tables which are not row-mapped.

report_year

Four-digit year in which the data was reported.

utility_id_ferc1

PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.

utility_id_ferc1_dbf

FERC-assigned respondent_id from DBF reporting years, identifying the reporting entity. Stable from year to year.

utility_id_ferc1_xbrl

FERC-assigned entity_id from XBRL reporting years, identifying the reporting entity. Stable from year to year.

utility_id_pudl

A manually assigned PUDL utility ID. May not be stable over time.

utility_name_ferc1

Name of the responding utility, as it is reported in FERC Form 1. For human readability only.

start_point

The starting point of a transmission line.

end_point

The end point of a transmission line.

operating_voltage_kv

The operating voltage, expressed in kilovolts, for three-phase 60-cycle alternating current transmission lines.

designed_voltage_kv

Manufactured (designed) voltage, expressed in kilovolts, for three-phase 60-cycle alternating current transmission lines.

supporting_structure_type

Supporting structure of the transmission line.

transmission_line_length_miles

Length, in pole miles (or circuit miles if the transmission lines are underground), for lines that are stand-alone structures (whose costs are reported on a stand-alone basis).

transmission_line_and_structures_length_miles

Length, in pole miles (or circuit miles if the transmission lines are underground), for lines that are aggregated with other lines or structures (whose costs are aggregated and combined with other structures).

num_transmission_circuits

Number of circuits in a transmission line.

conductor_size_and_material

Size of transmission conductor and material of the transmission line.

capex_land

Cost of plant: land and land rights (USD).

capex_other

Other costs associated with the plant (USD).

capex_total

Total cost of plant (USD).

opex_operations

Production expenses: operations, supervision, and engineering (USD).

opex_maintenance

Production expenses: Maintenance (USD).

opex_rents

Production expenses: rents (USD).

opex_total

Total production expenses, excluding fuel (USD).

out_ferc1__yearly_utility_plant_summary_sched200

package: pudl

Annual time series of utility plant and accumulated provisions for depreciation, amortization and depletion of utility plant assets.

Processing:

Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.

Source:

FERC Form 1 -- Annual Report of Major Electric Utilities (Schedule 200)

Primary key:

utility_id_ferc1, report_year, utility_type, utility_plant_asset_type

Usage Warnings

  • Some columns contain subtotals; use caution when choosing columns to aggregate.

  • FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.

Columns
report_year

Four-digit year in which the data was reported.

utility_id_ferc1

PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.

utility_id_ferc1_dbf

FERC-assigned respondent_id from DBF reporting years, identifying the reporting entity. Stable from year to year.

utility_id_ferc1_xbrl

FERC-assigned entity_id from XBRL reporting years, identifying the reporting entity. Stable from year to year.

utility_id_pudl

A manually assigned PUDL utility ID. May not be stable over time.

utility_name_ferc1

Name of the responding utility, as it is reported in FERC Form 1. For human readability only.

utility_type

Listing of utility plant types. Examples include Electric Utility, Gas Utility, and Other Utility.

utility_type_other

Freeform description of the type of utility reported in one of the three other utility_type sections in the core_ferc1__yearly_utility_plant_summary_sched200 table. This field is reported only in the DBF reporting years (1994-2020).

utility_plant_asset_type

Type of utility plant asset reported in the core_ferc1__yearly_utility_plant_summary_sched200 table. Assets include those leased to others, held for future use, construction work-in-progress and details of accumulated depreciation.

row_type_xbrl

Indicates whether the value reported in the row is calculated, or uniquely reported within the table.

ending_balance

Account balance at end of year.

record_id

Identifier indicating original FERC Form 1 source record. format: {table_name}_{report_year}_{report_prd}_{respondent_id}_{spplmnt_num}_{row_number}. Unique within FERC Form 1 DB tables which are not row-mapped.

out_ferc714__georeferenced_respondents

package: pudl

Processing:

Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.

Source:

FERC Form 714 -- Annual Electric Balancing Authority Area and Planning Area Report

Primary key:

respondent_id_ferc714, report_date

Usage Warnings

  • Contains information from multiple raw inputs.

  • FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.

Additional Details

An annual summary of demand and other information about FERC-714 respondents.

This table differs from out_ferc714__summarized_demand in that it also includes a geometry column describing the respondent's service territory in each year. These service territories are based on the counties that the corresponding EIA-861 respondent reported serving in that year. There is sometimes ambiguity as to whether a FERC-714 respondent should be interpreted as an individual utility or a balancing authority. The respondent_type column indicates which type of entity has been assumed in determining the service territory from EIA-861 data.

Columns
report_date

Date reported.

respondent_id_ferc714

PUDL-assigned ID identifying a respondent to FERC Form 714. This ID associates natively reported respondent IDs from the original CSV and XBRL data sources.

geometry

Geospatial representation of the feature.

eia_code

EIA utility or balancing area authority ID associated with this FERC Form 714 respondent. Note that many utilities are also balancing authorities and in many cases EIA uses the same integer ID to identify a utility in its role as a balancing authority AND as a utility, but there is no requirement that these IDs be the same, and in a number of cases they are different.

respondent_type

Whether a respondent to the FERC form 714 is a utility or a balancing authority.

respondent_name_ferc714

Name of the utility, balancing area authority, or planning authority responding to FERC Form 714.

balancing_authority_id_eia

EIA balancing authority ID. This is often (but not always!) the same as the utility ID associated with the same legal entity.

balancing_authority_code_eia

EIA short code identifying a balancing authority. May include Canadian and Mexican BAs.

balancing_authority_name_eia

Name of the balancing authority.

utility_id_eia

The EIA Utility Identification number.

utility_name_eia

The name of the utility.

population

County population, sourced from Census DP1 data.

area_km2

County area in km2.

demand_annual_mwh

Annual electricity demand in a given report year.

out_ferc714__hourly_estimated_state_demand

package: pudl

Hourly time series of estimated electricity demand for each state, scaled such that it matches the total electricity sales by state reported in EIA 861.

Processing:

Data is ready for use in analyses, but for practical reasons has not been denormalized and remains in narrow format.

Source:

FERC Form 714 -- Annual Electric Balancing Authority Area and Planning Area Report

Primary key:

state_id_fips, datetime_utc

Usage Warnings

  • Contains estimated values.

  • Contains information from multiple raw inputs.

  • Large table; do not attempt to open with Excel.

  • FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.

Additional Details

This table uses hourly electricity demand and information about service territories to estimate the total hourly electricity demand for each US state. Hourly demand is drawn from the FERC 714, which reports at the balancing authority and utility level. Service territories for utilities and balancing authorities are inferred from information in the EIA 861, including the counties served by each utility and the utilities that make up each balancing authority.

This table uses the total electricity sales by state reported in the EIA 861 as a scaling factor to ensure that the magnitude of electricity sales is roughly correct, and obtains the shape of the demand curve from the hourly planning area demand reported in the FERC 714.
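
The scaling step described above can be sketched as follows: the hourly profile keeps its shape, and a single factor rescales it so that its total matches the reported sales. This is a minimal illustration for one state and one year, assuming hypothetical inputs. The function name is not part of PUDL, and the actual pipeline may group and weight the data differently.

```python
def scale_hourly_demand(hourly_demand_mwh, annual_sales_mwh):
    """Rescale an hourly profile so that its sum equals the reported annual sales."""
    total = sum(hourly_demand_mwh)
    if total == 0:
        raise ValueError("cannot scale a profile whose total demand is zero")
    factor = annual_sales_mwh / total
    return [demand * factor for demand in hourly_demand_mwh]


# Hypothetical four-hour profile scaled to 50 MWh of reported sales.
scaled = scale_hourly_demand([10.0, 20.0, 30.0, 40.0], annual_sales_mwh=50.0)
```

In terms of the columns below, the input corresponds to demand_mwh and the output to scaled_demand_mwh.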

This table includes the state FIPS codes - see out_censusdp1tract__states for additional information about FIPS state codes.

Columns
state_id_fips

Two digit state FIPS code.

datetime_utc

Date and time converted to Coordinated Universal Time (UTC).

demand_mwh

Electricity demand (energy) within a given timeframe.

scaled_demand_mwh

Estimated electricity demand scaled by the total sales within a state.

out_ferc714__hourly_planning_area_demand

package: pudl

Hourly time series of electricity demand by planning area.

Processing:

Data is ready for use in analyses, but for practical reasons has not been denormalized and remains in narrow format.

Source:

FERC Form 714 -- Annual Electric Balancing Authority Area and Planning Area Report (Part III, Schedule 2a)

Primary key:

respondent_id_ferc714, datetime_utc

Usage Warnings

  • Contains rows where missing values were imputed.

  • The datetime_utc timestamps have been cleaned due to inconsistent datetime reporting. See core_ferc714__hourly_planning_area_demand for additional details.

  • FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.

Additional Details

This table is based on core_ferc714__hourly_planning_area_demand, but adds imputed demand values where the original data was missing or anomalous. Codes explaining why values have been imputed can be found in the core_pudl__codes_imputation_reasons table.

Columns
respondent_id_ferc714

PUDL-assigned ID identifying a respondent to FERC Form 714. This ID associates natively reported respondent IDs from the original CSV and XBRL data sources.

respondent_id_ferc714_csv

FERC Form 714 respondent ID from CSV reported data - published from years: 2006-2020. This ID is linked to the newer years of reported XBRL data through the PUDL-assigned respondent_id_ferc714 ID. This ID was originally reported as respondent_id. Note that this ID does not correspond to FERC respondent IDs from other forms.

respondent_id_ferc714_xbrl

FERC Form 714 respondent ID from XBRL reported data - published from years: 2021-present. This ID is linked to the older years of reported CSV data through the PUDL-assigned respondent_id_ferc714 ID. This ID was originally reported as entity_id. Note that this ID does not correspond to FERC respondent IDs from other forms.

report_date

Date reported.

datetime_utc

Date and time converted to Coordinated Universal Time (UTC).

timezone

IANA timezone name.

demand_reported_mwh

Originally reported electricity demand, calculated by taking the net generation within the BA and subtracting the interchange with adjacent BAs.

demand_imputed_pudl_mwh

Electricity demand calculated by subtracting BA interchange from net generation, with outliers and missing values imputed in PUDL.

demand_imputed_pudl_mwh_imputation_code

Code describing why a demand value was flagged for imputation.

out_ferc714__respondents_with_fips

package: pudl

Annual respondents with the county FIPS IDs for their service territories.

Processing:

Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.

Source:

FERC Form 714 -- Annual Electric Balancing Authority Area and Planning Area Report

Primary key:

This table has no primary key. Note that the state and county FIPS columns can be null. Otherwise the natural primary key would be: ['respondent_id_ferc714', 'report_date', 'county_id_fips']. The state FIPS code is not part of the primary key because the first two digits of the county FIPS code indicate the state.
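
Since the state code is the two-digit prefix of the county code, it can be recovered from county_id_fips alone. The following is a minimal sketch, assuming county FIPS codes are stored as five-character strings so that leading zeros survive. The helper name is hypothetical.

```python
def state_fips_from_county(county_id_fips):
    """Return the two-digit state FIPS prefix of a five-digit county FIPS code."""
    if county_id_fips is None:
        return None  # the FIPS columns can be null in this table
    if len(county_id_fips) != 5 or not county_id_fips.isdigit():
        raise ValueError(f"expected a five-digit county FIPS string: {county_id_fips!r}")
    return county_id_fips[:2]


state = state_fips_from_county("08031")
```

Treating FIPS codes as integers would drop leading zeros and break this prefix relationship.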

Usage Warnings

  • Contains information from multiple raw inputs.

  • FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.

Columns
eia_code

EIA utility or balancing area authority ID associated with this FERC Form 714 respondent. Note that many utilities are also balancing authorities and in many cases EIA uses the same integer ID to identify a utility in its role as a balancing authority AND as a utility, but there is no requirement that these IDs be the same, and in a number of cases they are different.

respondent_type

Whether a respondent to the FERC form 714 is a utility or a balancing authority.

respondent_id_ferc714

PUDL-assigned ID identifying a respondent to FERC Form 714. This ID associates natively reported respondent IDs from the original CSV and XBRL data sources.

respondent_name_ferc714

Name of the utility, balancing area authority, or planning authority responding to FERC Form 714.

report_date

Date reported.

balancing_authority_id_eia

EIA balancing authority ID. This is often (but not always!) the same as the utility ID associated with the same legal entity.

balancing_authority_code_eia

EIA short code identifying a balancing authority. May include Canadian and Mexican BAs.

balancing_authority_name_eia

Name of the balancing authority.

utility_id_eia

The EIA Utility Identification number.

utility_name_eia

The name of the utility.

state

Two letter US state abbreviation.

county

County name.

state_id_fips

Two digit state FIPS code.

county_id_fips

County ID from the Federal Information Processing Standard Publication 6-4.

out_ferc714__summarized_demand

package: pudl

Summarized demand statistics and FERC-714 respondent attributes by respondent-year.

Processing:

Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.

Source:

FERC Form 714 -- Annual Electric Balancing Authority Area and Planning Area Report

Primary key:

respondent_id_ferc714, report_date

Usage Warnings

  • FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.

Columns
report_date

Date reported.

respondent_id_ferc714

PUDL-assigned ID identifying a respondent to FERC Form 714. This ID associates natively reported respondent IDs from the original CSV and XBRL data sources.

demand_annual_mwh

Annual electricity demand in a given report year.

population

County population, sourced from Census DP1 data.

area_km2

County area in km2.

population_density_km2

Average population per sq. km area of a service territory.

demand_annual_per_capita_mwh

Per-capita annual demand, averaged using Census county-level population estimates.

demand_density_mwh_km2

Annual demand per km2 of a given service territory.

eia_code

EIA utility or balancing area authority ID associated with this FERC Form 714 respondent. Note that many utilities are also balancing authorities and in many cases EIA uses the same integer ID to identify a utility in its role as a balancing authority AND as a utility, but there is no requirement that these IDs be the same, and in a number of cases they are different.

respondent_type

Whether a respondent to the FERC form 714 is a utility or a balancing authority.

respondent_name_ferc714

Name of the utility, balancing area authority, or planning authority responding to FERC Form 714.

balancing_authority_id_eia

EIA balancing authority ID. This is often (but not always!) the same as the utility ID associated with the same legal entity.

balancing_authority_code_eia

EIA short code identifying a balancing authority. May include Canadian and Mexican BAs.

balancing_authority_name_eia

Name of the balancing authority.

utility_id_eia

The EIA Utility Identification number.

utility_name_eia

The name of the utility.
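
The density and per-capita columns in this table are simple ratios of demand, population, and area. The following is a minimal sketch of those ratios, assuming hypothetical values for one respondent-year. The function name is illustrative and the null handling is an assumption.

```python
def summarize_territory(demand_annual_mwh, population, area_km2):
    """Derive density and per-capita metrics, returning None where undefined."""

    def ratio(numerator, denominator):
        return numerator / denominator if denominator else None

    return {
        "population_density_km2": ratio(population, area_km2),
        "demand_annual_per_capita_mwh": ratio(demand_annual_mwh, population),
        "demand_density_mwh_km2": ratio(demand_annual_mwh, area_km2),
    }


# Hypothetical service territory.
summary = summarize_territory(
    demand_annual_mwh=1_000_000.0, population=50_000, area_km2=2_000.0
)
```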

out_gridpathratoolkit__hourly_available_capacity_factor

package: pudl

Hourly time series of capacity factors defining the capacity available from an aggregated group of generators.

Processing:

Data is ready for use in analyses, but for practical reasons has not been denormalized and remains in narrow format.

Source:

GridPath Resource Adequacy Toolkit Data

Primary key:

datetime_utc, aggregation_group

Usage Warnings

  • Contains estimated values.

  • The data is no longer being collected or reported in this way.

Additional Details

This table contains a mix of profiles representing whole regions and individual plants, where the individual plants are hybrid wind or solar plus electricity storage facilities. Capacity factors are stated as a fraction of the aggregate nameplate capacity of the group.

For the hybrid facilities, the capacity factor represents the available output of only the renewable generators. Estimates of the generation profiles have been extended across a longer range of dates than the original data. For wind, the years 2015-2020 are synthesized data and for solar 2020 is synthesized. See the Appendix of the GridPath Resource Adequacy Toolkit report for details.
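
Because the capacity factors are fractions of a group's aggregate nameplate capacity, converting a profile into available megawatts is a single multiplication per hour. The following is a minimal sketch, assuming the nameplate capacity for the aggregation group is known from elsewhere. The values and function name are hypothetical.

```python
def available_capacity_mw(capacity_factors, nameplate_capacity_mw):
    """Convert hourly capacity factors into available capacity in MW."""
    return [factor * nameplate_capacity_mw for factor in capacity_factors]


# Hypothetical three-hour profile for a group with 200 MW of nameplate capacity.
profile_mw = available_capacity_mw([0.0, 0.25, 0.5], nameplate_capacity_mw=200.0)
```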

Columns
datetime_utc

Date and time converted to Coordinated Universal Time (UTC).

aggregation_group

A label identifying a group of aggregated generator capacity factors.

capacity_factor

Fraction of potential generation that was actually reported for a plant part.

out_pudl__yearly_assn_eia_ferc1_plant_parts

package: pudl

Annual Association table providing connections between power plant data reported in FERC Form 1 and any available EIA data related to the plant parts covered by each FERC Form 1 record.

Processing:

Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.

Source:

The Public Utility Data Liberation (PUDL) Project

Primary key:

record_id_ferc1

Usage Warnings

  • Data has been drawn from several EIA sources which are not always consistent with each other, and PUDL chooses the most consistent or relevant value to facilitate cross-referencing even if that means some values will differ from the raw sources. See Harvesting for details.

  • Contains information from multiple raw inputs.

  • FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.

Additional Details

This table answers the question "What EIA data reported about plants or generators should be associated with a given plant record found in the FERC Form 1?"

Each record in this table corresponds to a single FERC Form 1 record reported in one of several tables describing power plants (large steam, hydro, small, etc.). These FERC records can correspond to an entire plant, individual generators within a plant, all generators in a plant with the same prime mover type, or just the respondent's ownership share of any of those categories (or other categories). Furthermore, the same utility may report the same plant in different ways in different years.

The EIA data associated with each FERC plant record comes from our Plant Parts EIA table, out_eia__yearly_plant_parts. The EIA data in each record represents an aggregation of several slices of an EIA plant, across both physical characteristics and utility ownership.

This table inherits canonicalized values for generators and plants. EIA reports many attributes in many different tables across EIA-860 and EIA-923. In order to compile tidy, well-normalized database tables, PUDL collects all instances of these values and chooses a canonical value. By default, PUDL chooses the most consistently reported value of a given attribute as long as it accounts for at least 70% of the instances reported. If an attribute was reported inconsistently across the original EIA tables, then it will show up as a null value.
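
The 70% rule described above can be sketched as follows. This is an illustration of the idea rather than PUDL's implementation: the function name is hypothetical, and excluding nulls from the denominator is an assumption.

```python
from collections import Counter


def canonical_value(values, threshold=0.7):
    """Return the most common non-null value if its share meets the threshold."""
    reported = [value for value in values if value is not None]
    if not reported:
        return None
    value, count = Counter(reported).most_common(1)[0]
    return value if count / len(reported) >= threshold else None


# Hypothetical reports of one attribute across several EIA tables.
consistent = canonical_value(["NG"] * 8 + ["DFO"] * 2)
inconsistent = canonical_value(["NG", "NG", "DFO", "DFO"])
```

In the first case the dominant value is kept. In the second no value reaches the threshold, so the result is null, which matches the behavior described for inconsistently reported attributes.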

Columns
record_id_ferc1

Identifier indicating original FERC Form 1 source record. format: {table_name}_{report_year}_{report_prd}_{respondent_id}_{spplmnt_num}_{row_number}. Unique within FERC Form 1 DB tables which are not row-mapped.

record_id_eia

Identifier for EIA plant parts analysis records.

match_type

Indicates the source and validation of the match between EIA and FERC. Match types include matches generated by the model, verified by the training data, overridden by the training data, etc.

plant_name_ppe

Derived plant name that includes EIA plant name and other strings associated with ID and PK columns of the plant part.

plant_part

The part of the plant a record corresponds to.

report_year

Four-digit year in which the data was reported.

report_date

Date reported.

ownership_record_type

Whether each generator record is for one owner or represents a total of all ownerships.

plant_name_eia

Plant name.

plant_id_eia

The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.

generator_id

Generator ID is usually numeric, but sometimes includes letters. Make sure you treat it as a string!

unit_id_pudl

Dynamically assigned PUDL unit id. WARNING: This ID is not guaranteed to be static long term as the input data and algorithm may evolve over time.

prime_mover_code

Code for the type of prime mover (e.g. CT, CG)

energy_source_code_1

The code representing the most predominant type of energy that fuels the generator.

technology_description

High level description of the technology used by the generator to produce electricity.

ferc_acct_name

Name of FERC account, derived from technology description and prime mover code.

generator_operating_year

Year a generator went into service.

utility_id_eia

The EIA Utility Identification number.

utility_id_pudl

A manually assigned PUDL utility ID. May not be stable over time.

true_gran

Indicates whether a plant part list record is associated with the highest priority plant part for all identical records.

appro_part_label

Plant part of the associated true granularity record.

appro_record_id_eia

EIA record ID of the associated true granularity record.

record_count

Number of distinct generator IDs that participated in the aggregation for a plant part list record.

fraction_owned

Proportion of generator ownership attributable to this utility.

ownership_dupe

Whether a plant part record has a duplicate record with different ownership status.

operational_status

The operating status of the asset. For generators this is based on which tab of EIA 860 the generator was listed in.

operational_status_pudl

The operating status of the asset using PUDL categories.

plant_id_pudl

A manually assigned PUDL plant ID. May not be constant over time.

total_fuel_cost_eia

Total annual reported fuel costs for the plant part. Includes costs from all fuels.

fuel_cost_per_mmbtu_eia

Average fuel cost per mmBTU of heat content in nominal USD.

net_generation_mwh_eia

Net electricity generation for the specified period in megawatt-hours (MWh).

capacity_mw_eia

Total installed (nameplate) capacity, in megawatts.

capacity_factor_eia

Fraction of potential generation that was actually reported for a plant part.
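As a rough illustration of this definition, capacity factor can be computed as net generation divided by the generation that would result from running at nameplate capacity for every hour of the period. The sketch below assumes an annual period and uses hypothetical function names; PUDL's own calculation may handle edge cases differently.

```python
import calendar


def hours_in_year(year):
    """Number of hours in a calendar year, accounting for leap years."""
    return (366 if calendar.isleap(year) else 365) * 24


def capacity_factor(net_generation_mwh, capacity_mw, year):
    """Annual capacity factor; returns None when capacity is missing or zero."""
    if capacity_mw is None or capacity_mw <= 0:
        return None
    potential_mwh = capacity_mw * hours_in_year(year)
    return net_generation_mwh / potential_mwh


# A 100 MW plant part generating 438,000 MWh in a 8,760-hour year.
cf = capacity_factor(438_000.0, 100.0, 2019)
```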

total_mmbtu_eia

Total annual heat content of fuel consumed by a plant part record in the plant parts list.

unit_heat_rate_mmbtu_per_mwh_eia

Fuel content per unit of electricity generated. Coming from MCOE calculation.

fuel_type_code_pudl_eia

Simplified fuel type code used in PUDL

installation_year_eia

Year the plant's most recently built unit was installed.

plant_part_id_eia

Contains EIA plant ID, plant part, ownership, and EIA utility id

utility_id_ferc1

PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.

utility_name_ferc1

Name of the responding utility, as it is reported in FERC Form 1. For human readability only.

plant_id_ferc1

Algorithmically assigned PUDL FERC Plant ID. WARNING: NOT STABLE BETWEEN PUDL DB INITIALIZATIONS.

plant_name_ferc1

Name of the plant, as reported to FERC. This is a freeform string, not guaranteed to be consistent across references to the same plant.

asset_retirement_cost

Asset retirement cost (USD).

avg_num_employees

The average number of employees assigned to each plant.

capacity_factor_ferc1

Fraction of potential generation that was actually reported for a plant part.

capacity_mw_ferc1

Total installed (nameplate) capacity, in megawatts.

capex_annual_addition

Annual capital addition into capex_total.

capex_annual_addition_rolling

Year-to-date capital addition into capex_total.

capex_annual_per_kw

Annual capital addition into capex_total per kW.

capex_annual_per_mw

Annual capital addition into capex_total per MW.

capex_annual_per_mw_rolling

Year-to-date capital addition into capex_total per MW.

capex_annual_per_mwh

Annual capital addition into capex_total per MWh.

capex_annual_per_mwh_rolling

Year-to-date capital addition into capex_total per MWh.

capex_equipment

Cost of plant: equipment (USD).

capex_land

Cost of plant: land and land rights (USD).

capex_per_mw

Cost of plant per megawatt of installed (nameplate) capacity. Nominal USD.

capex_structures

Cost of plant: structures and improvements (USD).

capex_total

Total cost of plant (USD).

capex_wo_retirement_total

Total cost of plant (USD) without retirements.

construction_type

Type of plant construction ('outdoor', 'semioutdoor', or 'conventional'). Categorized by PUDL based on our best guess of intended value in FERC1 freeform strings.

construction_year_eia

Year the plant's oldest still operational unit was built.

construction_year_ferc1

Year the plant's oldest still operational unit was built.

installation_year_ferc1

Year the plant's most recently built unit was installed.

net_generation_mwh_ferc1

Net electricity generation for the specified period in megawatt-hours (MWh).

not_water_limited_capacity_mw

Plant capacity in MW when not limited by condenser water.

opex_allowances

Allowances.

opex_boiler

Maintenance of boiler (or reactor) plant.

opex_coolants

Cost of coolants and water (nuclear plants only)

opex_electric

Production expenses: electric expenses (USD).

opex_engineering

Production expenses: maintenance, supervision, and engineering (USD).

opex_fuel

Production expenses: fuel (USD).

fuel_cost_per_mwh_eia

Derived from MCOE, a unit level value. Average fuel cost per MWh of heat content in nominal USD.

fuel_cost_per_mwh_ferc1

Derived from MCOE, a unit level value. Average fuel cost per MWh of heat content in nominal USD.

opex_misc_power

Miscellaneous steam (or nuclear) expenses.

opex_misc_steam

Maintenance of miscellaneous steam (or nuclear) plant.

opex_nonfuel_per_mwh

Investments in non-fuel production expenses per MWh.

opex_operations

Production expenses: operations, supervision, and engineering (USD).

opex_per_mwh

Total production expenses (USD per MWh generated).

opex_plant

Production expenses: maintenance of electric plant (USD).

opex_production_total

Total operating expenses.

opex_rents

Production expenses: rents (USD).

opex_steam

Steam expenses.

opex_steam_other

Steam from other sources.

opex_structures

Production expenses: maintenance of structures (USD).

opex_total_nonfuel

Total production expenses, excluding fuel (USD).

opex_transfer

Steam transferred (Credit).

peak_demand_mw

Net peak demand for 60 minutes. Note: in some cases peak demand for other time periods may have been reported instead, if hourly peak demand was unavailable.

plant_capability_mw

Net plant capability in megawatts.

plant_hours_connected_while_generating

Hours the plant was connected to load while generating in the report year.

plant_type

Type of plant.

water_limited_capacity_mw

Plant capacity in MW when limited by condenser water.

fuel_cost_per_mmbtu_ferc1

Average fuel cost per mmBTU of heat content in nominal USD.

fuel_type

Type of fuel.

license_id_ferc1

FERC issued operating license ID for the facility, if available. This value is extracted from the original plant name where possible.

opex_maintenance

Production expenses: Maintenance (USD).

opex_total

Total production expenses, excluding fuel (USD).

capex_facilities

Cost of plant: reservoirs, dams, and waterways (USD).

capex_roads

Cost of plant: roads, railroads, and bridges (USD).

net_capacity_adverse_conditions_mw

Net plant capability under the least favorable operating conditions, in megawatts.

net_capacity_favorable_conditions_mw

Net plant capability under the most favorable operating conditions, in megawatts.

opex_dams

Production expenses: maintenance of reservoirs, dams, and waterways (USD).

opex_generation_misc

Production expenses: miscellaneous power generation expenses (USD).

opex_hydraulic

Production expenses: hydraulic expenses (USD).

opex_misc_plant

Production expenses: maintenance of miscellaneous hydraulic plant (USD).

opex_water_for_power

Production expenses: water for power (USD).

ferc_license_id

The FERC license ID of a project.

capex_equipment_electric

Cost of plant: accessory electric equipment (USD).

capex_equipment_misc

Cost of plant: miscellaneous power plant equipment (USD).

capex_wheels_turbines_generators

Cost of plant: water wheels, turbines, and generators (USD).

energy_used_for_pumping_mwh

Energy used for pumping, in megawatt-hours.

net_load_mwh

Net output for load (net generation - energy used for pumping) in megawatt-hours.

opex_production_before_pumping

Total production expenses before pumping (USD).

opex_pumped_storage

Production expenses: pumped storage (USD).

opex_pumping

Production expenses: pumping (USD).

total_fuel_cost_ferc1

Total annual reported fuel costs for the plant part. Includes costs from all fuels.

total_mmbtu_ferc1

Total annual heat content of fuel consumed by a plant part record in the plant parts list.

fuel_type_code_pudl_ferc1

Simplified fuel type code used in PUDL

unit_heat_rate_mmbtu_per_mwh_ferc1

Fuel content per unit of electricity generated. Calculated from FERC reported fuel consumption and net generation.
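A minimal sketch of that ratio, using the total_mmbtu_ferc1 and net_generation_mwh_ferc1 columns as inputs. The function name is hypothetical, and returning None for missing or non-positive generation is an assumption made here to avoid dividing by zero.

```python
def heat_rate_mmbtu_per_mwh(total_mmbtu, net_generation_mwh):
    """Heat rate as fuel heat content per unit of net generation."""
    if total_mmbtu is None or net_generation_mwh is None:
        return None
    if net_generation_mwh <= 0:
        # A heat rate is not meaningful without positive net generation.
        return None
    return total_mmbtu / net_generation_mwh


# 1,050,000 MMBtu burned to produce 100,000 MWh.
rate = heat_rate_mmbtu_per_mwh(1_050_000.0, 100_000.0)
```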

out_sec10k__changelog_company_name

package: pudl

Changelog table tracking changes in the names each SEC 10-K filer has used.

Processing:

Data is ready for use in analyses, but for practical reasons has not been denormalized and remains in narrow format.

Source:

U.S. Securities and Exchange Commission (SEC) Form 10-K

Primary key:

This table has no primary key.

Usage Warnings

  • PUDL does not currently update its copy of this data.

Additional Details

We use the company name reported in association with each name change block in the company information table to fill in the most recent value of company_name_new. Roughly 1,000 reported "name changes" in which the old and new names were identical have been dropped.
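The filtering step mentioned above can be sketched as follows. The records are made up, the helper name is hypothetical, and exact string equality is an assumption; the source does not say how names were compared.

```python
def drop_identical_name_changes(records):
    """Keep only name changes where the old and new names actually differ."""
    return [
        r for r in records
        if r["company_name_old"] != r["company_name_new"]
    ]


# Made-up records for illustration only.
changes = [
    {
        "central_index_key": "0000000001",
        "company_name_old": "Example Power Co",
        "company_name_new": "Example Energy Inc",
    },
    {
        "central_index_key": "0000000002",
        "company_name_old": "Sample Utilities",
        "company_name_new": "Sample Utilities",
    },
]
kept = drop_identical_name_changes(changes)
```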

Columns
central_index_key

Identifier of the company in SEC database.

name_change_date

Date of last name change of the company.

company_name_old

Name of company prior to name change.

company_name_new

Name of company after name change.

out_sec10k__parents_and_subsidiaries

package: pudl

A denormalized table containing information about parent companies that file SEC Form 10-K and their subsidiaries, which may or may not file Form 10-K.

Processing:

Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.

Source:

U.S. Securities and Exchange Commission (SEC) Form 10-K

Primary key:

filename_sec10k, subsidiary_company_id_sec10k

Usage Warnings

  • This data has been extracted probabilistically using a machine learning model and contains some incompletions and errors. It should not be treated as ground truth data.

  • PUDL does not currently update its copy of this data.

Additional Details

Company ownership fractions are extracted from SEC 10-K Exhibit 21. Information about the companies is extracted primarily from the headers of the SEC 10-K filing. Subsidiaries that file Form 10-K will have much more information available than those that only appear as subsidiaries in Exhibit 21.

SEC 10-K filers and EIA utilities are matched using probabilistic record linkage. Exhibit 21 subsidiaries that don't file a Form 10-K are matched to EIA utilities using the company name.

Columns
filename_sec10k

Unique portion of the filename associated with the SEC 10-K filing in the EDGAR database. The full source URL can be reconstructed by prepending https://www.sec.gov/Archives/edgar/data/ and adding the .txt file type extension.
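The URL reconstruction described above amounts to simple string concatenation. In this sketch the function name and the example filename are hypothetical; only the prefix and the .txt extension come from the description.

```python
EDGAR_PREFIX = "https://www.sec.gov/Archives/edgar/data/"


def sec10k_source_url(filename_sec10k):
    """Rebuild the full filing URL from the unique filename portion."""
    return f"{EDGAR_PREFIX}{filename_sec10k}.txt"


# Made-up filename used only to show the shape of the result.
url = sec10k_source_url("1000000/0001000000-20-000001")
```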

subsidiary_company_name

Name of subsidiary company.

subsidiary_company_location

Location of subsidiary company. This is the full US state name or country name and occasionally a two digit code that was not mapped to a full name during cleaning.

subsidiary_company_id_sec10k

PUDL-assigned ID for subsidiaries found in SEC 10-K Exhibit 21. The ID is created by concatenating the CIK of the company whose filing the subsidiary was found in, the subsidiary company's name, and location of incorporation. It is not guaranteed to be stable across different releases of PUDL and so should never be hard-coded in analyses.

fraction_owned

Proportion of generator ownership attributable to this utility.

parent_company_central_index_key

Central index key (CIK) of the parent company.

parent_company_name

Name of the parent company.

filing_date

Date on which the filing was submitted.

report_date

Date reported.

parent_company_phone_number

Phone number of the parent company.

parent_company_business_city

City where the parent company's place of business is located.

parent_company_business_state

State where the parent company's place of business is located.

parent_company_business_street_address

Street address of the parent company's place of business.

parent_company_business_street_address_2

Second line of the street address of the parent company's place of business.

parent_company_business_zip_code

Zip code of the parent company's place of business.

parent_company_business_zip_code_4

Zip code suffix of the parent company's place of business.

parent_company_mail_city

City of the parent company's mailing address.

parent_company_mail_state

State of the parent company's mailing address.

parent_company_mail_street_address

Street portion of the parent company's mailing address.

parent_company_mail_street_address_2

Second line of the street portion of the parent company's mailing address.

parent_company_mail_zip_code

Zip code of the parent company's mailing address.

parent_company_mail_zip_code_4

Zip code suffix of the parent company's mailing address.

parent_company_incorporation_state

Two letter state code where parent company is incorporated.

parent_company_utility_id_eia

The EIA utility ID of the parent company.

parent_company_utility_name_eia

The EIA reported utility name of the parent company.

parent_company_industry_name_sic

Text description of the parent company's Standard Industrial Classification (SIC)

parent_company_industry_id_sic

Four-digit Standard Industrial Classification (SIC) code identifying the parent company's primary industry. SIC codes have been replaced by NAICS codes in many applications, but are still used by the SEC. See e.g. https://www.osha.gov/data/sic-manual for code definitions.

parent_company_taxpayer_id_irs

Taxpayer ID of the parent company with the IRS.

subsidiary_company_central_index_key

Central index key (CIK) of the subsidiary company.

subsidiary_company_phone_number

Phone number of the subsidiary company.

subsidiary_company_business_city

City where the subsidiary company's place of business is located.

subsidiary_company_business_state

State where the subsidiary company's place of business is located.

subsidiary_company_business_street_address

Street address of the subsidiary company's place of business.

subsidiary_company_business_street_address_2

Second line of the street address of the subsidiary company's place of business.

subsidiary_company_business_zip_code

Zip code of the subsidiary company's place of business.

subsidiary_company_business_zip_code_4

Zip code suffix of the subsidiary company's place of business.

subsidiary_company_mail_city

City of the subsidiary company's mailing address.

subsidiary_company_mail_state

State of the subsidiary company's mailing address.

subsidiary_company_mail_street_address

Street portion of the subsidiary company's mailing address.

subsidiary_company_mail_street_address_2

Second line of the street portion of the subsidiary company's mailing address.

subsidiary_company_mail_zip_code

Zip code of the subsidiary company's mailing address.

subsidiary_company_mail_zip_code_4

Zip code suffix of the subsidiary company's mailing address.

subsidiary_company_incorporation_state

Two letter state code where subsidiary company is incorporated.

subsidiary_company_utility_id_eia

The EIA utility ID of the subsidiary company.

subsidiary_company_utility_name_eia

The EIA reported utility name of the subsidiary company.

subsidiary_company_industry_name_sic

Text description of the subsidiary company's Standard Industrial Classification (SIC)

subsidiary_company_industry_id_sic

Four-digit Standard Industrial Classification (SIC) code identifying the subsidiary company's primary industry. SIC codes have been replaced by NAICS codes in many applications, but are still used by the SEC. See e.g. https://www.osha.gov/data/sic-manual for code definitions.

subsidiary_company_taxpayer_id_irs

Taxpayer ID of the subsidiary company with the IRS.

out_sec10k__quarterly_company_information

package: pudl

Quarterly time series of company and filing data extracted from SEC 10-K filings.

Processing:

Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.

Source:

U.S. Securities and Exchange Commission (SEC) Form 10-K

Primary key:

filename_sec10k, central_index_key

Usage Warnings

  • PUDL does not currently update its copy of this data.

Additional Details

In addition to the information provided by the core_sec10k__quarterly_company_information table, this output table merges in the associated utility_id_eia (and utility name) where available, adds the report and filing dates of the filing each record was extracted from, and provides a link to the source URL for the filing.

Columns
filename_sec10k

Unique portion of the filename associated with the SEC 10-K filing in the EDGAR database. The full source URL can be reconstructed by prepending https://www.sec.gov/Archives/edgar/data/ and adding the .txt file type extension.

central_index_key

Identifier of the company in SEC database.

filer_count

A counter indicating which observation of company data within an SEC 10-K filing header the record pertains to.

utility_id_eia

The EIA Utility Identification number.

utility_name_eia

The name of the utility.

report_date

Date reported.

filing_date

Date on which the filing was submitted.

company_name

Name of company submitting SEC 10k filing.

fiscal_year_end

The end date of an SEC filing company's fiscal year, in MMDD format.
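Since the value is a four-character MMDD string, it should be kept as text so that leading zeros (for example 0630) survive. A minimal parsing sketch, with a hypothetical function name, might look like this; validating against a leap year is a choice made here so that 0229 is accepted.

```python
from datetime import date


def parse_fiscal_year_end(mmdd):
    """Split an MMDD string into (month, day), validating the combination."""
    if len(mmdd) != 4 or not mmdd.isdigit():
        raise ValueError(f"Expected a four-digit MMDD string, got {mmdd!r}")
    month, day = int(mmdd[:2]), int(mmdd[2:])
    # Constructing a date in a leap year raises ValueError for impossible
    # month/day pairs while still allowing February 29.
    date(2000, month, day)
    return month, day


december_end = parse_fiscal_year_end("1231")
june_end = parse_fiscal_year_end("0630")
```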

taxpayer_id_irs

Taxpayer ID of the company with the IRS.

incorporation_state

Two letter state code where company is incorporated.

industry_name_sic

Text description of Standard Industrial Classification (SIC)

industry_group_sic

A higher level industry category defined within the Standard Industrial Classification (SIC) system.

industry_id_sic

Four-digit Standard Industrial Classification (SIC) code identifying the company's primary industry. SIC codes have been replaced by NAICS codes in many applications, but are still used by the SEC. See e.g. https://www.osha.gov/data/sic-manual for code definitions.

film_number

Document control number used in the SEC EDGAR database. The first four digits can be used to access scans of the document in the SEC's Virtual Private Reference Room.

sec10k_type

Specific version of SEC 10-K that was filed. 10-k: the standard annual report. 10-k/a: an amended version of the annual report. 10-k405: filed to report insider trading that was not reported in a timely fashion. 10-k405/a: an amended version of the 10-k405. 10-kt: submitted in lieu of or in addition to a standard 10-K annual report when a company changes the end of its fiscal year (e.g. due to a merger) leaving the company with a longer or shorter reporting period. 10-kt/a: an amended version of the 10-kt. 10-ksb: the annual report for small businesses, also known as penny stocks. 10-ksb/a: an amended version of the 10-ksb.

sec_act

SEC Act through which the form was enacted, e.g. 1934 act.

filing_number_sec

Filing number used internally by the SEC to track the filing.

phone_number

Phone number for utility contact 1.

business_street_address

Street address of the company's place of business.

business_street_address_2

Second line of the street address of the company's place of business.

business_city

City where the company's place of business is located.

business_state

State where the company's place of business is located.

business_zip_code

Zip code of the company's place of business.

business_zip_code_4

Zip code suffix of the company's place of business.

business_postal_code

Non-US postal code of the company's place of business.

mail_street_address

Street portion of the company's mailing address.

mail_street_address_2

Second line of the street portion of the company's mailing address.

mail_city

City of the company's mailing address.

mail_state

State of the company's mailing address.

mail_zip_code

Zip code of the company's mailing address.

mail_zip_code_4

Zip code suffix of the company's mailing address.

mail_postal_code

Non-US postal code of the company's mailing address.

source_url

URL pointing to the original source of the data in the record.

out_sec10k__quarterly_filings

package: pudl

Quarterly time series of metadata describing SEC 10-K filings.

Processing:

Data has been expanded into a wide/denormalized format, with IDs and codes accompanied by human-readable names and descriptions.

Source:

U.S. Securities and Exchange Commission (SEC) Form 10-K

Primary key:

filename_sec10k

Usage Warnings

  • PUDL does not currently update its copy of this data.

Additional Details

Each SEC 10-K filing is submitted by a single company, but may contain information about numerous other companies. This table indicates the company submitting the filing, as well as some information about the overall filing. Each filing is guaranteed to have a unique filename, but ~1% of all filings are one company submitting the same form multiple times on the same day, so the filename is the only available natural primary key. This output table adds a link to the source URL for the filing, which is constructed from the filename.

Columns
filename_sec10k

Unique portion of the filename associated with the SEC 10-K filing in the EDGAR database. The full source URL can be reconstructed by prepending https://www.sec.gov/Archives/edgar/data/ and adding the .txt file type extension.

central_index_key

Identifier of the company in SEC database.

company_name

Name of company submitting SEC 10k filing.

sec10k_type

Specific version of SEC 10-K that was filed. 10-k: the standard annual report. 10-k/a: an amended version of the annual report. 10-k405: filed to report insider trading that was not reported in a timely fashion. 10-k405/a: an amended version of the 10-k405. 10-kt: submitted in lieu of or in addition to a standard 10-K annual report when a company changes the end of its fiscal year (e.g. due to a merger) leaving the company with a longer or shorter reporting period. 10-kt/a: an amended version of the 10-kt. 10-ksb: the annual report for small businesses, also known as penny stocks. 10-ksb/a: an amended version of the 10-ksb.

filing_date

Date on which the filing was submitted.

exhibit_21_version

Version of exhibit 21 submitted (if applicable).

report_date

Date reported.

source_url

URL pointing to the original source of the data in the record.

out_vcerare__hourly_available_capacity_factor

package: pudl

Hourly time series of estimated county-averaged capacity factors for wind and solar generating facilities across the contiguous United States (US), to be used as a tool and input for resource adequacy modeling and planning.

Processing:

Data is ready for use in analyses, but for practical reasons has not been denormalized and remains in narrow format.

Source:

Vibrant Clean Energy Resource Adequacy Renewable Energy (RARE) Power Dataset

Primary key:

state, place_name, datetime_utc

Usage Warnings

  • The hourly capacity factors are normalized to unity for maximal power output. To convert to units of power, the user must multiply by the installed capacity within the county.

  • Hourly capacity factors are spatially averaged across each county over the contiguous USA. There are a handful of counties that are too small to pick up representation on the HRRR operational forecast grid. As such, these counties will have no wind or solar power production curves.

  • Due to power production performance being correlated with panel temperatures, during cold sunny periods, some solar capacity factor values are greater than 1 (but less than 1.1).
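The conversion from normalized capacity factors to power mentioned in the first warning is a single multiplication per hour. The sketch below uses made-up capacity factors and a hypothetical 200 MW of installed county capacity; the function name is also hypothetical.

```python
def hourly_power_mw(capacity_factors, installed_capacity_mw):
    """Scale normalized hourly capacity factors by installed capacity."""
    return [cf * installed_capacity_mw for cf in capacity_factors]


# Made-up values; the last one exceeds 1, as can happen for solar
# during cold sunny periods.
county_capacity_factors = [0.0, 0.25, 0.5, 1.0625]
power = hourly_power_mw(county_capacity_factors, 200.0)
```

Note that values slightly above 1 carry through to the result, so the estimated output can exceed the installed capacity used for scaling.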

Additional Details

The data in this table were produced by Vibrant Clean Energy, and are licensed to the public under the Creative Commons Attribution 4.0 International license (CC-BY-4.0).

The technologies provided are:

  1. Onshore wind assuming a 100m hub height and 120m rotor diameter;

  2. Offshore wind assuming a 140m hub height and 120m rotor diameter;

  3. Utility solar assuming a fixed axis panel tilted at latitude.

The foundation of the capacity factors provided here is the NOAA HRRR operational numerical weather prediction model. The HRRR covers the entire contiguous US at a horizontal resolution of 3 km. Forecasts are initialized each hour of the year. Forecast hour two (2) is used as the input data for the power algorithms. This forecast hour is chosen to trade-off the impact of the measurement and data assimilation procedure of the HRRR with the physics of the model to derive the most complete picture of the atmosphere at the forecast time horizon.

For wind capacity factors: vertical slices of the atmosphere are considered across the defined rotor swept area. Bringing together wind speed, density, temperature and icing information, a power capacity is estimated using a representative power coefficient (Cp) curve to determine the power from a given wind speed, atmospheric density and temperature. There is no wake modeling included in the dataset.

For solar capacity factors: pertinent surface weather variables are pulled such as incoming short wave radiation, direct normal irradiance (calculated in the HRRR 2016 forward), surface temperature and other parameters. These are used in a non-linear I-V curve translation to power capacity factors.

Columns
state

Two letter US state abbreviation.

place_name

County or lake name, sourced from the latest Census PEP vintage based on county FIPS ID. Lake names originate from VCE RARE directly, and may also appear several times--once for each state it touches. FIPS ID values for lakes have been nulled.

datetime_utc

Date and time converted to Coordinated Universal Time (UTC).

report_year

Four-digit year in which the data was reported.

hour_of_year

Integer between 1 and 8760 representing the hour in a given year.

county_id_fips

County ID from the Federal Information Processing Standard Publication 6-4.

latitude

Latitude of the place centroid (e.g., county centroid).

longitude

Longitude of the place centroid (e.g., county centroid).

capacity_factor_solar_pv

Estimated capacity factor (0-1) calculated for solar PV assuming a fixed axis panel tilted at latitude and DC power outputs. Due to power production performance being correlated with panel temperatures, during cold sunny periods, some solar capacity factor values are greater than 1 (but less than 1.1). All values are based on outputs from the NOAA HRRR operational numerical weather prediction model. Capacity factors are normalized to unity for maximal power output. Pertinent surface weather variables are pulled such as incoming short wave radiation, direct normal irradiance (calculated in the HRRR 2016 forward), surface temperature and other parameters. These are used in a non-linear I-V curve translation to power capacity factors.

capacity_factor_onshore_wind

Estimated capacity factor (0-1) calculated for onshore wind assuming a 100m hub height and 120m rotor diameter. Based on outputs from the NOAA HRRR operational numerical weather prediction model. Capacity factors are normalized to unity for maximal power output. Vertical slices of the atmosphere are considered across the defined rotor swept area. Bringing together wind speed, density, temperature and icing information, a power capacity is estimated using a representative power coefficient (Cp) curve to determine the power from a given wind speed, atmospheric density and temperature. There is no wake modeling included in the dataset.

capacity_factor_offshore_wind

Estimated capacity factor (0-1) calculated for offshore wind assuming a 140m hub height and 120m rotor diameter. Based on outputs from the NOAA HRRR operational numerical weather prediction model. Capacity factors are normalized to unity for maximal power output. Vertical slices of the atmosphere are considered across the defined rotor swept area. Bringing together wind speed, density, temperature and icing information, a power capacity is estimated using a representative power coefficient (Cp) curve to determine the power from a given wind speed, atmospheric density and temperature. There is no wake modeling included in the dataset.

core_eia176__yearly_gas_disposition

package: pudl

Annual time series of a company's natural and supplemental gas disposition for the report state.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

EIA Form 176 -- Annual Report of Natural and Supplemental Gas Supply and Disposition (Part 6, Lines 9, 12-20)

Primary key:

operator_id_eia, report_year

Additional Details

The deliveries_out_of_state_volume_mcf (Line 14.0) are reported as one aggregated volume, calculated by summing the original granular data. Similarly, disposition_to_other_volume_mcf (Line 18.4) is summed from the original granular data and reported as one aggregate field.

The delivered_gas_heat_content_mmbtu_per_mcf is expected to be between 0.8 and 1.2 by the EIA. We find that less than 0.5 percent of data falls outside of these expected bounds.
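Users who want to screen for values outside the expected range could apply a check along these lines. The function names and sample values are hypothetical, and treating the 0.8 and 1.2 bounds as inclusive is an assumption.

```python
def is_outside_expected_bounds(heat_content, low=0.8, high=1.2):
    """Flag a reported heat content outside the expected range (inclusive)."""
    if heat_content is None:
        return False
    return not (low <= heat_content <= high)


def share_outside_bounds(values):
    """Fraction of non-null values that fall outside the expected range."""
    reported = [v for v in values if v is not None]
    if not reported:
        return 0.0
    flagged = sum(is_outside_expected_bounds(v) for v in reported)
    return flagged / len(reported)


# Made-up values of delivered_gas_heat_content_mmbtu_per_mcf.
sample = [1.03, 1.05, None, 0.79, 1.2, 10.3]
share = share_outside_bounds(sample)
```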

The total_disposition_mcf field includes disposition to consumers which is reported in core_eia176__yearly_gas_disposition_by_consumer. Note that the reported total disposition and the sum of values in this table and core_eia176__yearly_gas_disposition_by_consumer often don't match as would be expected. Thus, we have preserved both the total field and the sub-components in these two tables.

Columns
operator_id_eia

The unique EIA identifier for an operator in a given state. The last two letters of the ID indicate the state.

report_year

Four-digit year in which the data was reported.

operating_state

State that the distribution utility is reporting for. Prior to 2004, this may be a list of states.

delivered_gas_heat_content_mmbtu_per_mcf

The average annual heat content of gas delivered directly to consumers.

operational_consumption_facility_space_heat_mcf

Volume of natural gas consumed as fuel for space heat of the operator's own facilities within the report state. Reference conditions for measurement are 14.73 psia and 60° Fahrenheit.

operational_consumption_new_pipeline_fill_mcf

Volume of natural gas consumed as fuel for the operator's new pipeline fill within the report state. Reference conditions for measurement are 14.73 psia and 60° Fahrenheit.

operational_consumption_compressors_mcf

Volume of natural gas consumed as fuel for the operator's own pipeline distribution or storage compressor use within the report state. Reference conditions for measurement are 14.73 psia and 60° Fahrenheit.

operational_consumption_lng_vaporization_liquefaction_mcf

Volume of natural gas consumed by the operator for vaporization, liquefaction, and LNG fuel within the report state. Reference conditions for measurement are 14.73 psia and 60° Fahrenheit.

operational_consumption_vehicle_fuel_mcf

Volume of natural gas used in the operator's company-owned fleet within the report state. Reference conditions for measurement are 14.73 psia and 60° Fahrenheit.

operational_consumption_other_mcf

Volume of natural gas consumed as fuel by the operator for other purposes within the report state. Reference conditions for measurement are 14.73 psia and 60° Fahrenheit.

operational_consumption_other_detail

Free-text detail describing the operator’s specified 'other purposes' for operational natural-gas consumption, corresponding to the volume reported in operational_consumption_other_mcf.

operational_storage_underground_mcf

The total volume added to underground storage operated by the company within the report state, regardless of ownership of the gas. This includes new fields. Reference conditions for measurement are 14.73 psia and 60° Fahrenheit.

operational_lng_storage_injections_mcf

The total volume added to LNG storage operated by the company within the report state, regardless of ownership of the gas. For LNG import and export marine terminals, this should exclude injections of LNG during the course of routine operations for handling imports, as well as LNG held in storage for future use. Reference conditions for measurement are 14.73 psia and 60° Fahrenheit.

producer_lease_use_mcf

The total volume of gas used in the company's well, field and lease operations. This field should only be reported by producers. Reference conditions for measurement are 14.73 psia and 60° Fahrenheit.

producer_returned_for_repressuring_reinjection_mcf

The volume of gas delivered directly from the operator's system to oil or gas fields located within the report state for repressuring, pressure maintenance, and cycling operations. This field should only be reported by producers. Reference conditions for measurement are 14.73 psia and 60° Fahrenheit.

disposition_distribution_companies_mcf

Disposition to distribution companies within the report state that does not fall into one of the other reported categories in lines 10.1-17.0. Reference conditions for measurement are 14.73 psia and 60° Fahrenheit.

disposition_storage_operators_mcf

Disposition to storage operators within the report state that does not fall into one of the other reported categories in lines 10.1-17.0. Reference conditions for measurement are 14.73 psia and 60° Fahrenheit.

disposition_other_pipelines_mcf

Disposition to other pipelines within the report state that does not fall into one of the other reported categories in lines 10.1-17.0. Reference conditions for measurement are 14.73 psia and 60° Fahrenheit.

disposition_out_of_state_mcf

Total volume of the operator's deliveries across or to state lines or U.S. borders. This has been summed from the detailed data reported by each company on Line 14.0 of the original form in order to preserve the primary key of the table. Reference conditions for measurement are 14.73 psia and 60° Fahrenheit.

other_disposition_all_other_mcf

Other disposition within the report state that does not fall into one of the other reported categories in lines 10.1-17.0. This has been summed from the detailed data reported by each company on Line 18.4 of the original form in order to preserve the primary key of the table. Reference conditions for measurement are 14.73 psia and 60° Fahrenheit.

total_disposition_mcf

Total disposition within the report state, as reported by the operator. This includes disposition to consumers, which is reported in core_eia176__yearly_gas_disposition_by_consumer. Note that the reported total disposition often does not equal the sum of the values in this table and in core_eia176__yearly_gas_disposition_by_consumer, even though the two would be expected to match. Reference conditions for measurement are 14.73 psia and 60° Fahrenheit.

losses_mcf

Known loss volumes as a result of leaks, damage, accidents, migration and blow down within the report state where the events took place, as well as estimated losses from leaks occurring during distribution activities. Reference conditions for measurement are 14.73 psia and 60° Fahrenheit.

unaccounted_for_mcf

The difference between gas supply and disposition. A positive entry indicates supply in excess of accounted-for disposition. A negative entry indicates accounted-for disposition exceeds reported supply. This is calculated as the difference between Part 4 Line 7.0 and Part 6 Line 19.0, and is reported as Line 20.0 in the original form.

core_eia176__yearly_gas_disposition_by_consumer

package: pudl

Annual time series of a company's natural gas deliveries to end-use consumers within the report state.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

EIA Form 176 -- Annual Report of Natural and Supplemental Gas Supply and Disposition (Part 6, Lines 10.0-11.6)

Primary key:

report_year, operator_id_eia, customer_class, revenue_class

Columns
report_year

Four-digit year in which the data was reported.

operator_id_eia

The unique EIA identifier for an operator in a given state. The last two letters of the ID indicate the state.

operating_state

State that the operator is reporting for.

customer_class

High level categorization of customer type (e.g., commercial, residential).

revenue_class

Source of revenue: whether revenue originates from gas owned directly by the operator (sales) or gas transported by the operator (transport).

consumers

Number of end-use consumers within the report state.

revenue

Revenue including taxes, rounded to the nearest whole dollar.

volume_mcf

Total volume of natural gas deliveries in the report state. Reference conditions for measurement are 14.73 psia and 60° Fahrenheit.
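Because the primary key includes customer_class and revenue_class, each operator has several rows per year. A minimal sketch of rolling volume_mcf up to one value per operator and year is shown below. The rows and the operator ID are made up for illustration, and skipping null volumes is an assumption.

```python
from collections import defaultdict

# Made-up rows shaped like this table; the operator ID is a placeholder.
rows = [
    {"report_year": 2022, "operator_id_eia": "EXAMPLE01XX",
     "customer_class": "residential", "revenue_class": "sales", "volume_mcf": 700.0},
    {"report_year": 2022, "operator_id_eia": "EXAMPLE01XX",
     "customer_class": "commercial", "revenue_class": "sales", "volume_mcf": 50.0},
    {"report_year": 2022, "operator_id_eia": "EXAMPLE01XX",
     "customer_class": "commercial", "revenue_class": "transport", "volume_mcf": 40.0},
    {"report_year": 2022, "operator_id_eia": "EXAMPLE01XX",
     "customer_class": "residential", "revenue_class": "transport", "volume_mcf": None},
]


def total_consumer_volume(rows):
    """Sum volume_mcf over customer and revenue classes per (year, operator)."""
    totals = defaultdict(float)
    for row in rows:
        if row["volume_mcf"] is None:
            continue  # nulls are skipped rather than treated as zero deliveries
        totals[(row["report_year"], row["operator_id_eia"])] += row["volume_mcf"]
    return dict(totals)


totals = total_consumer_volume(rows)
print(totals)
```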

core_eia860__assn_boiler_cooling

package: pudl

Association table providing connections between EIA boiler IDs and EIA cooling system IDs.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

EIA Form 860 -- Annual Electric Generator Report

Primary key:

report_date, plant_id_eia, boiler_id, cooling_id_eia

Columns
report_date

Date reported.

plant_id_eia

The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.

boiler_id

Alphanumeric boiler ID.

cooling_id_eia

EIA identification code for the cooling system. If multiple cooling systems are not distinguished by separate IDs, the word 'PLANT' is listed to encompass the cooling system for the entire plant.

data_maturity

Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.

core_eia860__assn_boiler_generator

package: pudl

Association table providing connections between boilers and generators.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

EIA Form 860 -- Annual Electric Generator Report (Schedule 6, Part A)

Primary key:

plant_id_eia, report_date, generator_id, boiler_id

Usage Warnings

  • Contains information from multiple raw inputs.

Additional Details

Augmented with various heuristics within PUDL.

Columns
plant_id_eia

The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.

report_date

Date reported.

generator_id

Generator ID is usually numeric, but sometimes includes letters. Make sure you treat it as a string!

boiler_id

Alphanumeric boiler ID.

unit_id_eia

EIA-assigned unit identification code.

unit_id_pudl

Dynamically assigned PUDL unit id. WARNING: This ID is not guaranteed to be static long term as the input data and algorithm may evolve over time.

boiler_generator_assn_type_code

Indicates whether boiler associations with generator during the year were actual or theoretical. Only available before 2013.

steam_plant_type_code

Code that describes types of steam plants from EIA 860. See steam_plant_types_eia table for more details.

bga_source

The source from which the unit_id_pudl was compiled. The unit_id_pudl comes either directly from EIA 860, from string association (which looks at all the boilers and generators that are not associated with a unit and tries to find a matching string in the respective collection of boilers or generators), or from a unit connection (where the unit_id_eia is used to find additional boiler-generator connections).

data_maturity

Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.
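The advice to treat generator_id as a string can be illustrated with a small sketch. The rows below are made up. Python's csv module keeps every field as text, which preserves IDs that an integer conversion would either merge or fail to parse.

```python
import csv
import io

# Made-up rows shaped like this table, for illustration only.
raw = io.StringIO(
    "plant_id_eia,report_date,generator_id,boiler_id\n"
    "3,2020-01-01,1,1\n"
    "3,2020-01-01,01,1A\n"
    "3,2020-01-01,GT1,B2\n"
)
rows = list(csv.DictReader(raw))

# As strings, the three generator IDs stay distinct.
string_ids = [row["generator_id"] for row in rows]


def as_int_or_none(text):
    """Try to read an ID as an integer; return None when it is not numeric."""
    try:
        return int(text)
    except ValueError:
        return None


# As integers, "1" and "01" collapse to the same value and "GT1" is lost.
int_ids = [as_int_or_none(text) for text in string_ids]
print(string_ids, int_ids)
```

The same concern applies to boiler_id, which is also alphanumeric.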

core_eia860__assn_boiler_stack_flue

package: pudl

Association table providing connections between EIA boiler IDs and EIA stack and/or flue system IDs.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

EIA Form 860 -- Annual Electric Generator Report

Primary key:

report_date, plant_id_eia, boiler_id, stack_flue_id_pudl

Columns
report_date

Date reported.

plant_id_eia

The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.

boiler_id

Alphanumeric boiler ID.

stack_id_eia

The stack identification value reported to EIA. Stacks or chimneys are the place where emissions from the combustion process are released into the atmosphere. This field was reported in conjunction with flue_id_eia until 2013 when stack_flue_id_eia took their place.

flue_id_eia

The flue identification value reported to EIA. The flue is a duct, pipe, or opening that transports exhaust gases through the stack. This field was reported in conjunction with stack_id_eia until 2013 when stack_flue_id_eia took their place.

stack_flue_id_eia

The stack or flue identification value reported to EIA. This denotes the place where emissions from the combustion process are released into the atmosphere. Prior to 2013, this was reported as stack_id_eia and flue_id_eia.

stack_flue_id_pudl

A stack and/or flue identification value created by PUDL for use as part of the primary key for the stack flue equipment and boiler association tables. For 2013 and onward, this value is equal to the value for stack_flue_id_eia. Prior to 2013, this value is equal to the value for stack_id_eia and the value for flue_id_eia separated by an underscore, or just the stack_id_eia in cases where flue_id_eia is NA.
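The rule for building stack_flue_id_pudl can be sketched as a small function. This is an illustration of the description above, not PUDL's actual implementation: the function signature is hypothetical, the IDs are made up, and it assumes that when flue_id_eia is missing before 2013 the stack ID alone is used.

```python
def stack_flue_id_pudl(report_year, stack_id_eia=None, flue_id_eia=None,
                       stack_flue_id_eia=None):
    """Build a PUDL stack/flue ID from the EIA IDs available in a given year."""
    if report_year >= 2013:
        # From 2013 onward EIA reports a single combined ID.
        return stack_flue_id_eia
    if flue_id_eia is None:
        # Assumption: with no flue ID, fall back to the stack ID alone.
        return stack_id_eia
    return f"{stack_id_eia}_{flue_id_eia}"


# Made-up IDs, for illustration only.
print(stack_flue_id_pudl(2010, stack_id_eia="S1", flue_id_eia="F2"))
print(stack_flue_id_pudl(2010, stack_id_eia="S1"))
print(stack_flue_id_pudl(2015, stack_flue_id_eia="SF3"))
```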

core_eia860__assn_yearly_boiler_emissions_control_equipment

package: pudl

Association table providing connections between EIA boiler IDs and emissions control IDs for NOx, SO2, mercury, and particulate monitoring.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

EIA Form 860 -- Annual Electric Generator Report

Primary key:

report_date, plant_id_eia, boiler_id, emission_control_id_type, emission_control_id_eia

Usage Warnings

  • The relationship between the IDs is sometimes many to many.

Columns
report_date

Date reported.

plant_id_eia

The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.

boiler_id

Alphanumeric boiler ID.

emission_control_id_type

The type of emissions control id: SO2, NOx, particulate, or mercury.

emission_control_id_eia

The emission control ID used to collect SO2, NOx, particulate, and mercury emissions data. This column should be used in conjunction with emission_control_id_type, as it is not guaranteed to be unique on its own.

data_maturity

Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.
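A short sketch of why emission_control_id_eia should be paired with emission_control_id_type: the same ID can appear under different control types. The rows below are made up, and the lowercase type labels are an assumption about how the types are stored.

```python
from collections import defaultdict

# Made-up rows: (boiler_id, emission_control_id_type, emission_control_id_eia)
rows = [
    ("B1", "so2", "1"),
    ("B1", "nox", "1"),
    ("B2", "nox", "1"),
]

by_id_only = defaultdict(set)
by_type_and_id = defaultdict(set)
for boiler_id, control_type, control_id in rows:
    by_id_only[control_id].add(boiler_id)
    by_type_and_id[(control_type, control_id)].add(boiler_id)

# Grouping on the ID alone merges an SO2 control and a NOx control.
print(len(by_id_only), len(by_type_and_id))
```

In this example the NOx control "1" serves two boilers, while boiler B1 is linked to two controls, which is the many-to-many relationship the usage warning describes.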

core_eia860__scd_boilers

package: pudl

Slowly changing dimension (SCD) table describing attributes of boilers which may vary from year to year. Compiled from across all EIA-860 data.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

EIA Form 860 -- Annual Electric Generator Report

Primary key:

plant_id_eia, boiler_id, report_date

Usage Warnings

  • Data has been drawn from several EIA sources which are not always consistent with each other, and PUDL chooses the most consistent or relevant value to facilitate cross-referencing even if that means some values will differ from the raw sources. See Harvesting for details.

  • Contains information from multiple raw inputs.

Additional Details

This is one of two tables where canonical values for boilers are set. It contains values which are expected to vary slowly, while core_eia__entity_boilers contains those expected to remain fixed. EIA reports many attributes in many different tables across EIA-860 and EIA-923. In order to compile tidy, well-normalized database tables, PUDL collects all instances of these values and chooses a canonical value. By default, PUDL chooses the most consistently reported value of a given attribute as long as it accounts for at least 70% of the reported instances. If an attribute was reported inconsistently across the original EIA tables, then it will show up as a null value. All tables downstream of this one inherit the canonical values established here.
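The 70% consistency rule can be sketched as follows. This is a simplified illustration, not PUDL's harvesting code: the function name is hypothetical, the sample values are made up, and excluding nulls from the denominator is an assumption.

```python
from collections import Counter


def canonical_value(reported, threshold=0.7):
    """Return the most common non-null value if it meets the threshold, else None."""
    observed = [v for v in reported if v is not None]
    if not observed:
        return None
    value, count = Counter(observed).most_common(1)[0]
    return value if count / len(observed) >= threshold else None


# Made-up reported values for one attribute of one boiler in one year.
print(canonical_value(["A", "A", "A", "B"]))  # 3 of 4 instances agree (75%)
print(canonical_value(["A", "A", "B"]))       # 2 of 3 instances agree (about 67%)
```

The first call returns the dominant value, while the second returns None, which corresponds to the null values described above for inconsistently reported attributes.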

Columns
plant_id_eia

The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.

boiler_id

Alphanumeric boiler ID.

report_date

Date reported.

boiler_operating_date

Date the boiler began or is planned to begin commercial operation.

boiler_status

EIA short code identifying boiler operational status.

boiler_retirement_date

Date of the scheduled or effected retirement of the boiler.

boiler_type

EIA short code indicating the standards under which the boiler is operating as described in the U.S. EPA regulation under 40 CFR.

firing_type_1

EIA short code indicating the type of firing used by this boiler.

firing_type_2

EIA short code indicating the type of firing used by this boiler.

firing_type_3

EIA short code indicating the type of firing used by this boiler.

firing_rate_using_coal_tons_per_hour

Design firing rate at maximum continuous steam flow for coal to the nearest 0.1 ton per hour.

firing_rate_using_oil_bbls_per_hour

Design firing rate at maximum continuous steam flow for petroleum to the nearest 0.1 barrels per hour.

firing_rate_using_gas_mcf_per_hour

Design firing rate at maximum continuous steam flow for gas to the nearest 0.1 thousand cubic feet (Mcf) per hour.

firing_rate_using_other_fuels

Design firing rate at maximum continuous steam flow for energy sources other than coal, petroleum, or natural gas.

boiler_fuel_code_1

The code representing the most predominant type of energy that fuels the boiler.

boiler_fuel_code_2

The code representing the second most predominant type of energy that fuels the boiler.

boiler_fuel_code_3

The code representing the third most predominant type of energy that fuels the boiler.

boiler_fuel_code_4

The code representing the fourth most predominant type of energy that fuels the boiler.

waste_heat_input_mmbtu_per_hour

Design waste-heat input rate at maximum continuous steam flow, in MMBtu per hour. A waste-heat boiler is a boiler that receives all or a substantial portion of its energy input from the noncombustible exhaust gases of a separate fuel-burning process.

wet_dry_bottom

Wet or Dry Bottom where Wet Bottom is defined as slag tanks that are installed at furnace throat to contain and remove molten ash from the furnace, and Dry Bottom is defined as having no slag tanks at furnace throat area, throat area is clear, and bottom ash drops through throat to bottom ash water hoppers.

fly_ash_reinjection

Indicates whether the boiler is capable of re-injecting fly ash.

hrsg

Indicates whether the boiler is a heat recovery steam generator (HRSG).

max_steam_flow_lbs_per_hour

Maximum continuous steam flow at 100 percent load.

turndown_ratio

The turndown ratio for the boiler.

efficiency_100pct_load

Boiler efficiency percentage when burning at 100 percent load to the nearest 0.1 percent.

efficiency_50pct_load

Boiler efficiency percentage when burning at 50 percent load to the nearest 0.1 percent.

air_flow_100pct_load_cubic_feet_per_minute

Total air flow including excess air at 100 percent load, reported at standard temperature and pressure (i.e. 68 F and one atmosphere pressure).

new_source_review

Indicates whether the boiler is subject to New Source Review requirements.

new_source_review_date

Month of issued New Source Review permit.

new_source_review_permit

New Source Review permit number.

regulation_particulate

EIA short code for most stringent type of statute or regulation code under which the boiler is operating for particulate matter control standards.

regulation_so2

EIA short code for most stringent type of statute or regulation code under which the boiler is operating for sulfur dioxide control standards.

regulation_nox

EIA short code for most stringent type of statute or regulation code under which the boiler is operating for nitrogen oxide control standards.

standard_particulate_rate

Numeric value for the unit of measurement specified for particulate matter.

standard_so2_rate

Numeric value for the unit of measurement specified for sulfur dioxide.

standard_nox_rate

Numeric value for the unit of measurement specified for nitrogen oxide.

unit_particulate

Numeric value for the unit of measurement specified for particulate matter.

unit_so2

Numeric value for the unit of measurement specified for sulfur dioxide.

unit_nox

Numeric value for the unit of measurement specified for nitrogen oxide.

compliance_year_particulate

Year boiler was or is expected to be in compliance with federal, state and/or local regulations for particulate matter emissions.

compliance_year_nox

Year boiler was or is expected to be in compliance with federal, state and/or local regulations for nitrogen oxide emissions.

compliance_year_so2

Year boiler was or is expected to be in compliance with federal, state and/or local regulations for sulfur dioxide emissions.

particulate_control_out_of_compliance_strategy_1

If boiler is not in compliance with particulate matter regulations, strategy for compliance.

particulate_control_out_of_compliance_strategy_2

If boiler is not in compliance with particulate matter regulations, strategy for compliance.

particulate_control_out_of_compliance_strategy_3

If boiler is not in compliance with particulate matter regulations, strategy for compliance.

so2_control_out_of_compliance_strategy_1

If boiler is not in compliance with sulfur dioxide regulations, strategy for compliance.

so2_control_out_of_compliance_strategy_2

If boiler is not in compliance with sulfur dioxide regulations, strategy for compliance.

so2_control_out_of_compliance_strategy_3

If boiler is not in compliance with sulfur dioxide regulations, strategy for compliance.

so2_control_existing_caaa_compliance_strategy_1

Existing strategies to meet the sulfur dioxide requirements of Title IV of the Clean Air Act Amendment of 1990.

so2_control_existing_caaa_compliance_strategy_2

Existing strategies to meet the sulfur dioxide requirements of Title IV of the Clean Air Act Amendment of 1990.

so2_control_existing_caaa_compliance_strategy_3

Existing strategies to meet the sulfur dioxide requirements of Title IV of the Clean Air Act Amendment of 1990.

so2_control_planned_caaa_compliance_strategy_1

Planned strategies to meet the sulfur dioxide requirements of Title IV of the Clean Air Act Amendment of 1990.

so2_control_planned_caaa_compliance_strategy_2

Planned strategies to meet the sulfur dioxide requirements of Title IV of the Clean Air Act Amendment of 1990.

so2_control_planned_caaa_compliance_strategy_3

Planned strategies to meet the sulfur dioxide requirements of Title IV of the Clean Air Act Amendment of 1990.

nox_control_out_of_compliance_strategy_1

If boiler is not in compliance with nitrogen oxide regulations, strategy for compliance.

nox_control_out_of_compliance_strategy_2

If boiler is not in compliance with nitrogen oxide regulations, strategy for compliance.

nox_control_out_of_compliance_strategy_3

If boiler is not in compliance with nitrogen oxide regulations, strategy for compliance.

nox_control_existing_caaa_compliance_strategy_1

Existing strategies to meet the nitrogen oxide requirements of Title IV of the Clean Air Act Amendment of 1990.

nox_control_existing_caaa_compliance_strategy_2

Existing strategies to meet the nitrogen oxide requirements of Title IV of the Clean Air Act Amendment of 1990.

nox_control_existing_caaa_compliance_strategy_3

Existing strategies to meet the nitrogen oxide requirements of Title IV of the Clean Air Act Amendment of 1990.

nox_control_planned_caaa_compliance_strategy_1

Planned strategies to meet the nitrogen oxide requirements of Title IV of the Clean Air Act Amendment of 1990.

nox_control_planned_caaa_compliance_strategy_2

Planned strategies to meet the nitrogen oxide requirements of Title IV of the Clean Air Act Amendment of 1990.

nox_control_planned_caaa_compliance_strategy_3

Planned strategies to meet the nitrogen oxide requirements of Title IV of the Clean Air Act Amendment of 1990.

compliance_year_mercury

Year boiler was or is expected to be in compliance with federal, state and/or local regulations for mercury emissions.

mercury_control_existing_strategy_1

Existing strategy to comply with the most stringent mercury regulation.

mercury_control_existing_strategy_2

Existing strategy to comply with the most stringent mercury regulation.

mercury_control_existing_strategy_3

Existing strategy to comply with the most stringent mercury regulation.

mercury_control_existing_strategy_4

Existing strategy to comply with the most stringent mercury regulation.

mercury_control_existing_strategy_5

Existing strategy to comply with the most stringent mercury regulation.

mercury_control_existing_strategy_6

Existing strategy to comply with the most stringent mercury regulation.

mercury_control_proposed_strategy_1

Proposed strategy to comply with the most stringent mercury regulation.

mercury_control_proposed_strategy_2

Proposed strategy to comply with the most stringent mercury regulation.

mercury_control_proposed_strategy_3

Proposed strategy to comply with the most stringent mercury regulation.

nox_control_existing_strategy_1

Existing strategy to comply with the most stringent nitrogen oxide regulation.

nox_control_existing_strategy_2

Existing strategy to comply with the most stringent nitrogen oxide regulation.

nox_control_existing_strategy_3

Existing strategy to comply with the most stringent nitrogen oxide regulation.

nox_control_manufacturer

Name of nitrogen oxide control manufacturer.

nox_control_manufacturer_code

Code indicating the nitrogen oxide control burner manufacturer.

nox_control_proposed_strategy_1

Proposed strategy to comply with the most stringent nitrogen oxide regulation.

nox_control_proposed_strategy_2

Proposed strategy to comply with the most stringent nitrogen oxide regulation.

nox_control_proposed_strategy_3

Proposed strategy to comply with the most stringent nitrogen oxide regulation.

nox_control_status_code

Nitrogen oxide control status code.

regulation_mercury

Most stringent type of statute or regulation code under which the boiler is operating for mercury control standards.

so2_control_existing_strategy_1

Existing strategy to comply with the most stringent sulfur dioxide regulation.

so2_control_existing_strategy_2

Existing strategy to comply with the most stringent sulfur dioxide regulation.

so2_control_existing_strategy_3

Existing strategy to comply with the most stringent sulfur dioxide regulation.

so2_control_proposed_strategy_1

Proposed strategy to comply with the most stringent sulfur dioxide regulation.

so2_control_proposed_strategy_2

Proposed strategy to comply with the most stringent sulfur dioxide regulation.

so2_control_proposed_strategy_3

Proposed strategy to comply with the most stringent sulfur dioxide regulation.

standard_so2_percent_scrubbed

The percent of sulfur dioxide to be scrubbed specified by the most stringent sulfur dioxide regulation.

data_maturity

Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.

core_eia860__scd_emissions_control_equipment

package: pudl

Slowly changing dimension (SCD) table describing attributes of emissions control equipment reported to EIA, including cost, type, operating status, retirement date, and install year.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

EIA Form 860 -- Annual Electric Generator Report

Primary key:

report_year, plant_id_eia, emission_control_id_pudl

Usage Warnings

  • Data has been drawn from several EIA sources which are not always consistent with each other, and PUDL chooses the most consistent or relevant value to facilitate cross-referencing even if that means some values will differ from the raw sources. See Harvesting for details.

Additional Details

Includes control ids for sulfur dioxide (SO2), particulate matter, mercury, nitrogen oxide (NOX), and acid (HCl) gas monitoring.

This table inherits canonicalized values for plants and utilities. EIA reports many attributes in many different tables across EIA-860 and EIA-923. In order to compile tidy, well-normalized database tables, PUDL collects all instances of these values and chooses a canonical value. By default, PUDL chooses the most consistently reported value of a given attribute as long as it accounts for at least 70% of the reported instances. If an attribute was reported inconsistently across the original EIA tables, then it will show up as a null value.

Columns
report_year

Four-digit year in which the data was reported.

plant_id_eia

The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.

emission_control_id_pudl

A PUDL-generated ID used to distinguish emission control units in the same report year and plant id. This ID should not be used to track units over time or between plants.

data_maturity

Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.

emission_control_equipment_type_code

Short code indicating the type of emission control equipment installed.

operational_status_code

The operating status of the asset.

mercury_control_id_eia

Mercury control identification number. This ID is not a unique identifier.

nox_control_id_eia

Nitrogen oxide control identification number. This ID is not a unique identifier.

particulate_control_id_eia

Particulate matter control identification number. This ID is not a unique identifier.

so2_control_id_eia

Sulfur dioxide control identification number. This ID is not a unique identifier.

acid_gas_control

Indicates whether the emissions control equipment controls acid (HCl) gas.

emission_control_equipment_cost

The total cost to install a piece of emission control equipment.

emission_control_operating_date

The date a piece of emissions control equipment began operating. Derived from month and year columns in the raw data.

emission_control_retirement_date

The expected or actual retirement date for a piece of emissions control equipment. Derived from month and year columns in the raw data.

core_eia860__scd_generators

package: pudl

Slowly changing dimension (SCD) table describing attributes of generators which may vary from year to year. Compiled from across EIA-860 and EIA-923 data.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

EIA Form 860 -- Annual Electric Generator Report

Primary key:

plant_id_eia, generator_id, report_date

Usage Warnings

  • Data has been drawn from several EIA sources which are not always consistent with each other, and PUDL chooses the most consistent or relevant value to facilitate cross-referencing even if that means some values will differ from the raw sources. See Harvesting for details.

  • Contains information from multiple raw inputs.

Additional Details

This is one of two tables where canonical values for generators are set. It contains values which are expected to vary slowly, while core_eia__entity_generators contains those expected to remain fixed. EIA reports many attributes in many different tables across EIA-860 and EIA-923. In order to compile tidy, well-normalized database tables, PUDL collects all instances of these values and chooses a canonical value. By default, PUDL chooses the most consistently reported value of a given attribute as long as it accounts for at least 70% of the reported instances. If an attribute was reported inconsistently across the original EIA tables, then it will show up as a null value. All tables downstream of this one inherit the canonical values established here.

Columns
plant_id_eia

The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.

generator_id

Generator ID is usually numeric, but sometimes includes letters. Make sure you treat it as a string!

utility_id_eia

The EIA Utility Identification number.

report_date

Date reported.

operational_status_code

The operating status of the asset.

operational_status

The operating status of the asset. For generators, this is based on which tab the generator was listed in within EIA 860.

ownership_code

Identifies the ownership for each generator.

capacity_mw

Total installed (nameplate) capacity, in megawatts.

summer_capacity_mw

The net summer capacity.

summer_capacity_estimate

Whether the summer capacity value was an estimate

winter_capacity_mw

The net winter capacity.

winter_capacity_estimate

Whether the winter capacity value was an estimate

net_capacity_mwdc

Generation capacity in megawatts of direct current that is subject to a net metering agreement. Typically used for behind-the-meter solar PV.

energy_storage_capacity_mwh

Energy storage capacity in MWh (e.g. for batteries).

prime_mover_code

Code for the type of prime mover (e.g. CT, CG)

energy_source_code_1

The code representing the most predominant type of energy that fuels the generator.

energy_source_code_2

The code representing the second most predominant type of energy that fuels the generator

energy_source_code_3

The code representing the third most predominant type of energy that fuels the generator

energy_source_code_4

The code representing the fourth most predominant type of energy that fuels the generator

energy_source_code_5

The code representing the fifth most predominant type of energy that fuels the generator

energy_source_code_6

The code representing the sixth most predominant type of energy that fuels the generator

energy_source_1_transport_1

Primary mode of transport for energy source 1.

energy_source_1_transport_2

Secondary mode of transport for energy source 1.

energy_source_1_transport_3

Tertiary mode of transport for energy source 1.

energy_source_2_transport_1

Primary mode of transport for energy source 2.

energy_source_2_transport_2

Secondary mode of transport for energy source 2.

energy_source_2_transport_3

Tertiary mode of transport for energy source 2.

fuel_type_code_pudl

Simplified fuel type code used in PUDL

can_burn_multiple_fuels

Whether the generator can burn multiple fuels.

deliver_power_transgrid

Indicate whether the generator can deliver power to the transmission grid.

distributed_generation

Whether the generator is considered distributed generation

synchronized_transmission_grid

Indicates whether standby generators (SB status) can be synchronized to the grid.

turbines_num

Number of wind turbines, or hydrokinetic buoys.

planned_modifications

Indicates whether there are any planned capacity uprates/derates, repowering, other modifications, or generator retirements scheduled for the next 5 years.

planned_net_summer_capacity_uprate_mw

Increase in summer capacity expected to be realized from the modification to the equipment.

planned_net_winter_capacity_uprate_mw

Increase in winter capacity expected to be realized from the uprate modification to the equipment.

planned_uprate_date

Planned effective date that the generator is scheduled to enter operation after the uprate modification.

planned_net_summer_capacity_derate_mw

Decrease in summer capacity expected to be realized from the derate modification to the equipment.

planned_net_winter_capacity_derate_mw

Decrease in winter capacity expected to be realized from the derate modification to the equipment.

planned_derate_date

Planned effective month that the generator is scheduled to enter operation after the derate modification.

planned_new_prime_mover_code

New prime mover for the planned repowered generator.

planned_energy_source_code_1

New energy source code for the planned repowered generator.

planned_repower_date

Planned effective date that the generator is scheduled to enter operation after the repowering is complete.

other_planned_modifications

Indicates whether there are other modifications planned for the generator.

other_modifications_date

Planned effective date that the generator is scheduled to enter commercial operation after any other planned modification is complete.

planned_generator_retirement_date

Planned effective date of the scheduled retirement of the generator.

carbon_capture

Indicates whether the generator uses carbon capture technology.

startup_source_code_1

The code representing the first, second, third or fourth start-up and flame stabilization energy source used by the combustion unit(s) associated with this generator.

startup_source_code_2

The code representing the first, second, third or fourth start-up and flame stabilization energy source used by the combustion unit(s) associated with this generator.

startup_source_code_3

The code representing the first, second, third or fourth start-up and flame stabilization energy source used by the combustion unit(s) associated with this generator.

startup_source_code_4

The code representing the first, second, third or fourth start-up and flame stabilization energy source used by the combustion unit(s) associated with this generator.

technology_description

High level description of the technology used by the generator to produce electricity.

turbines_inverters_hydrokinetics

Number of wind turbines, or hydrokinetic buoys.

time_cold_shutdown_full_load_code

The minimum amount of time required to bring the unit to full load from shutdown.

planned_new_capacity_mw

The expected new nameplate capacity for the generator.

can_cofire_fuels

Whether the generator can co-fire fuels.

can_switch_oil_gas

Whether the generator can switch between oil and natural gas.

nameplate_power_factor

The nameplate power factor of the generator.

minimum_load_mw

The minimum load at which the generator can operate continuously.

uprate_derate_during_year

Was an uprate or derate completed on this generator during the reporting year?

uprate_derate_completed_date

The date when the uprate or derate was completed.

current_planned_generator_operating_date

The most recently updated effective date on which the generator is scheduled to start operation

summer_estimated_capability_mw

EIA estimated summer capacity (in MW).

winter_estimated_capability_mw

EIA estimated winter capacity (in MW).

generator_retirement_date

Date of the scheduled or effected retirement of the generator.

owned_by_non_utility

Whether any part of the generator is owned by a non-utility.

reactive_power_output_mvar

Reactive Power Output (MVAr)

ferc_qualifying_facility

Indicates whether or not a generator is a qualifying FERC cogeneration facility.

data_maturity

Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.

core_eia860__scd_generators_energy_storage

package: pudl

Slowly changing dimension (SCD) table describing attributes of energy storage which may vary from year to year.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

EIA Form 860 -- Annual Electric Generator Report (Schedule 3)

Primary key:

plant_id_eia, generator_id, report_date

Additional Details

This table includes only those values that are unique to energy storage. The rest of the columns that are reported in the EIA-860 Energy Storage tabs are included in core_eia860__scd_generators and core_eia__entity_generators.
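Because the storage-specific columns and the general generator columns live in separate tables that share the same primary key, they can be recombined with a join on (plant_id_eia, generator_id, report_date). The sketch below shows the idea with plain dictionaries; the rows are made up for illustration and join_on_key is a hypothetical helper, not a PUDL function.

```python
# Hypothetical rows keyed by (plant_id_eia, generator_id, report_date).
generators = {
    (1001, "BESS1", "2022-01-01"): {"capacity_mw": 20.0, "energy_storage_capacity_mwh": 80.0},
    (1001, "GT1", "2022-01-01"): {"capacity_mw": 50.0, "energy_storage_capacity_mwh": None},
}
storage = {
    (1001, "BESS1", "2022-01-01"): {"max_charge_rate_mw": 20.0, "max_discharge_rate_mw": 20.0},
}

def join_on_key(left, right):
    """Inner join two dicts of records that share the same key tuple."""
    return {key: {**left[key], **right[key]} for key in left.keys() & right.keys()}

# Only the storage generator appears in both tables, so only it is returned.
combined = join_on_key(generators, storage)
print(combined)
```

An inner join keeps only generators that appear in the energy storage table. A left join from the generators table would instead keep every generator and leave the storage columns null where they do not apply.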

Columns
plant_id_eia

The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.

generator_id

Generator ID is usually numeric, but sometimes includes letters. Make sure you treat it as a string!

report_date

Date reported.

max_charge_rate_mw

Maximum charge rate in MW.

max_discharge_rate_mw

Maximum discharge rate in MW.

storage_enclosure_code

A code representing the enclosure type that best describes where the generator is located.

storage_technology_code_1

The electro-chemical storage technology used for this battery application.

storage_technology_code_2

The electro-chemical storage technology used for this battery application.

storage_technology_code_3

The electro-chemical storage technology used for this battery application.

storage_technology_code_4

The electro-chemical storage technology used for this battery application.

served_arbitrage

Whether the energy storage device served arbitrage applications during the reporting year

served_backup_power

Whether the energy storage device served backup power applications during the reporting year.

served_co_located_renewable_firming

Whether the energy storage device served renewable firming applications during the reporting year.

served_frequency_regulation

Whether the energy storage device served frequency regulation applications during the reporting year.

served_load_following

Whether the energy storage device served load following applications during the reporting year.

served_load_management

Whether the energy storage device served load management applications during the reporting year.

served_ramping_spinning_reserve

Whether the energy storage device served ramping / spinning reserve applications during the reporting year.

served_system_peak_shaving

Whether the energy storage device served system peak shaving applications during the reporting year.

served_transmission_and_distribution_deferral

Whether the energy storage device served transmission and distribution deferral applications during the reporting year.

served_voltage_or_reactive_power_support

Whether the energy storage device served voltage or reactive power support applications during the reporting year.

stored_excess_wind_and_solar_generation

Whether the energy storage device was used to store excess wind/solar generation during the reporting year.

is_ac_coupled

Indicates if this energy storage device is AC-coupled (means the energy storage device and the PV system are not installed on the same side of an inverter).

is_dc_coupled

Indicates if this energy storage device is DC-coupled (means the energy storage device and the PV system are on the same side of an inverter and the battery can still charge from the grid).

is_dc_coupled_tightly

Indicates if this energy storage device is DC tightly coupled (means the energy storage device and the PV system are on the same side of an inverter and the battery cannot charge from the grid).

is_independent

Indicates if this energy storage device is independent (not coupled with another generator).

is_transmission_and_distribution_asset_support

Indicates if the energy storage system is intended to support a specific substation, transmission or distribution asset.

is_direct_support

Indicates if this energy storage device is intended for dedicated generator firming or storing excess generation of other units.

plant_id_eia_direct_support_1

The EIA Plant ID of the primary unit whose generation this energy storage device is intended to firm or store.

generator_id_direct_support_1

The EIA Generator ID of the primary unit whose generation this energy storage device is intended to firm or store.

plant_id_eia_direct_support_2

The EIA Plant ID of the secondary unit whose generation this energy storage device is intended to firm or store.

generator_id_direct_support_2

The EIA Generator ID of the secondary unit whose generation this energy storage device is intended to firm or store.

plant_id_eia_direct_support_3

The EIA Plant ID of the tertiary unit whose generation this energy storage device is intended to firm or store.

generator_id_direct_support_3

The EIA Generator ID of the tertiary unit whose generation this energy storage device is intended to firm or store.

core_eia860__scd_generators_multifuel

package: pudl

Slowly changing dimension (SCD) table describing attributes of generators that may vary from year to year, pertaining to fuel-switching and the use of multiple fuels.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

EIA Form 860 -- Annual Electric Generator Report (Schedule 3)

Primary key:

report_date, utility_id_eia, generator_id, plant_id_eia

Columns
report_date

Date reported.

utility_id_eia

The EIA Utility Identification number.

utility_name_eia

The name of the utility.

plant_id_eia

The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.

plant_name_eia

Plant name.

state

Two letter US state abbreviation.

county

County name.

generator_id

Generator ID is usually numeric, but sometimes includes letters. Make sure you treat it as a string!

operational_status_code

The operating status of the asset.

technology_description

High level description of the technology used by the generator to produce electricity.

prime_mover_code

Code for the type of prime mover (e.g. CT, CG)

sector_name_eia

EIA assigned sector name, corresponding to high level NAICS sector, designated by the primary purpose, regulatory status and plant-level combined heat and power status

sector_id_eia

EIA assigned sector ID, corresponding to high level NAICS sector, designated by the primary purpose, regulatory status and plant-level combined heat and power status

capacity_mw

Total installed (nameplate) capacity, in megawatts.

summer_capacity_mw

The net summer capacity.

winter_capacity_mw

The net winter capacity.

current_planned_generator_operating_date

The most recently updated effective date on which the generator is scheduled to start operation

energy_source_code_1

The code representing the most predominant type of energy that fuels the generator.

energy_source_code_2

The code representing the second most predominant type of energy that fuels the generator

can_burn_multiple_fuels

Whether the generator can burn multiple fuels.

can_cofire_fuels

Whether the generator can co-fire fuels.

cofire_energy_source_1

The codes representing the type of fuel that will be co-fired.

cofire_energy_source_2

The codes representing the type of fuel that will be co-fired.

cofire_energy_source_3

The codes representing the type of fuel that will be co-fired.

cofire_energy_source_4

The codes representing the type of fuel that will be co-fired.

cofire_energy_source_5

The codes representing the type of fuel that will be co-fired.

cofire_energy_source_6

The codes representing the type of fuel that will be co-fired.

can_switch_oil_gas

Whether the generator can switch between oil and natural gas.

time_to_switch_oil_to_gas

The time required to switch the generator from running 100 percent oil to running 100 percent natural gas.

time_to_switch_gas_to_oil

The time required to switch the generator from running 100 percent natural gas to running 100 percent oil.

can_switch_when_operating

Indicates whether a fuel switching generator can switch fuels while operating.

net_summer_capacity_natural_gas_mw

The maximum net summer output achievable when running on natural gas.

net_summer_capacity_oil_mw

The maximum net summer output achievable when running on oil.

net_winter_capacity_natural_gas_mw

The maximum net winter output achievable when running on natural gas.

net_winter_capacity_oil_mw

The maximum net winter output achievable when running on oil.

has_factors_that_limit_switching

Whether there are factors that limit the generator's ability to switch between oil and natural gas.

has_storage_limits

Whether limited on-site fuel storage is a factor that limits the generator's ability to switch between oil and natural gas.

has_air_permit_limits

Whether air permit limits are a factor that limits the generator's ability to switch between oil and natural gas.

has_other_factors_that_limit_switching

Whether there are factors other than air permit limits and storage that limit the generator's ability to switch between oil and natural gas.

can_cofire_oil_and_gas

Whether the generator can co-fire oil and gas.

can_cofire_100_oil

Whether the generator can co-fire 100 percent oil.

max_oil_heat_input

The maximum oil heat input (percent of MMBtus) expected for proposed unit when co-firing with natural gas

max_oil_output_mw

The maximum output (net MW) expected for proposed unit, when making the maximum use of oil and co-firing natural gas.

can_fuel_switch

Whether a unit is able to switch fuels.

has_regulatory_limits

Whether there are factors that limit the operation of the generator when running on 100 percent oil

fuel_switch_energy_source_1

The codes representing the type of fuel that will be able to be used as a sole source of fuel for this unit.

fuel_switch_energy_source_2

The codes representing the type of fuel that will be able to be used as a sole source of fuel for this unit.

fuel_switch_energy_source_3

The codes representing the type of fuel that will be able to be used as a sole source of fuel for this unit.

fuel_switch_energy_source_4

The codes representing the type of fuel that will be able to be used as a sole source of fuel for this unit.

fuel_switch_energy_source_5

The codes representing the type of fuel that will be able to be used as a sole source of fuel for this unit.

fuel_switch_energy_source_6

The codes representing the type of fuel that will be able to be used as a sole source of fuel for this unit.

core_eia860__scd_generators_solar

package: pudl

Slowly changing dimension (SCD) table describing attributes of solar generators that may vary from year to year.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

EIA Form 860 -- Annual Electric Generator Report (Schedule 3.3)

Primary key:

plant_id_eia, generator_id, report_date

Additional Details

This table includes only those values that are unique to solar generators. The rest of the columns that are reported in the EIA-860 Solar tabs are included in core_eia860__scd_generators and core_eia__entity_generators.

Columns
plant_id_eia

The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.

generator_id

Generator ID is usually numeric, but sometimes includes letters. Make sure you treat it as a string!

report_date

Date reported.

standard_testing_conditions_capacity_mwdc

The net capacity of this photovoltaic generator in direct current under standard test conditions (STC) of 1000 W/m^2 solar irradiance and 25 degrees Celsius PV module temperature. This was only reported in 2013 and 2014.

net_metering_capacity_mwdc

The DC megawatt capacity that is part of a net metering agreement.

uses_net_metering_agreement

Indicates if the output from this generator is part of a net metering agreement.

uses_virtual_net_metering_agreement

Indicates if the output from this generator is part of a virtual net metering agreement.

virtual_net_metering_capacity_mwdc

The DC capacity in MW that is part of a virtual net metering agreement.

azimuth_angle_deg

Indicates the azimuth angle of the unit for fixed tilt or single-axis technologies.

tilt_angle_deg

Indicates the tilt angle of the unit for fixed tilt or single-axis technologies.

uses_technology_lenses_mirrors

Indicates whether lenses or mirrors are used at this solar generating unit.

uses_technology_single_axis_tracking

Indicates whether single-axis tracking technologies are used at this solar generating unit.

uses_technology_dual_axis_tracking

Indicates whether dual-axis tracking technologies are used at this solar generating unit.

uses_technology_fixed_tilt

Indicates whether fixed tilt technologies are used at this solar generating unit.

uses_technology_east_west_fixed_tilt

Indicates whether east west fixed tilt technologies are used at this solar generating unit.

uses_technology_parabolic_trough

Indicates whether parabolic trough technologies are used at this solar generating unit.

uses_technology_linear_fresnel

Indicates whether linear fresnel technologies are used at this solar generating unit.

uses_technology_power_tower

Indicates whether power towers are used at this solar generating unit.

uses_technology_dish_engine

Indicates whether dish engines are used at this solar generating unit.

uses_technology_other

Indicates whether other solar technologies are used at this solar generating unit.

uses_material_crystalline_silicon

Indicates whether any solar photovoltaic panels at this generator are made of crystalline silicon.

uses_material_thin_film_a_si

Indicates whether any solar photovoltaic panels at this generator are made of thin-film amorphous silicon (A-Si).

uses_material_thin_film_cdte

Indicates whether any solar photovoltaic panels at this generator are made of thin-film cadmium telluride (CdTe).

uses_material_thin_film_cigs

Indicates whether any solar photovoltaic panels at this generator are made of thin-film copper indium gallium diselenide (CIGS).

uses_material_thin_film_other

Indicates whether any solar photovoltaic panels at this generator are made of other thin-film material.

uses_material_other

Indicates whether any solar photovoltaic panels at this generator are made of other materials.

uses_bifacial_panels

Indicates whether bifacial solar panels are used at this solar generating unit.

core_eia860__scd_generators_wind

package: pudl

Slowly changing dimension (SCD) table describing attributes of wind generators that may vary from year to year.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

EIA Form 860 -- Annual Electric Generator Report (Schedule 3.2)

Primary key:

plant_id_eia, generator_id, report_date

Additional Details

This table includes only those values that are unique to wind generators. The rest of the columns that are reported in the EIA-860 Wind tabs are included in core_eia860__scd_generators and core_eia__entity_generators.

Columns
plant_id_eia

The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.

generator_id

Generator ID is usually numeric, but sometimes includes letters. Make sure you treat it as a string!

report_date

Date reported.

design_wind_speed_mph

Average annual wind speed that turbines at this wind site were designed for.

obstacle_id_faa

The Federal Aviation Administration (FAA) obstacle number assigned to this generator. If more than one obstacle number exists, the one that best represents the turbines. References the obstacle numbers reported in the FAA's Digital Obstacle File: https://www.faa.gov/air_traffic/flight_info/aeronav/digital_products/dof/ This field was only reported from 2013 through 2015.

predominant_turbine_manufacturer

Name of predominant manufacturer of turbines at this generator.

predominant_turbine_model

Predominant model number of turbines at this generator.

turbine_hub_height_feet

The hub height of turbines at this generator. If more than one value exists, the one that best represents the turbines.

wind_quality_class

The wind quality class for turbines at this generator. See table core_eia__codes_wind_quality_class for specifications about each class.

core_eia860__scd_ownership

package: pudl

Slowly changing dimension (SCD) table describing attributes of generator ownership. Includes only jointly or third-party owned generators.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

EIA Form 860 -- Annual Electric Generator Report (Schedule 4)

Primary key:

report_date, plant_id_eia, generator_id, owner_utility_id_eia

Columns
report_date

Date reported.

owner_utility_id_eia

The EIA utility Identification number for the owner company that is responsible for the day-to-day operations of the generator, not the operator utility.

operator_utility_id_eia

The EIA utility Identification number for the operator utility.

plant_id_eia

The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.

generator_id

Generator ID is usually numeric, but sometimes includes letters. Make sure you treat it as a string!

owner_utility_name_eia

The name of the EIA owner utility.

owner_state

Two letter ISO-3166 political subdivision code.

owner_city

City of owner.

owner_country

Three letter ISO-3166 country code.

owner_street_address

Street address of owner.

owner_zip_code

Zip code of owner.

fraction_owned

Proportion of generator ownership attributable to this utility.
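One common use of fraction_owned is to split a generator's capacity among its owners. The sketch below is a minimal illustration with made-up records: it assumes capacity_mw has been looked up separately (for example from core_eia860__scd_generators), and the owned_capacity helper is hypothetical rather than part of PUDL.

```python
from collections import defaultdict

# Hypothetical ownership records for one jointly owned generator.
ownership = [
    {"report_date": "2022-01-01", "plant_id_eia": 1001, "generator_id": "1",
     "owner_utility_id_eia": 10, "fraction_owned": 0.75},
    {"report_date": "2022-01-01", "plant_id_eia": 1001, "generator_id": "1",
     "owner_utility_id_eia": 20, "fraction_owned": 0.25},
]
# Assumed lookup of nameplate capacity by (report_date, plant_id_eia, generator_id).
capacity_mw = {("2022-01-01", 1001, "1"): 400.0}

def owned_capacity(ownership, capacity_mw):
    """Sum each owner's share of capacity across the generators it owns."""
    totals = defaultdict(float)
    for rec in ownership:
        key = (rec["report_date"], rec["plant_id_eia"], rec["generator_id"])
        totals[rec["owner_utility_id_eia"]] += rec["fraction_owned"] * capacity_mw[key]
    return dict(totals)

shares = owned_capacity(ownership, capacity_mw)
print(shares)
```

Before relying on such a split, it is worth checking that the fractions for each generator and report date sum to roughly 1, since reported ownership shares are not guaranteed to be complete.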

data_maturity

Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.

core_eia860__scd_plants

package: pudl

Slowly changing dimension (SCD) table describing attributes of plants which may vary from year to year. Compiled from across all EIA-860 and EIA-923 data.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

EIA Form 860 -- Annual Electric Generator Report

Primary key:

plant_id_eia, report_date

Usage Warnings

  • Data has been drawn from several EIA sources which are not always consistent with each other, and PUDL chooses the most consistent or relevant value to facilitate cross-referencing even if that means some values will differ from the raw sources. See Harvesting for details.

  • Contains information from multiple raw inputs.

Additional Details

This is one of two tables where canonical values for plants are set. It contains values which are expected to vary slowly, while core_eia__entity_plants contains those expected to remain fixed. EIA reports many attributes in many different tables across EIA-860 and EIA-923. In order to compile tidy, well-normalized database tables, PUDL collects all instances of these values and chooses a canonical value. By default, PUDL chooses the most consistently reported value of a given attribute, as long as that value accounts for at least 70% of the reported instances. If an attribute was reported too inconsistently across the original EIA tables to meet that threshold, it will show up as a null value. All tables downstream of this one inherit the canonical values established here.

Columns
plant_id_eia

The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.

report_date

Date reported.

ash_impoundment

Is there an ash impoundment (e.g. pond, reservoir) at the plant?

ash_impoundment_lined

If there is an ash impoundment at the plant, is the impoundment lined?

ash_impoundment_status

If there is an ash impoundment at the plant, the ash impoundment status as of December 31 of the reporting year.

balancing_authority_code_eia

EIA short code identifying a balancing authority. May include Canadian and Mexican BAs.

balancing_authority_name_eia

Name of the balancing authority.

datum

Geodetic coordinate system identifier (e.g. NAD27, NAD83, or WGS84).

energy_storage

Indicates if the facility has energy storage capabilities.

ferc_cogen_docket_no

The docket number relating to the FERC cogenerator status. See FERC Form 556.

ferc_cogen_status

Indicates whether the plant has FERC qualifying facility cogenerator status. See FERC Form 556.

ferc_exempt_wholesale_generator_docket_no

The docket number relating to the FERC qualifying facility exempt wholesale generator status.

ferc_exempt_wholesale_generator

Indicates whether the plant has FERC qualifying facility exempt wholesale generator status

ferc_small_power_producer_docket_no

The docket number relating to the FERC qualifying facility small power producer status. See FERC Form 556.

ferc_small_power_producer

Indicates whether the plant has FERC qualifying facility small power producer status. See FERC Form 556.

ferc_qualifying_facility_docket_no

The docket number relating to the FERC qualifying facility cogenerator status. See FERC Form 556.

grid_voltage_1_kv

Plant's grid voltage at point of interconnection to transmission or distribution facilities

grid_voltage_2_kv

Plant's grid voltage at point of interconnection to transmission or distribution facilities

grid_voltage_3_kv

Plant's grid voltage at point of interconnection to transmission or distribution facilities

iso_rto_code

The code of the plant's ISO or RTO. NA if not reported in that year.

liquefied_natural_gas_storage

Indicates if the facility has the capability to store natural gas in the form of liquefied natural gas.

natural_gas_local_distribution_company

Names of Local Distribution Company (LDC), connected to natural gas burning power plants.

natural_gas_storage

Indicates if the facility has on-site storage of natural gas.

natural_gas_pipeline_name_1

The name of the owner or operator of natural gas pipeline that connects directly to this facility or that connects to a lateral pipeline owned by this facility.

natural_gas_pipeline_name_2

The name of the owner or operator of natural gas pipeline that connects directly to this facility or that connects to a lateral pipeline owned by this facility.

natural_gas_pipeline_name_3

The name of the owner or operator of natural gas pipeline that connects directly to this facility or that connects to a lateral pipeline owned by this facility.

nerc_region

NERC region in which the plant is located

has_net_metering

Whether the plant has a net metering agreement in effect during the reporting year. (Only displayed for facilities that report the sun or wind as an energy source.) This field was only reported up until 2015.

pipeline_notes

Additional owner or operator of natural gas pipeline.

primary_purpose_id_naics

North American Industry Classification System (NAICS) code that best describes the primary purpose of the reporting plant

regulatory_status_code

Indicates whether the plant is regulated or non-regulated.

reporting_frequency_code

Code that specifies what time period data has to be reported (i.e. monthly data or annual totals) and how often the power plant reports this data to EIA. See reporting_frequencies_eia for more details.

sector_id_eia

EIA assigned sector ID, corresponding to high level NAICS sector, designated by the primary purpose, regulatory status and plant-level combined heat and power status

sector_name_eia

EIA assigned sector name, corresponding to high level NAICS sector, designated by the primary purpose, regulatory status and plant-level combined heat and power status

service_area

Service area in which plant is located; for unregulated companies, it's the electric utility with which plant is interconnected

transmission_distribution_owner_id

EIA-assigned code for owner of transmission/distribution system to which the plant is interconnected.

transmission_distribution_owner_name

Name of the owner of the transmission or distribution system to which the plant is interconnected.

transmission_distribution_owner_state

State location for owner of transmission/distribution system to which the plant is interconnected.

utility_id_eia

The EIA Utility Identification number.

water_source

Name of water source associated with the plant.

data_maturity

Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.

core_eia860__scd_utilities

package: pudl

Slowly changing dimension (SCD) table describing attributes of utilities which may vary from year to year. Compiled from all EIA data.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

EIA Form 860 -- Annual Electric Generator Report

Primary key:

utility_id_eia, report_date

Usage Warnings

  • Data has been drawn from several EIA sources which are not always consistent with each other, and PUDL chooses the most consistent or relevant value to facilitate cross-referencing even if that means some values will differ from the raw sources. See Harvesting for details.

  • Contains information from multiple raw inputs.

Additional Details

This is one of two tables where canonical values for utilities are set. It contains values which are expected to vary slowly, while core_eia__entity_utilities contains those expected to remain fixed. EIA reports many attributes in many different tables across EIA-860 and EIA-923. In order to compile tidy, well-normalized database tables, PUDL collects all instances of these values and chooses a canonical value. By default, PUDL chooses the most consistently reported value of a given attribute as long as it accounts for at least 70% of the reported instances. If an attribute was reported inconsistently across the original EIA tables, then it will show up as a null value. All tables downstream of this one inherit the canonical values established here.
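The 70% rule described above can be illustrated with a minimal sketch. This is not PUDL's actual harvesting code: the function name `choose_canonical`, the choice to leave nulls out of the denominator, and the sample city values are all assumptions made for illustration.

```python
from collections import Counter


def choose_canonical(values, threshold=0.7):
    """Return the most consistently reported value, or None.

    Assumption: nulls (None) are ignored when computing the share.
    """
    reported = [v for v in values if v is not None]
    if not reported:
        return None
    value, count = Counter(reported).most_common(1)[0]
    # Keep the value only if it makes up at least `threshold` of the instances.
    return value if count / len(reported) >= threshold else None


# Hypothetical instances of one utility's city, collected from several tables.
consistent = ["Denver", "Denver", "Denver", "Denver", "Boulder"]
inconsistent = ["Denver", "Boulder", "Denver", "Boulder", "Golden"]

print(choose_canonical(consistent))    # most common value covers 4 of 5 instances
print(choose_canonical(inconsistent))  # most common value covers only 2 of 5
```

In the second case no value reaches the threshold, which corresponds to the null that would appear in this table.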

Columns
utility_id_eia

The EIA Utility Identification number.

report_date

Date reported.

street_address

Physical street address.

city

Name of the city.

state

Two letter US state abbreviation.

zip_code

Five digit US Zip Code.

plants_reported_owner

Is the reporting entity an owner of power plants reported on Schedule 2 of the form?

plants_reported_operator

Is the reporting entity an operator of power plants reported on Schedule 2 of the form?

plants_reported_asset_manager

Is the reporting entity an asset manager of power plants reported on Schedule 2 of the form?

plants_reported_other_relationship

Does the reporting entity have any other relationship to the power plants reported on Schedule 2 of the form?

entity_type

Entity type of principal owner.

attention_line

Mail attention name of the operator/owner.

address_2

Second line of the address.

zip_code_4

Four digit US Zip Code suffix.

contact_firstname

First name of utility contact 1.

contact_lastname

Last name of utility contact 1.

contact_title

Title of utility contact 1.

phone_number

Phone number for utility contact 1.

phone_extension

Phone extension for utility contact 1.

contact_firstname_2

First name of utility contact 2.

contact_lastname_2

Last name of utility contact 2.

contact_title_2

Title of utility contact 2.

phone_number_2

Phone number for utility contact 2.

phone_extension_2

Phone extension for utility contact 2.

data_maturity

Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.

core_eia860m__changelog_generators

package: pudl

Changelog table tracking changes in generator characteristics reported in EIA-860M.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

EIA Form 860M -- Monthly Update to the Annual Electric Generator Report

Primary key:

plant_id_eia, generator_id, report_date

Additional Details

EIA-860M includes generator tables with the most up-to-date catalog of EIA generators and their operational status and other generator characteristics. EIA-860M is reported monthly, although for the vast majority of the generators nothing changes month-to-month. This table is a changelog of that monthly reported generator data. There is a record corresponding to the first instance of a generator and associated characteristics with a report_date column and a valid_until_date column. Whenever any of the reported EIA-860M data was changed for a record, there will be a new changelog record with a new report_date.
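As a rough sketch of the idea, the snippet below collapses monthly snapshots of a single generator into changelog records. It is an illustration under stated assumptions, not PUDL's implementation: the function `build_changelog`, the sample attributes, and the use of `None` as the `valid_until_date` of the latest record are all hypothetical.

```python
def build_changelog(snapshots):
    """Collapse monthly snapshots of one generator into changelog records.

    snapshots: list of (report_date, attributes) tuples sorted by date,
    where report_date is an ISO date string and attributes is a dict.
    """
    changelog = []
    for report_date, attrs in snapshots:
        if changelog and changelog[-1]["attrs"] == attrs:
            continue  # nothing changed this month, so no new record
        if changelog:
            # The previous record stops being valid when the new one starts.
            changelog[-1]["valid_until_date"] = report_date
        changelog.append(
            {"report_date": report_date, "valid_until_date": None, "attrs": attrs}
        )
    return changelog


# Hypothetical monthly reports for one generator.
snapshots = [
    ("2023-01-01", {"operational_status": "proposed", "capacity_mw": 50.0}),
    ("2023-02-01", {"operational_status": "proposed", "capacity_mw": 50.0}),
    ("2023-03-01", {"operational_status": "existing", "capacity_mw": 50.0}),
    ("2023-04-01", {"operational_status": "existing", "capacity_mw": 50.0}),
]
changelog = build_changelog(snapshots)
for record in changelog:
    print(record["report_date"], record["valid_until_date"], record["attrs"])
```

Four monthly reports reduce to two changelog records because the generator's attributes changed only once.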

Columns
report_date

Date reported.

valid_until_date

The record in the changelog is valid until this date. The record is valid from the report_date up until but not including the valid_until_date.
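Because the validity window includes report_date but excludes valid_until_date, a point-in-time lookup needs a half-open comparison. The sketch below shows one way to write it; the function `record_valid_on`, the sample records, and the treatment of a missing `valid_until_date` as "still valid" are assumptions for illustration.

```python
from datetime import date


def record_valid_on(records, on_date):
    """Return the changelog record valid on `on_date`, or None.

    A record is valid when report_date <= on_date < valid_until_date.
    Assumption: a valid_until_date of None means the record is still current.
    """
    for rec in records:
        start = rec["report_date"]
        end = rec["valid_until_date"]
        if start <= on_date and (end is None or on_date < end):
            return rec
    return None


# Hypothetical changelog records for one generator.
records = [
    {
        "report_date": date(2023, 1, 1),
        "valid_until_date": date(2023, 3, 1),
        "operational_status": "proposed",
    },
    {
        "report_date": date(2023, 3, 1),
        "valid_until_date": None,
        "operational_status": "existing",
    },
]

print(record_valid_on(records, date(2023, 2, 28))["operational_status"])
print(record_valid_on(records, date(2023, 3, 1))["operational_status"])
```

The boundary date 2023-03-01 belongs to the second record, not the first, since the end of the window is exclusive.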

plant_id_eia

The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.

plant_name_eia

Plant name.

utility_id_eia

The EIA Utility Identification number.

utility_name_eia

The name of the utility.

generator_id

Generator ID is usually numeric, but sometimes includes letters. Make sure you treat it as a string!
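A short sketch of why this matters, using only the standard library. The IDs below are hypothetical examples chosen to show the failure modes; the helper `as_int_or_none` exists only for this illustration.

```python
import csv
import io

# Hypothetical extract: three distinct generators at the same plant.
raw = "plant_id_eia,generator_id\n3,1\n3,01\n3,GT1\n"

rows = list(csv.DictReader(io.StringIO(raw)))
generator_ids = [row["generator_id"] for row in rows]  # csv keeps these as str


def as_int_or_none(text):
    """Mimic a naive numeric cast of an ID column."""
    try:
        return int(text)
    except ValueError:
        return None


coerced = [as_int_or_none(g) for g in generator_ids]

print(generator_ids)  # three distinct strings
print(coerced)        # "1" and "01" collide, and "GT1" is lost
```

Casting to a number merges IDs that differ only by leading zeros and drops IDs containing letters, either of which would break joins on the primary key.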

balancing_authority_code_eia

EIA short code identifying a balancing authority. May include Canadian and Mexican BAs.

capacity_mw

Total installed (nameplate) capacity, in megawatts.

county

County name.

current_planned_generator_operating_date

The most recently updated effective date on which the generator is scheduled to start operation

data_maturity

Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.

energy_source_code_1

The code representing the most predominant type of energy that fuels the generator.

energy_storage_capacity_mwh

Energy storage capacity in MWh (e.g. for batteries).

fuel_type_code_pudl

Simplified fuel type code used in PUDL

generator_operating_date

Date the generator began commercial operation. If harvested values are inconsistent, we default to using the most recently reported date.

generator_retirement_date

Date of the scheduled or effected retirement of the generator.

latitude

Latitude of the plant's location, in degrees.

longitude

Longitude of the plant's location, in degrees.

net_capacity_mwdc

Generation capacity in megawatts of direct current that is subject to a net metering agreement. Typically used for behind-the-meter solar PV.

operational_status

The operating status of the asset. For generators, this is based on the tab in which the generator was listed in EIA 860.

operational_status_code

The operating status of the asset.

planned_derate_date

Planned effective month that the generator is scheduled to enter operation after the derate modification.

planned_generator_retirement_date

Planned effective date of the scheduled retirement of the generator.

planned_net_summer_capacity_derate_mw

Decrease in summer capacity expected to be realized from the derate modification to the equipment.

planned_net_summer_capacity_uprate_mw

Increase in summer capacity expected to be realized from the modification to the equipment.

planned_uprate_date

Planned effective date that the generator is scheduled to enter operation after the uprate modification.

planned_repower_date

Planned effective date that the generator is scheduled to enter operation after the repowering is complete.

prime_mover_code

Code for the type of prime mover (e.g. CT, CG)

sector_id_eia

EIA assigned sector ID, corresponding to high level NAICS sector, designated by the primary purpose, regulatory status and plant-level combined heat and power status

state

Two letter US state abbreviation.

summer_capacity_mw

The net summer capacity.

technology_description

High level description of the technology used by the generator to produce electricity.

winter_capacity_mw

The net winter capacity.

core_eia861__assn_balancing_authority

package: pudl

Association table providing connections between state, balancing authority, and utility in a given year.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

EIA Form 861 -- Annual Electric Power Industry Report

Primary key:

report_date, balancing_authority_id_eia, utility_id_eia, state

Columns
report_date

Date reported.

balancing_authority_id_eia

EIA balancing authority ID. This is often (but not always!) the same as the utility ID associated with the same legal entity.

utility_id_eia

The EIA Utility Identification number.

state

Two letter US state abbreviation.

core_eia861__assn_utility

package: pudl

Association table providing connections between utility and state in a given year.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

EIA Form 861 -- Annual Electric Power Industry Report

Primary key:

report_date, utility_id_eia, state

Columns
report_date

Date reported.

utility_id_eia

The EIA Utility Identification number.

state

Two letter US state abbreviation.

core_eia861__yearly_advanced_metering_infrastructure

package: pudl

Annual time series of advanced metering infrastructure (AMI) and automated meter reading (AMR) by state, sector, and balancing authority.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

EIA Form 861 -- Annual Electric Power Industry Report

Primary key:

balancing_authority_code_eia, customer_class, report_date, state, utility_id_eia

Additional Details

The energy served (in MWH) for AMI systems is provided. Form EIA-861 respondents also report the number of standard meters (non AMR/AMI) in their system. Historical Changes: We started collecting the number of standard meters in 2013. The monthly survey collected these data from January 2011 to January 2017.

Columns
advanced_metering_infrastructure

Number of meters that measure and record usage data, at a minimum, in hourly intervals and provide usage data at least daily to energy companies and may also provide data to consumers. Data are used for billing and other purposes. Advanced meters include basic hourly interval meters and extend to real-time meters with built-in two-way communication capable of recording and transmitting instantaneous data.

automated_meter_reading

Number of meters that collect data for billing purposes only and transmit this data one way, usually from the customer to the distribution utility.

balancing_authority_code_eia

EIA short code identifying a balancing authority. May include Canadian and Mexican BAs.

customer_class

High level categorization of customer type (e.g., commercial, residential).

daily_digital_access_customers

Number of customers able to access daily energy usage through a web portal or other electronic means.

direct_load_control_customers

Number of customers with direct load control: a demand response activity by which the program sponsor remotely shuts down or cycles a customer’s electrical equipment (e.g. air conditioner, water heater) on short notice.

energy_served_ami_mwh

Amount of energy served through AMI meters. AMI meters can transmit data in both directions, between the delivery entity and the customer.

entity_type

Entity type of principal owner.

home_area_network

Number of AMI meters with home area network (HAN) gateway enabled.

non_amr_ami

Number of non-AMR/AMI meters. Usually electromechanical or solid state meters measuring aggregated kWh where data are manually retrieved over monthly billing cycles for billing purposes only. Standard meters may also include functions to measure time-of-use and/or demand with data manually retrieved over monthly billing cycles.

report_date

Date reported.

short_form

Whether the reported information comes from the short form. In the case of form EIA 861, a shorter version of the form was created in 2012 to reduce respondent burden on smaller utilities and increase our processing efficiency.

state

Two letter US state abbreviation.

utility_id_eia

The EIA Utility Identification number.

utility_name_eia

The name of the utility.

data_maturity

Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.

core_eia861__yearly_balancing_authority

package: pudl

Annual time series of balancing authorities.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

EIA Form 861 -- Annual Electric Power Industry Report

Primary key:

report_date, balancing_authority_id_eia

Columns
report_date

Date reported.

balancing_authority_id_eia

EIA balancing authority ID. This is often (but not always!) the same as the utility ID associated with the same legal entity.

balancing_authority_code_eia

EIA short code identifying a balancing authority. May include Canadian and Mexican BAs.

balancing_authority_name_eia

Name of the balancing authority.

core_eia861__yearly_demand_response

package: pudl

Annual time series of demand response programs by state, sector, and balancing authority.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

EIA Form 861 -- Annual Electric Power Industry Report

Primary key:

balancing_authority_code_eia, customer_class, report_date, state, utility_id_eia

Additional Details

The EIA861 demand-side management (DSM) table (split into three normalized tables in PUDL) contains data through 2012. The form changed in 2013 and split the contents of the DSM table into energy efficiency and demand response tables. Though similar, the information collected before and after 2012 is not comparable enough to combine into a single, continuous table. An EIA representative we contacted discouraged us from doing so.

Columns
actual_peak_demand_savings_mw

Demand reduction actually achieved by demand response activities. Measured at the time of the company's annual system peak hour.

balancing_authority_code_eia

EIA short code identifying a balancing authority. May include Canadian and Mexican BAs.

customer_class

High level categorization of customer type (e.g., commercial, residential).

customer_incentives_cost

Total cost of customer incentives in a given report year. Customer incentives are the total financial value provided to a customer for program participation, whether, for example, cash payment, or lowered tariff rates relative to non-participants, in-kind services (e.g. design work), or other benefits directly provided to the customer for their program participation.

customers

Number of customers.

energy_savings_mwh

The energy savings incurred in a given reporting year by participation in demand response programs.

other_costs

Additional costs.

potential_peak_demand_savings_mw

The total demand savings that could occur at the time of the system peak hour assuming all demand response is called.

report_date

Date reported.

short_form

Whether the reported information comes from the short form. In the case of form EIA 861, a shorter version of the form was created in 2012 to reduce respondent burden on smaller utilities and increase our processing efficiency.

state

Two letter US state abbreviation.

utility_id_eia

The EIA Utility Identification number.

utility_name_eia

The name of the utility.

data_maturity

Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.

core_eia861__yearly_demand_response_water_heater

package: pudl

Annual time series of grid-connected water heaters enrolled in demand response programs.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

EIA Form 861 -- Annual Electric Power Industry Report

Primary key:

balancing_authority_code_eia, report_date, state, utility_id_eia

Columns
balancing_authority_code_eia

EIA short code identifying a balancing authority. May include Canadian and Mexican BAs.

report_date

Date reported.

state

Two letter US state abbreviation.

utility_id_eia

The EIA Utility Identification number.

num_water_heaters

The number of grid-enabled water heaters added to the respondent's program this year, if the respondent has a DSM program for grid-enabled water heaters (as defined by DOE’s Office of Energy Efficiency and Renewable Energy).

data_maturity

Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.

core_eia861__yearly_demand_side_management_ee_dr

package: pudl

Annual time series of the impact of energy efficiency and load management programs on total energy sold (MWh) and peak demand (MW) by customer class.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

EIA Form 861 -- Annual Electric Power Industry Report

Primary key:

This table has no primary key.

Usage Warnings

  • The data is no longer being collected or reported in this way.

Additional Details

The raw EIA861 demand-side management (DSM) table (split into three normalized tables in PUDL) contains data through 2012. The form changed in 2013 and split the contents of the DSM table into energy efficiency and demand response tables. Though similar, the information collected before and after 2012 is not comparable enough to combine into a single, continuous table. An EIA representative we contacted discouraged us from doing so.

Columns
annual_indirect_program_cost

Costs that have not been included in any program category, but could be meaningfully identified with operating the company’s DSM programs (e.g., Administrative, Marketing, Monitoring & Evaluation, Company-Earned Incentives, Other).

annual_total_cost

The sum of direct program costs, indirect program costs, and incentive payments associated with utility demand side management programs.

customer_class

High level categorization of customer type (e.g., commercial, residential).

energy_efficiency_annual_actual_peak_reduction_mw

The peak reduction incurred in a given reporting year by all participants in efficiency programs.

energy_efficiency_annual_cost

The sum of actual direct costs, incentive payments, and indirect costs incurred in a given reporting year from energy efficiency programs.

energy_efficiency_annual_effects_mwh

The change in energy use incurred in a given reporting year by all participants in energy efficiency programs.

energy_efficiency_annual_incentive_payment

The cost of incentive payments incurred in a given reporting year from energy efficiency programs. Incentives are the total financial value provided to a customer for program participation, whether cash payment, in-kind services (e.g. design work), or other benefits directly provided to the customer for their program participation.

energy_efficiency_incremental_actual_peak_reduction_mw

The peak reduction incurred in a given reporting year by new participants in existing energy efficiency programs and all participants in new energy efficiency programs.

energy_efficiency_incremental_effects_mwh

The change in energy use incurred in a given reporting year by new participants in existing energy efficiency programs and all participants in new energy efficiency programs.

load_management_annual_actual_peak_reduction_mw

The peak reduction incurred in a given reporting year by all participants in load management programs.

load_management_annual_cost

The sum of actual direct costs, incentive payments, and indirect costs incurred in a given reporting year from load management programs.

load_management_annual_effects_mwh

The change in energy use incurred in a given reporting year by all participants in load management programs.

load_management_annual_incentive_payment

The cost of incentive payments incurred in a given reporting year from load management programs. Incentives are the total financial value provided to a customer for program participation, whether cash payment, in-kind services (e.g. design work), or other benefits directly provided to the customer for their program participation.

load_management_annual_potential_peak_reduction_mw

The potential amount of peak reduction that could be incurred in a given reporting year by all participants in load management programs.

load_management_incremental_actual_peak_reduction_mw

The peak reduction incurred in a given reporting year by new participants in existing load management programs and all participants in new load management programs.

load_management_incremental_effects_mwh

The change in energy use incurred in a given reporting year by new participants in existing load management programs and all participants in new load management programs.

load_management_incremental_potential_peak_reduction_mw

The potential amount of peak reduction that could be incurred in a given reporting year by new participants in existing load management programs and all participants in new load management programs.

nerc_region

NERC region in which the plant is located

price_responsiveness_customers

The number of customers participating in the respondent's incentive-based demand response programs.

report_date

Date reported.

state

Two letter US state abbreviation.

time_responsiveness_customers

The number of customers participating in the respondent's time-based rate programs.

utility_id_eia

The EIA Utility Identification number.

data_maturity

Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.

core_eia861__yearly_demand_side_management_misc

package: pudl

Annual time series of demand-side management (DSM) program information.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

EIA Form 861 -- Annual Electric Power Industry Report

Primary key:

This table has no primary key.

Usage Warnings

  • The data is no longer being collected or reported in this way.

Additional Details

Includes boolean fields about whether the energy savings estimates/calculations were independently verified and whether the utility runs time- and/or price-responsive programs. Also contains information on whether any of the respondent's DSM activities are reported under another company, and if so which one.

The raw EIA861 demand-side management (DSM) table (split into three normalized tables in PUDL) contains data through 2012. The form changed in 2013 and split the contents of the DSM table into energy efficiency and demand response tables. Though similar, the information collected before and after 2012 is not comparable enough to combine into a single, continuous table. An EIA representative we contacted discouraged us from doing so.

Columns
energy_savings_estimates_independently_verified

Whether savings estimates are based on a forecast or the report of one or more independent evaluators.

energy_savings_independently_verified

Whether reported energy savings were verified through an independent evaluation.

entity_type

Entity type of principal owner.

major_program_changes

Whether there have been any major changes to the respondent's demand-side management programs (e.g., terminated programs, new information or financing programs, or a shift to programs with dual load building objectives and energy efficiency objectives), program tracking procedures, or reporting methods that affect the comparison of demand-side management data reported on this schedule to data from previous years.

nerc_region

NERC region in which the plant is located

price_responsive_programs

Whether the respondent operates any incentive-based demand response programs (e.g., market incentives, financial incentives, direct load control, interruptible programs, demand bidding/buyback, emergency demand response, capacity market programs, and ancillary service market programs).

report_date

Date reported.

reported_as_another_company

The name of the company if a respondent's demand-side management activities are reported on Schedule 6 of another company’s form.

short_form

Whether the reported information comes from the short form. In the case of form EIA 861, a shorter version of the form was created in 2012 to reduce respondent burden on smaller utilities and increase our processing efficiency.

state

Two letter US state abbreviation.

has_time_responsive_programs

Whether the respondent operates any time-based rate programs (e.g., real-time pricing, critical peak pricing, variable peak pricing and time-of-use rates administered through a tariff).

utility_id_eia

The EIA Utility Identification number.

utility_name_eia

The name of the utility.

data_maturity

Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.

core_eia861__yearly_demand_side_management_sales

package: pudl

Annual time series of electricity sales related to demand-side management (DSM).

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

EIA Form 861 -- Annual Electric Power Industry Report

Primary key:

This table has no primary key.

Usage Warnings

  • The data is no longer being collected or reported in this way.

Additional Details

The raw EIA861 demand-side management (DSM) table (split into three normalized tables in PUDL) contains data through 2012. The form changed in 2013 and split the contents of the DSM table into energy efficiency and demand response tables. Though similar, the information collected before and after 2012 is not comparable enough to combine into a single, continuous table. An EIA representative we contacted discouraged us from doing so.

Columns
nerc_region

NERC region in which the plant is located

report_date

Date reported.

sales_for_resale_mwh

The amount of electricity sold for resale purposes. This entry should include sales for resale to power marketers (reported separately in previous years), full and partial requirements customers, firm power customers and nonfirm customers.

sales_to_ultimate_consumers_mwh

The amount of electricity sold to customers purchasing electricity for their own use and not for resale.

state

Two letter US state abbreviation.

utility_id_eia

The EIA Utility Identification number.

data_maturity

Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.

core_eia861__yearly_distributed_generation_fuel

package: pudl

Annual time series of the energy sources used for utility or customer-owned distributed generation capacity.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

EIA Form 861 -- Annual Electric Power Industry Report

Primary key:

This table has no primary key.

Usage Warnings

  • The data is no longer being collected or reported in this way.

Additional Details

From 2016 on, there's no great match for the data found in this table. You may be able to compute an analog for fuel percents using the capacity and tech class figures in core_eia861__yearly_non_net_metering_customer_fuel_class, but no corresponding consumption or generation information is available in the new format.
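One possible way to compute such an analog is sketched below: each technology class's share of total capacity, expressed as a percent. The column names `tech_class` and `capacity_mw`, the sample rows, and the function `capacity_shares` are assumptions made for illustration; check the actual schema of core_eia861__yearly_non_net_metering_customer_fuel_class before relying on them. A capacity share is also not equivalent to the fuel percent reported in this table, only a rough stand-in.

```python
def capacity_shares(rows):
    """Return each tech class's percent share of total capacity.

    rows: list of dicts with hypothetical keys "tech_class" and "capacity_mw",
    all belonging to one utility, state and report year.
    """
    total = sum(r["capacity_mw"] for r in rows)
    if total == 0:
        return {}  # no capacity reported, so shares are undefined
    shares = {}
    for r in rows:
        pct = 100.0 * r["capacity_mw"] / total
        shares[r["tech_class"]] = shares.get(r["tech_class"], 0.0) + pct
    return shares


# Hypothetical rows for one utility in one year (e.g. split across sectors).
rows = [
    {"tech_class": "pv", "capacity_mw": 6.0},
    {"tech_class": "wind", "capacity_mw": 3.0},
    {"tech_class": "pv", "capacity_mw": 1.0},
]

shares = capacity_shares(rows)
print(shares)
```

Rows with the same class are summed before reporting, so capacity split across sectors still yields a single share per class.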

The raw EIA861 distributed generation (DG) table (split into three normalized tables in PUDL) was renamed in 2016 to Non-Net Metering to prevent double counting. The data in the Non-Net Metering table (2016+) are split by sector, contain fuel cell information, and convert capacity reported in DC units to AC units.

Columns
estimated_or_actual_fuel_data

Whether the reported fuel data is estimated or actual.

fuel_class

Fuel types specific to EIA 861 distributed generation table: ['gas', 'oil', 'other', 'renewable', 'water', 'wind', 'wood']

fuel_pct

Percent of fuel

report_date

Date reported.

state

Two letter US state abbreviation.

utility_id_eia

The EIA Utility Identification number.

data_maturity

Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.

core_eia861__yearly_distributed_generation_misc

package: pudl

Annual time series of the capacity and quantity of utility or customer-owned distributed generation.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

EIA Form 861 -- Annual Electric Power Industry Report

Primary key:

This table has no primary key.

Usage Warnings

  • The data is no longer being collected or reported in this way.

Additional Details

From 2016 on, data similar to that found in this table can be found in core_eia861__yearly_non_net_metering_misc.

The raw EIA861 distributed generation (DG) table (split into three normalized tables in PUDL) was renamed in 2016 to Non-Net Metering to prevent double counting. The data in the Non-Net Metering table (2016+) are split by sector, contain fuel cell information, and convert capacity reported in DC units to AC units.

Columns
backup_capacity_mw

The total nameplate capacity of generators that are used only for emergency backup service.

distributed_generation_owned_capacity_mw

Amount of distributed generation capacity owned by the respondent.

estimated_or_actual_capacity_data

Whether the reported capacity data is estimated or actual.

generators_num_less_1_mw

Total number of generators less than 1 MW.

generators_number

Total number of generators

report_date

Date reported.

state

Two letter US state abbreviation.

total_capacity_less_1_mw

The total amount of capacity from generators with less than 1 MW of nameplate capacity.

utility_id_eia

The EIA Utility Identification number.

utility_name_eia

The name of the utility.

data_maturity

Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.

core_eia861__yearly_distributed_generation_tech

package: pudl

Annual time series of the technology used for utility or customer-owned distributed generation.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

EIA Form 861 -- Annual Electric Power Industry Report

Primary key:

This table has no primary key.

Usage Warnings

  • The data is no longer being collected or reported in this way.

Additional Details

From 2016 on, data similar to that found in this table can be found in core_eia861__yearly_non_net_metering_customer_fuel_class.

The raw EIA861 distributed generation (DG) table (split into three normalized tables in PUDL) was renamed in 2016 to Non-Net Metering to prevent double counting. The data in the Non-Net Metering table (2016+) are split by sector, contain fuel cell information, and convert capacity reported in DC units to AC units.

Columns
capacity_mw

Total installed (nameplate) capacity, in megawatts.

estimated_or_actual_tech_data

Whether the reported technology data is estimated or actual.

report_date

Date reported.

state

Two letter US state abbreviation.

tech_class

Type of technology specific to EIA 861 distributed generation and net generation tables: ['backup', 'chp_cogen', 'combustion_turbine', 'fuel_cell', 'hydro', 'internal_combustion', 'other', 'pv', 'steam', 'storage_pv', 'storage_nonpv', 'all_storage', 'total', 'virtual_pv', 'virtual_pv_under_1mw', 'virtual_pv_over_1mw', 'wind'].

utility_id_eia

The EIA Utility Identification number.

data_maturity

Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.

core_eia861__yearly_distribution_systems

package: pudl

Annual time series of distribution circuits and circuits with voltage optimization by state.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

EIA Form 861 -- Annual Electric Power Industry Report

Primary key:

This table has no primary key.

Columns
circuits_with_voltage_optimization

Number of distribution circuits that employ voltage/VAR optimization (VVO).

distribution_circuits

Total number of distribution circuits.

report_date

Date reported.

short_form

Whether the reported information comes from the short form. In the case of form EIA 861, a shorter version of the form was created in 2012 to reduce respondent burden on smaller utilities and increase our processing efficiency.

state

Two letter US state abbreviation.

utility_id_eia

The EIA Utility Identification number.

utility_name_eia

The name of the utility.

data_maturity

Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.

core_eia861__yearly_dynamic_pricing

package: pudl

Annual time series of enrollment in dynamic pricing programs by state, sector, and balancing authority.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

EIA Form 861 -- Annual Electric Power Industry Report

Primary key:

This table has no primary key.

Additional Details

Respondents check if one or more customers are enrolled in time-of-use pricing, real time pricing, variable peak pricing, critical peak pricing, and critical peak rebates.

Columns
balancing_authority_code_eia

EIA short code identifying a balancing authority. May include Canadian and Mexican BAs.

critical_peak_pricing

Whether customers are participating in critical peak pricing, a program in which rate and/or price structure is designed to encourage reduced consumption during periods of high wholesale market prices or system contingencies, by imposing a pre-specified high rate or price for a limited number of days or hours.

critical_peak_rebate

Whether customers are participating in critical peak rebates, a program in which rate and/or price structure is designed to encourage reduced consumption during periods of high wholesale market prices or system contingencies, by providing a rebate to the customer on a limited number of days and for a limited number of hours, at the request of the energy provider.

customer_class

High level categorization of customer type (e.g., commercial, residential).

customers

Number of customers.

real_time_pricing

Whether the respondent has customers participating in a real time pricing (RTP) program. RTP is a program of rate and price structure in which the retail price for electricity typically fluctuates hourly or more often, to reflect changes in the wholesale price of electricity on either a day-ahead or hour-ahead basis.

report_date

Date reported.

short_form

Whether the reported information comes from the short form. In the case of form EIA 861, a shorter version of the form was created in 2012 to reduce respondent burden on smaller utilities and increase our processing efficiency.

state

Two letter US state abbreviation.

time_of_use_pricing

Whether the respondent has customers participating in a time-of-use (TOU) pricing program. TOU is a program in which customers pay different prices at different times of the day. On-peak prices are higher and off-peak prices are lower than a “standard” rate. The price schedule is fixed and predefined, based on season, day of week, and time of day.

utility_id_eia

The EIA Utility Identification number.

utility_name_eia

The name of the utility.

variable_peak_pricing

Whether the respondent has customers participating in a variable peak pricing program (VPP). VPP is a program in which a form of TOU pricing allows customers to purchase their generation supply at prices set on a daily basis with varying on-peak and constant off-peak rates. Under the VPP program, the on-peak price for each weekday becomes available the previous day (typically late afternoon) and the customer is billed for actual consumption during the billing cycle at these prices.

data_maturity

Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.

core_eia861__yearly_energy_efficiency

package: pudl

Annual time series of incremental energy savings, peak demand savings, weighted average life cycle, and associated costs for the reporting year and life cycle of energy efficiency programs.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

EIA Form 861 -- Annual Electric Power Industry Report

Primary key:

This table has no primary key.

Additional Details

The EIA861 demand-side management (DSM) table (split into three normalized tables in PUDL) contains data through 2012. The form changed in 2013 and split the contents of the DSM table into energy efficiency and demand response tables. Though similar, the information collected before and after 2012 is not comparable enough to combine into a single, continuous table. We were discouraged from doing so after contacting a representative from EIA.

Columns
balancing_authority_code_eia

EIA short code identifying a balancing authority. May include Canadian and Mexican BAs.

customer_class

High level categorization of customer type (e.g., commercial, residential).

customer_incentives_incremental_cost

The cost of customer incentives resulting from new participants in existing energy efficiency programs and all participants in new energy efficiency programs. Customer incentives are the total financial value provided to a customer for program participation, whether, for example, cash payment, or lowered tariff rates relative to non-participants, in-kind services (e.g. design work), or other benefits directly provided to the customer for their program participation.

customer_incentives_incremental_life_cycle_cost

All anticipated costs of the customer incentives including reporting year incremental costs and all future costs. Customer incentives are the total financial value provided to a customer for program participation, whether, for example, cash payment, or lowered tariff rates relative to non-participants, in-kind services (e.g. design work), or other benefits directly provided to the customer for their program participation.

customer_other_costs_incremental_life_cycle_cost

All anticipated costs other than customer incentives. Includes reporting year incremental costs and all future costs.

incremental_energy_savings_mwh

Energy savings in the given report year resulting from new participants in existing demand response programs and all participants in new demand response programs.

incremental_life_cycle_energy_savings_mwh

The estimated total changes in energy use for incremental programs and participants over the life of the programs. DSM programs have a useful life, and the net effects of these programs will diminish over time. Considers the useful life of energy efficiency technology by accounting for building demolition, equipment degradation, and program attrition.

incremental_life_cycle_peak_reduction_mw

The estimated total changes in peak load for incremental programs and participants over the life of the programs. DSM programs have a useful life, and the net effects of these programs will diminish over time. Considers the useful life of energy efficiency technology by accounting for building demolition, equipment degradation, and program attrition.

incremental_peak_reduction_mw

The peak reduction incurred in a given reporting year by new participants in existing energy efficiency programs and all participants in new energy efficiency programs.

other_costs_incremental_cost

Costs resulting from new participants in existing energy efficiency programs and all participants in new energy efficiency programs that aren't directly associated with customer incentives.

report_date

Date reported.

short_form

Whether the reported information comes from the short form. In the case of form EIA 861, a shorter version of the form was created in 2012 to reduce respondent burden on smaller utilities and increase our processing efficiency.

state

Two letter US state abbreviation.

utility_id_eia

The EIA Utility Identification number.

utility_name_eia

The name of the utility.

weighted_average_life_years

The weighted average life of the respondent's portfolio of energy efficiency programs.

data_maturity

Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.

core_eia861__yearly_green_pricing

package: pudl

Annual time series of green pricing program revenue, sales, and customer count by sector and state.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

EIA Form 861 -- Annual Electric Power Industry Report

Primary key:

This table has no primary key.

Usage Warnings

  • The data is no longer being collected or reported in this way.

Columns
customer_class

High level categorization of customer type (e.g., commercial, residential).

customers

Number of customers.

green_pricing_revenue

The revenue derived from the premium green pricing rate of the respondent's program.

rec_revenue

Amount of revenue collected from Renewable Energy Certificates (RECs).

rec_sales_mwh

Quantity of Renewable Energy Certificate (REC) sales, in MWh.

report_date

Date reported.

sales_mwh

Quantity of electricity sold in MWh.

state

Two letter US state abbreviation.

utility_id_eia

The EIA Utility Identification number.

utility_name_eia

The name of the utility.

data_maturity

Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.

core_eia861__yearly_mergers

package: pudl

Annual time series of utility mergers and acquisitions.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

EIA Form 861 -- Annual Electric Power Industry Report

Primary key:

This table has no primary key.

Columns
entity_type

Entity type of principal owner.

merge_address

Address of new parent company.

merge_city

City of new parent company.

merge_company

Name of the company merged with or acquired.

merge_date

Date of merger or acquisition.

merge_state

Two letter US state abbreviations and three letter ISO-3166-1 country codes for international mines.

new_parent

Name of the new parent company post merger.

report_date

Date reported.

state

Two letter US state abbreviation.

utility_id_eia

The EIA Utility Identification number.

utility_name_eia

The name of the utility.

zip_code

Five digit US Zip Code.

zip_code_4

Four digit US Zip Code suffix.

data_maturity

Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.

core_eia861__yearly_net_metering_customer_fuel_class

package: pudl

Annual time series of net metering by customer and fuel class.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

EIA Form 861 -- Annual Electric Power Industry Report

Primary key:

This table has no primary key.

Usage Warnings

  • Some years use a slightly different data definition.

Additional Details

The amount of energy sold back to the grid. From 2007 to 2009 the data are reported as a lump sum of total energy dispatched by sector. After 2009, the data are broken down by sector and technology type.

Columns
balancing_authority_code_eia

EIA short code identifying a balancing authority. May include Canadian and Mexican BAs.

capacity_mw

Total installed (nameplate) capacity, in megawatts.

energy_capacity_mwh

The total amount of energy the system can supply before recharging is necessary, in megawatt-hours.

customer_class

High level categorization of customer type (e.g., commercial, residential).

customers

Number of customers.

report_date

Date reported.

short_form

Whether the reported information comes from the short form. In the case of form EIA 861, a shorter version of the form was created in 2012 to reduce respondent burden on smaller utilities and increase our processing efficiency.

sold_to_utility_mwh

The amount of electric energy sold back to the utility through the net metering application.

state

Two letter US state abbreviation.

tech_class

Type of technology specific to EIA 861 distributed generation and net generation tables: ['backup', 'chp_cogen', 'combustion_turbine', 'fuel_cell', 'hydro', 'internal_combustion', 'other', 'pv', 'steam', 'storage_pv', 'storage_nonpv', 'all_storage', 'total', 'virtual_pv', 'virtual_pv_under_1mw', 'virtual_pv_over_1mw', 'wind'].

utility_id_eia

The EIA Utility Identification number.

utility_name_eia

The name of the utility.

data_maturity

Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.

core_eia861__yearly_net_metering_misc

package: pudl

Annual time series of PV current flow type for net metered capacity.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

EIA Form 861 -- Annual Electric Power Industry Report

Primary key:

This table has no primary key.

Columns
balancing_authority_code_eia

EIA short code identifying a balancing authority. May include Canadian and Mexican BAs.

pv_current_flow_type

Current flow type for photovoltaics: AC or DC

report_date

Date reported.

state

Two letter US state abbreviation.

utility_id_eia

The EIA Utility Identification number.

data_maturity

Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.

core_eia861__yearly_non_net_metering_customer_fuel_class

package: pudl

Annual time series of non-net metered distributed generation by sector and technology type.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

EIA Form 861 -- Annual Electric Power Industry Report

Primary key:

This table has no primary key.

Additional Details

This table's data starts in 2016. For 2015 and earlier, data similar to that found in this table can be found in core_eia861__yearly_distributed_generation_tech.

The raw EIA861 distributed generation (DG) table (split into three normalized tables in PUDL) was renamed in 2016 to Non-Net Metering to prevent double counting. The data in the Non-Net Metering table (2016+) are split by sector, contain fuel cell information, and convert capacity reported in DC units to AC units.
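
Because the 2016 rename splits the series across tables, code that spans both periods has to pick a table by report year. The helper below is a minimal sketch of that lookup using only the table pairs named in this documentation; the names DG_TABLE_PAIRS and dg_table_for_year are hypothetical and not part of PUDL. The pre-2016 fuel table is left out because no close match is documented for it.

```python
# Table pairs named in this documentation: (2015 and earlier, 2016 onward).
DG_TABLE_PAIRS = {
    "tech": (
        "core_eia861__yearly_distributed_generation_tech",
        "core_eia861__yearly_non_net_metering_customer_fuel_class",
    ),
    "misc": (
        "core_eia861__yearly_distributed_generation_misc",
        "core_eia861__yearly_non_net_metering_misc",
    ),
}


def dg_table_for_year(kind, report_year):
    """Pick the table that covers report_year for the given kind of data."""
    pre_2016, from_2016 = DG_TABLE_PAIRS[kind]
    return from_2016 if report_year >= 2016 else pre_2016
```

The two tables in a pair are similar rather than identical (the later ones are split by sector and report capacity in AC units), so values on either side of 2016 should not be treated as directly comparable.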

Columns
balancing_authority_code_eia

EIA short code identifying a balancing authority. May include Canadian and Mexican BAs.

capacity_mw

Total installed (nameplate) capacity, in megawatts.

energy_capacity_mwh

The total amount of energy the system can supply before recharging is necessary, in megawatt-hours.

customer_class

High level categorization of customer type (e.g., commercial, residential).

report_date

Date reported.

state

Two letter US state abbreviation.

tech_class

Type of technology specific to EIA 861 distributed generation and net generation tables: ['backup', 'chp_cogen', 'combustion_turbine', 'fuel_cell', 'hydro', 'internal_combustion', 'other', 'pv', 'steam', 'storage_pv', 'storage_nonpv', 'all_storage', 'total', 'virtual_pv', 'virtual_pv_under_1mw', 'virtual_pv_over_1mw', 'wind'].

utility_id_eia

The EIA Utility Identification number.

utility_name_eia

The name of the utility.

data_maturity

Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.

core_eia861__yearly_non_net_metering_misc

package: pudl

Annual time series of the number of non-net metered distributed generators, PV current flow type, backup capacity, and utility-owned capacity.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

EIA Form 861 -- Annual Electric Power Industry Report

Primary key:

This table has no primary key.

Additional Details

This table's data starts in 2016. For 2015 and earlier, data similar to that found in this table can be found in core_eia861__yearly_distributed_generation_misc.

The raw EIA861 distributed generation (DG) table (split into three normalized tables in PUDL) was renamed in 2016 to Non-Net Metering to prevent double counting. The data in the Non-Net Metering table (2016+) are split by sector, contain fuel cell information, and convert capacity reported in DC units to AC units.

Columns
backup_capacity_mw

The total nameplate capacity of generators that are used only for emergency backup service.

balancing_authority_code_eia

EIA short code identifying a balancing authority. May include Canadian and Mexican BAs.

generators_number

Total number of generators

pv_current_flow_type

Current flow type for photovoltaics: AC or DC

report_date

Date reported.

state

Two letter US state abbreviation.

utility_id_eia

The EIA Utility Identification number.

utility_owned_capacity_mw

Total non-net-metered capacity owned by the respondent.

data_maturity

Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.

core_eia861__yearly_operational_data_misc

package: pudl

Annual time series of megawatt-hours (MWh) for the sources and disposition of a utility's electricity.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

EIA Form 861 -- Annual Electric Power Industry Report (Schedule 2B)

Primary key:

utility_id_eia, nerc_region, report_date

Usage Warnings

  • Some values have been redacted.

  • Contains rows where missing values were imputed.

Additional Details

Respondents are required to report this information to the EIA, but are not required to disclose utility-level data to the public. When a respondent chooses to keep its utility-level data proprietary, it files using EIA utility id 88888. For more details, see EIA-861 Notable Irregularities.

Rows where data_observed is False were labeled as imputed in the raw EIA data. The EIA documentation does not specify what imputation applies to operational data; it covers only net metering and non-net metering distributed data. Fewer than 1% of rows are labeled as imputed, all of which occur in 2004 or 2005. Imputed rows make up no more than 25% of the rows for any utility.
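
Both caveats above can be handled before analysis by setting aside rows filed under utility id 88888 and rows flagged as imputed. The sketch below assumes rows have been loaded as dictionaries with the utility_id_eia and data_observed columns; the function name split_operational_rows and the sample rows are invented for illustration.

```python
REDACTED_UTILITY_ID = 88888  # ID used when utility-level data is kept proprietary


def split_operational_rows(rows):
    """Split rows into (usable, redacted, imputed) lists."""
    usable, redacted, imputed = [], [], []
    for row in rows:
        if row["utility_id_eia"] == REDACTED_UTILITY_ID:
            redacted.append(row)
        elif row["data_observed"] is False:
            imputed.append(row)
        else:
            # Rows with a missing data_observed flag are kept as usable here.
            usable.append(row)
    return usable, redacted, imputed


# Invented rows for illustration only; not real EIA data.
sample_rows = [
    {"utility_id_eia": 101, "report_date": "2004-01-01", "data_observed": True},
    {"utility_id_eia": 101, "report_date": "2005-01-01", "data_observed": False},
    {"utility_id_eia": 88888, "report_date": "2010-01-01", "data_observed": True},
]
usable, redacted, imputed = split_operational_rows(sample_rows)
```

Whether imputed rows should be dropped or kept depends on the analysis; separating them first keeps that choice explicit.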

Columns
consumed_by_facility_mwh

The amount of electricity used by the facility.

consumed_by_respondent_without_charge_mwh

The amount of electricity used by the electric utility in its electric and other departments without charge.

data_observed

Is the value observed (True) or imputed (False).

entity_type

Entity type of principal owner.

exchange_energy_delivered_mwh

The amount of exchange energy delivered. Does not include power delivered as part of a tolling arrangement.

exchange_energy_received_mwh

The amount of exchange energy received. Does not include power received through tolling arrangements.

furnished_without_charge_mwh

The amount of electricity furnished by the electric utility without charge, such as to a municipality under a franchise agreement or for street and highway lighting.

nerc_region

NERC region in which the plant is located

net_generation_mwh

Net electricity generation for the specified period in megawatt-hours (MWh).

net_power_exchanged_mwh

The net amount of energy exchanged. Net exchange is the difference between the amount of exchange received and the amount of exchange delivered. This entry should not include wholesale energy purchased from or sold to regulated companies or unregulated companies for other systems.

net_wheeled_power_mwh

The difference between the amount of energy entering the respondent's system (wheeled received) for transmission through the respondent's system and the amount of energy leaving the respondent's system (wheeled delivered). Wheeled net represents the energy losses on the respondent's system associated with the wheeling of energy for other systems.
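
The net exchange and net wheeled definitions above both reduce to received minus delivered. As a minimal worked example, assuming a row held as a dictionary and using invented numbers (the helper name net_of is hypothetical):

```python
def net_of(received_mwh, delivered_mwh):
    """Return received minus delivered, or None if either value is missing."""
    if received_mwh is None or delivered_mwh is None:
        return None
    return received_mwh - delivered_mwh


# Invented values for illustration only; not real EIA data.
row = {
    "exchange_energy_received_mwh": 1200.0,
    "exchange_energy_delivered_mwh": 1000.0,
    "wheeled_power_received_mwh": 500.0,
    "wheeled_power_delivered_mwh": 480.0,
}
net_exchanged = net_of(row["exchange_energy_received_mwh"],
                       row["exchange_energy_delivered_mwh"])
net_wheeled = net_of(row["wheeled_power_received_mwh"],
                     row["wheeled_power_delivered_mwh"])
```

In this example the 20 MWh net wheeled figure corresponds to the losses incurred while wheeling energy for other systems.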

report_date

Date reported.

retail_sales_mwh

MWh of sales to end-use customers in areas where the customer has been given the legal right to select a power supplier other than the traditional, vertically integrated electric utility.

sales_for_resale_mwh

The amount of electricity sold for resale purposes. This entry should include sales for resale to power marketers (reported separately in previous years), full and partial requirements customers, firm power customers and nonfirm customers.

short_form

Whether the reported information comes from the short form. In the case of form EIA 861, a shorter version of the form was created in 2012 to reduce respondent burden on smaller utilities and increase our processing efficiency.

state

Two letter US state abbreviation.

summer_peak_demand_mw

The maximum hourly summer load (for the months of June through September) based on net energy for the system during the reporting year. Net energy for the system is the sum of energy an electric utility needs to satisfy their service area and includes full and partial wholesale requirements customers, and the losses experienced in delivery. The maximum hourly load is determined by the interval in which the 60-minute integrated demand is the greatest.

total_disposition_mwh

Sum of all disposition of electricity listed. Includes sales to ultimate customers, sales for resale, energy furnished without charge, energy consumed by respondent without charge and total energy losses.

total_energy_losses_mwh

The total amount of electricity lost from transmission, distribution, and/or unaccounted for. Should be expressed as a positive number.

total_sources_mwh

Sum of all sources of electricity listed. Includes net generation, purchases from electricity suppliers, net exchanges (received - delivered), net wheeled (received - delivered), transmission by others, and losses.
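
A reported total can be checked against its listed components. The sketch below is one possible check, not a PUDL function: the mapping from the components named above to the columns in SOURCE_COLUMNS is an assumption, missing components are treated as zero, and the sample row is invented.

```python
import math

# Assumed mapping of the listed components to columns in this table.
SOURCE_COLUMNS = [
    "net_generation_mwh",
    "wholesale_power_purchases_mwh",
    "net_power_exchanged_mwh",
    "net_wheeled_power_mwh",
    "transmission_by_other_losses_mwh",
]


def sources_reconcile(row, rel_tol=1e-6):
    """Return True/False if the components match the total, None if no total."""
    reported = row.get("total_sources_mwh")
    if reported is None:
        return None
    # Treat missing components as zero contributions.
    component_sum = sum(row.get(col) or 0.0 for col in SOURCE_COLUMNS)
    return math.isclose(component_sum, reported, rel_tol=rel_tol)


# Invented values for illustration only; not real EIA data.
sample_row = {
    "net_generation_mwh": 1000.0,
    "wholesale_power_purchases_mwh": 500.0,
    "net_power_exchanged_mwh": 200.0,
    "net_wheeled_power_mwh": 20.0,
    "transmission_by_other_losses_mwh": -15.0,  # losses are reported as negative
    "total_sources_mwh": 1705.0,
}
```

Rows that fail the check are not necessarily wrong; they may reflect reporting differences or components not captured in the assumed column list.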

transmission_by_other_losses_mwh

The amount of energy losses associated with the wheeling of electricity provided to the respondent's system by other utilities. Transmission by others, losses should always be a negative value.

utility_id_eia

The EIA Utility Identification number.

utility_name_eia

The name of the utility.

wheeled_power_delivered_mwh

The total amount of energy leaving the respondent's system that was transmitted through the respondent's system for delivery to other systems. If wheeling delivered is not precisely known, the value is an estimate based on the respondent's system's known percentage of losses for wheeling transactions.

wheeled_power_received_mwh

The total amount of energy entering the respondent's system from other systems for transmission through the respondent's system (wheeling) for delivery to other systems. Does not include energy purchased or exchanged for consumption within the respondent's system, which was wheeled to the respondent by others.

wholesale_power_purchases_mwh

Purchases from electricity suppliers.

winter_peak_demand_mw

The maximum hourly winter load (for the months of January through March) based on net energy for the system during the reporting year. Net energy for the system is the sum of energy an electric utility needs to satisfy their service area and includes full and partial wholesale requirements customers, and the losses experienced in delivery. The maximum hourly load is determined by the interval in which the 60-minute integrated demand is the greatest.

data_maturity

Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.

core_eia861__yearly_operational_data_revenue

package: pudl

Annual time series of utility revenue by type of electric operating revenue.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

EIA Form 861 -- Annual Electric Power Industry Report (Schedule 2C)

Primary key:

utility_id_eia, nerc_region, report_date, revenue_class

Usage Warnings

  • Some values have been redacted.

Additional Details

A utility's revenue by type of electric operating revenue. Includes electric operating revenue from sales to ultimate customers, revenue from unbundled (delivery) customers, revenue from sales for resale, electric credits/other adjustments, revenue from transmission, other electric operating revenue, and total electric operating revenue.

Respondents are required to report this information to the EIA, but are not required to disclose utility-level data to the public. When a respondent chooses to keep its utility-level data proprietary, it files using EIA utility ID 88888. For more details, see EIA-861 Notable Irregularities.
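As a rough illustration of the note above, rows filed under the placeholder utility ID can be separated from rows that carry a real utility ID before any per-utility analysis. This is a minimal sketch: the helper name split_withheld and the example rows (including the revenue_class values) are made up for illustration and are not part of PUDL.

```python
# Placeholder ID used by respondents that keep utility-level data proprietary.
WITHHELD_UTILITY_ID = 88888


def split_withheld(records):
    """Separate rows filed under the placeholder ID from rows with a real utility ID."""
    identified = [r for r in records if r["utility_id_eia"] != WITHHELD_UTILITY_ID]
    withheld = [r for r in records if r["utility_id_eia"] == WITHHELD_UTILITY_ID]
    return identified, withheld


# Made-up example rows, not real EIA data.
rows = [
    {"utility_id_eia": 101, "revenue_class": "retail_sales", "revenue": 1200.0},
    {"utility_id_eia": 88888, "revenue_class": "retail_sales", "revenue": 300.0},
    {"utility_id_eia": 202, "revenue_class": "transmission", "revenue": None},
]
identified, withheld = split_withheld(rows)
```

Keeping the withheld rows in a separate list, rather than dropping them, makes it possible to still include them in aggregate totals.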

Columns
nerc_region

NERC region in which the plant is located

report_date

Date reported.

revenue

Amount of revenue.

revenue_class

Source of revenue (e.g., retail sales, transmission).

state

Two letter US state abbreviation.

utility_id_eia

The EIA Utility Identification number.

data_maturity

Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.

core_eia861__yearly_reliability

package: pudl

Annual time series of electricity system reliability and outage impacts.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

EIA Form 861 -- Annual Electric Power Industry Report (Schedules 3B and 3C)

Primary key:

utility_id_eia, state, report_date, standard

Note: standard is included because while respondents are asked to only fill out one of parts B or C, sometimes they fill out both.

Additional Details

Contains information on non-momentary electrical interruptions. Includes the system average interruption duration index (SAIDI), the system average interruption frequency index (SAIFI), and the customer average interruption duration index (CAIDI, equal to SAIDI/SAIFI), each reported with and without major event days and loss of service. Also includes the standard used (IEEE or other) and other relevant information.

SAIDI is measured in minutes, and represents the sum over the year of all customers interrupted times the number of minutes they experienced an interruption, divided by total number of customers.

SAIFI is measured in customers, and represents the sum over the year of the total number of customers that experienced an interruption, divided by the total number of customers.

The IEEE standards which can be used to calculate SAIDI and SAIFI include IEEE 1366-2003 and IEEE 1366-2012. These standards define momentary interruptions as having a duration of five minutes or less. If one of these IEEE standards is used, respondents fill out part B and are required to specify:

  • SAIDI major event days minus loss of supply

  • SAIFI major event days minus loss of supply

The SAIDI and SAIFI values for major event days minus loss of supply exclude reliability events that resulted from a loss of supply on the high-voltage system, leaving only events that originated on the distribution system. The voltage that distinguishes the distribution system from the supply system is given in highest_distribution_voltage_kv.

If a method other than these IEEE standards is used for calculating SAIDI and SAIFI indexes, respondents fill out part C and are required to specify:

  • whether inactive accounts are included

  • how they define a momentary interruption (less than 1 minute, less than or equal to 5 minutes, or other)

In this table, column standard is "ieee_standard" for respondents who have filled out part B, and "other_standard" for respondents who have filled out part C.
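The index definitions above can be worked through on a small example. The sketch below assumes a hypothetical list of (customers interrupted, minutes) events for one year; the function names and the numbers are illustrative only and do not come from EIA or PUDL.

```python
def saidi_minutes(events, total_customers):
    """Sum of customers interrupted times minutes interrupted, divided by total customers."""
    return sum(customers * minutes for customers, minutes in events) / total_customers


def saifi(events, total_customers):
    """Total number of customer interruptions divided by total customers."""
    return sum(customers for customers, _ in events) / total_customers


def caidi_minutes(saidi_value, saifi_value):
    """CAIDI as SAIDI / SAIFI; None when it is undefined."""
    if saidi_value is None or saifi_value is None or saifi_value == 0:
        return None
    return saidi_value / saifi_value


# Made-up outage events for one year: (customers interrupted, minutes interrupted).
events = [(500, 60), (250, 120)]
total_customers = 1000

saidi_value = saidi_minutes(events, total_customers)  # (30000 + 30000) / 1000
saifi_value = saifi(events, total_customers)          # 750 / 1000
caidi_value = caidi_minutes(saidi_value, saifi_value)
```

With these made-up numbers, the average customer is interrupted for 60 minutes over the year, there are 0.75 interruptions per customer, and each interruption lasts 80 minutes on average.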

Columns
caidi_w_major_event_days_minus_loss_of_service_minutes

Average number of minutes per interruption (SAIDI/SAIFI) including major event days and excluding reliability events caused by a loss of supply.

caidi_w_major_event_days_minutes

Average number of minutes per interruption (SAIDI/SAIFI) including major event days.

caidi_wo_major_event_days_minutes

Average number of minutes per interruption (SAIDI/SAIFI) excluding major event days.

customers

Number of customers.

entity_type

Entity type of principal owner.

highest_distribution_voltage_kv

The highest voltage that's part of the distribution system.

inactive_accounts_included

Whether the respondent includes inactive accounts in its definition of customers used to determine SAIDI and SAIFI.

momentary_interruption_definition

How the respondent defines momentary service interruptions: less than 1 min, equal to or less than 5 min, or some other way.

outages_recorded_automatically

Whether the information about customer outages is recorded automatically.

report_date

Date reported.

saidi_w_major_event_days_minus_loss_of_service_minutes

Cumulative duration (minutes) of interruption for the average customer during the report year including major event days and excluding reliability events caused by a loss of supply.

saidi_w_major_event_days_minutes

Cumulative duration (minutes) of interruption for the average customer during the report year including major event days.

saidi_wo_major_event_days_minutes

Cumulative duration (minutes) of interruption for the average customer during the report year excluding major event days.

saifi_w_major_event_days_customers

Average number of times a customer experienced a sustained interruption (over 5 minutes) during the report year including major event days.

saifi_w_major_event_days_minus_loss_of_service_customers

Average number of times a customer experienced a sustained interruption (over 5 minutes) during the report year including major event days and excluding reliability events caused by a loss of supply.

saifi_wo_major_event_days_customers

Average number of times a customer experienced a sustained interruption (over 5 minutes) during the report year excluding major event days.

short_form

Whether the reported information comes from the short form. In the case of form EIA 861, a shorter version of the form was created in 2012 to reduce respondent burden on smaller utilities and increase our processing efficiency.

standard

Whether the respondent calculates SAIDI/SAIFI, and major event days according to the IEEE or an Other standard.

state

Two letter US state abbreviation.

utility_id_eia

The EIA Utility Identification number.

utility_name_eia

The name of the utility.

data_maturity

Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.

core_eia861__yearly_sales

package: pudl

Annual time series of electricity sales to ultimate customers by utility, balancing authority, state, and customer class.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

EIA Form 861 -- Annual Electric Power Industry Report

Primary key:

utility_id_eia, state, report_date, balancing_authority_code_eia, customer_class, business_model, service_type

Columns
utility_id_eia

The EIA Utility Identification number.

state

Two letter US state abbreviation.

report_date

Date reported.

balancing_authority_code_eia

EIA short code identifying a balancing authority. May include Canadian and Mexican BAs.

customer_class

High level categorization of customer type (e.g., commercial, residential).

business_model

Business model.

data_observed

Is the value observed (True) or imputed (False).

entity_type

Entity type of principal owner.

service_type

The type of service the respondent provides to a given customer class. Bundled: both energy and delivery; energy: just the energy consumed; delivery: just the billing and energy delivery services.

short_form

Whether the reported information comes from the short form. In the case of form EIA 861, a shorter version of the form was created in 2012 to reduce respondent burden on smaller utilities and increase our processing efficiency.

utility_name_eia

The name of the utility.

customers

Number of customers.

sales_mwh

Quantity of electricity sold in MWh.

sales_revenue

Revenue from electricity sold.

data_maturity

Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.

core_eia861__yearly_service_territory

package: pudl

Annual time series of counties in utility service territories.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

EIA Form 861 -- Annual Electric Power Industry Report

Primary key:

report_date, utility_id_eia, county_id_fips

Columns
county

County name.

short_form

Whether the reported information comes from the short form. In the case of form EIA 861, a shorter version of the form was created in 2012 to reduce respondent burden on smaller utilities and increase our processing efficiency.

state

Two letter US state abbreviation.

utility_id_eia

The EIA Utility Identification number.

utility_name_eia

The name of the utility.

report_date

Date reported.

state_id_fips

Two digit state FIPS code.

county_id_fips

County ID from the Federal Information Processing Standard Publication 6-4.

data_maturity

Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.

core_eia861__yearly_short_form

package: pudl

Annual time series of data from the short form (EIA-861S).

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

EIA Form 861 -- Annual Electric Power Industry Report

Primary key:

utility_id_eia, state, report_date, balancing_authority_code_eia

Usage Warnings

  • Some years are missing from the data record.

Additional Details

The data started being reported in 2012. However, the 2019 data is not available. Respondents are expected to submit the completed Form EIA-861S to EIA by April 30th, following the end of the prior calendar year. Utilities report on Form EIA-861S if they:

  • Reported less than 200,000 megawatthours on their most recent Form EIA-861.

  • Provide only bundled service (generation and distribution).

  • Are not needed to ensure acceptable quality of statistical estimates.

  • Are not part of the aggregate TVA or WPPI.

  • Do not report on Form EIA-861M.

Columns
report_date

Date reported.

utility_id_eia

The EIA Utility Identification number.

utility_name_eia

The name of the utility.

entity_type

Entity type of principal owner.

state

Two letter US state abbreviation.

balancing_authority_code_eia

EIA short code identifying a balancing authority. May include Canadian and Mexican BAs.

sales_revenue

Revenue from electricity sold.

sales_mwh

Quantity of electricity sold in MWh.

customers

Number of customers.

has_net_metering

Whether the plant has a net metering agreement in effect during the reporting year. (Only displayed for facilities that report the sun or wind as an energy source.) This field was only reported up until 2015.

has_demand_side_management

Whether there were strategies or measures used to control electricity demand by customers

has_time_responsive_programs

Whether the respondent operates any time-based rate programs (e.g., real-time pricing, critical peak pricing, variable peak pricing and time-of-use rates administered through a tariff).

has_green_pricing

Whether a green pricing program was associated with this utility during the reporting year.

data_maturity

Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.

core_eia861__yearly_utility_data_misc

package: pudl

Annual time series of utility business activities.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

EIA Form 861 -- Annual Electric Power Industry Report

Primary key:

This table has no primary key. The primary key would have been: utility_id_eia, state, report_date, and nerc_region, but there are nulls in the state column across several years of reporting.

Additional Details

This table records whether utilities operate alternative fuel vehicles, whether they provide transmission, distribution, or generation services (bundled or unbundled), and whether they engage in wholesale and/or retail markets.
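The primary key note for this table can be checked directly: a set of columns only works as a key if no key column is null and no combination of key values repeats. The following is a minimal sketch of such a check; the helper name candidate_key_problems and the example rows are hypothetical.

```python
def candidate_key_problems(records, key_columns):
    """Count rows with a null in any key column, and repeated complete keys."""
    rows_with_null_key = 0
    duplicate_keys = 0
    seen = set()
    for record in records:
        key = tuple(record.get(column) for column in key_columns)
        if any(value is None for value in key):
            rows_with_null_key += 1
            continue  # an incomplete key cannot be compared for uniqueness
        if key in seen:
            duplicate_keys += 1
        seen.add(key)
    return {"rows_with_null_key": rows_with_null_key, "duplicate_keys": duplicate_keys}


# Made-up example rows, not real EIA data.
rows = [
    {"utility_id_eia": 1, "state": "CO", "report_date": "2020-01-01", "nerc_region": "WECC"},
    {"utility_id_eia": 2, "state": None, "report_date": "2020-01-01", "nerc_region": "SERC"},
    {"utility_id_eia": 1, "state": "CO", "report_date": "2021-01-01", "nerc_region": "WECC"},
]
key_columns = ["utility_id_eia", "state", "report_date", "nerc_region"]
problems = candidate_key_problems(rows, key_columns)
```

A non-zero rows_with_null_key count is the situation described for this table: the candidate key columns are known, but nulls in state prevent them from serving as a primary key.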

Columns
alternative_fuel_vehicle_2_activity

Whether the utility plans to operate alternative-fueled vehicles this coming year.

alternative_fuel_vehicle_activity

Whether the utility operates alternative-fueled vehicles during the year.

bundled_activity

Whether a utility engaged in combined utility services (electricity plus other services such as gas or water) during the year.

buying_distribution_activity

Whether a utility bought any distribution on other electrical systems during the year.

buying_transmission_activity

Whether a utility bought any transmission services on other electrical systems during the year.

distribution_activity

Whether a utility engaged in any distribution using owned/leased electrical wires during the year.

entity_type

Entity type of principal owner.

generation_activity

Whether a utility used generation from company-owned plants during the year.

nerc_region

NERC region in which the plant is located

operates_generating_plant

Whether the respondent operated at least one generating plant during the reporting period.

report_date

Date reported.

retail_marketing_activity

Whether a utility engaged in retail power marketing during the year.

short_form

Whether the reported information comes from the short form. In the case of form EIA 861, a shorter version of the form was created in 2012 to reduce respondent burden on smaller utilities and increase our processing efficiency.

state

Two letter US state abbreviation.

transmission_activity

Whether a utility engaged in any transmission activities during the year.

utility_id_eia

The EIA Utility Identification number.

utility_name_eia

The name of the utility.

wholesale_marketing_activity

Whether a utility engaged in wholesale power marketing during the year.

data_maturity

Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.

core_eia861__yearly_utility_data_nerc

package: pudl

Annual time series of the NERC regions that utilities operate in.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

EIA Form 861 -- Annual Electric Power Industry Report

Primary key:

This table has no primary key. The primary key would have been: utility_id_eia, state, report_date, nerc_region, and nerc_regions_of_operation, but there are nulls in the state column across several years of reporting.

Columns
nerc_region

NERC region in which the plant is located

nerc_regions_of_operation

All the regional entities within the North American Electric Reliability Corporation (NERC) in which the respondent conducts operations.

report_date

Date reported.

state

Two letter US state abbreviation.

utility_id_eia

The EIA Utility Identification number.

data_maturity

Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.

core_eia861__yearly_utility_data_rto

package: pudl

Annual time series of the RTOs that utilities operate in.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

EIA Form 861 -- Annual Electric Power Industry Report

Primary key:

This table has no primary key. The primary key would have been: utility_id_eia, state, report_date, nerc_region, and rtos_of_operation, but there are nulls in the state column across several years of reporting.

Columns
nerc_region

NERC region in which the plant is located

report_date

Date reported.

rtos_of_operation

The ISOs/RTOs in which the respondent conducts operations.

state

Two letter US state abbreviation.

utility_id_eia

The EIA Utility Identification number.

data_maturity

Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.

core_eia923__entity_coalmine

package: pudl

Entity table containing static information about coal mines reporting deliveries in the Fuel Receipts and Costs table.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

EIA Form 923 -- Power Plant Operations Report (Schedule 2, Part C)

Primary key:

mine_id_pudl

Usage Warnings

  • Mine IDs are not as well defined and consistent as other EIA IDs.

Additional Details

This table is produced during the transformation of fuel delivery data, in order to produce a better normalized database. The same coal mines report many individual deliveries, and repeating their attributes many times in the fuel receipts and costs table is duplicative. Unfortunately, the coal mine attributes do not generally use a controlled vocabulary or well-defined IDs, so in practice there are many distinct records in this table that correspond to the same mines in reality.

We have not yet taken the time to rigorously clean this data, but it could be linked with both Mining Safety and Health Administration (MSHA) and USGS data to provide more insight into where coal is coming from, and what the employment and geological context is for those supplies.

Columns
mine_id_pudl

Dynamically assigned PUDL mine identifier.

mine_name

Coal mine name.

mine_type_code

Type of coal mine.

state

Two letter US state abbreviation.

county_id_fips

County ID from the Federal Information Processing Standard Publication 6-4.

mine_id_msha

MSHA issued mine identifier.

data_maturity

Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.

core_eia923__monthly_boiler_fuel

package: pudl

Monthly time series of boiler fuel consumption and emissions.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

EIA Form 923 -- Power Plant Operations Report (Schedule 3)

Primary key:

plant_id_eia, boiler_id, energy_source_code, prime_mover_code, report_date

Usage Warnings

  • Date column arbitrarily uses the first of the month.

  • A small number of respondents only report annual fuel consumption, and all of it is reported in December.

Additional Details

Reports the quantity of each type of fuel consumed by each boiler on a monthly basis, as well as the sulfur and ash content of those fuels. Fuel quantity is reported in standard EIA fuel units (tons, barrels, Mcf). Heat content per unit of fuel is also reported, making this table useful for calculating the thermal efficiency (heat rate) of various generation units.

This table provides better coverage of the entire fleet of generators than the core_eia923__monthly_generation_fuel table, but the fuel consumption reported here is not directly associated with a generator. This complicates the heat rate calculation, since the associations between individual boilers and generators are incomplete and can be complex.
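To make the heat rate calculation concrete: the heat content of the fuel burned is the fuel quantity multiplied by the heat content per unit, and the heat rate divides that by net generation. The sketch below assumes the net generation for the matching generators has already been obtained from another table and that the boiler-generator association is known, which, as noted above, is often the hard part. The function names and numbers are illustrative.

```python
def fuel_consumed_mmbtu(fuel_consumed_units, fuel_mmbtu_per_unit):
    """Heat content of the fuel burned, in MMBtu; None if either input is missing."""
    if fuel_consumed_units is None or fuel_mmbtu_per_unit is None:
        return None
    return fuel_consumed_units * fuel_mmbtu_per_unit


def heat_rate_mmbtu_per_mwh(total_mmbtu, net_generation_mwh):
    """Heat rate in MMBtu per MWh; None if inputs are missing or generation is zero."""
    if total_mmbtu is None or not net_generation_mwh:
        return None
    return total_mmbtu / net_generation_mwh


# Made-up boiler record for one month: 1,000 tons of coal at 20 MMBtu per ton.
boiler_month = {"fuel_consumed_units": 1000.0, "fuel_mmbtu_per_unit": 20.0}
# Assumed net generation (MWh) for the generators served by this boiler.
net_generation_mwh = 2000.0

total_mmbtu = fuel_consumed_mmbtu(
    boiler_month["fuel_consumed_units"], boiler_month["fuel_mmbtu_per_unit"]
)
heat_rate = heat_rate_mmbtu_per_mwh(total_mmbtu, net_generation_mwh)
```

For combined heat and power plants, fuel_consumed_units also covers fuel burned for process steam, so a heat rate computed this way would overstate the fuel used for electricity.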

Columns
plant_id_eia

The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.

boiler_id

Alphanumeric boiler ID.

energy_source_code

A 2-3 letter code indicating the energy source (e.g. fuel type) associated with the record.

prime_mover_code

Code for the type of prime mover (e.g. CT, CG)

fuel_type_code_pudl

Simplified fuel type code used in PUDL

report_date

Date reported.

fuel_consumed_units

Consumption of the fuel type in physical units. Note: this is the total quantity consumed for both electricity and, in the case of combined heat and power plants, process steam production.

fuel_mmbtu_per_unit

Heat content of the fuel in millions of Btus per physical unit.

sulfur_content_pct

Sulfur content percentage by weight to the nearest 0.01 percent.

ash_content_pct

Ash content percentage by weight to the nearest 0.1 percent.

data_maturity

Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.

core_eia923__monthly_energy_storage

package: pudl

Monthly time series of energy consumed and discharged ("generated") by energy storage units.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

EIA Form 923 -- Power Plant Operations Report (Schedule 3)

Primary key:

plant_id_eia, report_date, prime_mover_code, energy_source_code

Usage Warnings

  • Date column arbitrarily uses the first of the month.

Additional Details

The total MWh discharged from the energy storage unit during the reporting period is the gross generation and the difference between gross generation and consumption is the net generation.
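The relationship described above can be written as a one-line calculation. This sketch assumes the energy consumed by the storage unit is available in MWh and is passed in explicitly; which column holds that value is not stated here, so the parameter name energy_consumed_mwh is an assumption.

```python
def storage_net_generation_mwh(gross_generation_mwh, energy_consumed_mwh):
    """Net generation = energy discharged (gross generation) minus energy consumed."""
    if gross_generation_mwh is None or energy_consumed_mwh is None:
        return None
    return gross_generation_mwh - energy_consumed_mwh


# Made-up month: the unit took in 100 MWh while charging and discharged 80 MWh.
net_mwh = storage_net_generation_mwh(gross_generation_mwh=80.0, energy_consumed_mwh=100.0)
```

Because a storage unit takes in more energy than it discharges, net generation computed this way is typically negative.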

Columns
plant_id_eia

The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.

report_date

Date reported.

prime_mover_code

Code for the type of prime mover (e.g. CT, CG)

energy_source_code

A 2-3 letter code indicating the energy source (e.g. fuel type) associated with the record.

data_maturity

Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.

fuel_units

Reported unit of measure for fuel.

fuel_consumed_for_electricity_units

Consumption for electric generation of the fuel type in physical units.

fuel_consumed_units

Consumption of the fuel type in physical units. Note: this is the total quantity consumed for both electricity and, in the case of combined heat and power plants, process steam production.

gross_generation_mwh

Gross electricity generation for the specified period in megawatt-hours (MWh).

net_generation_mwh

Net electricity generation for the specified period in megawatt-hours (MWh).

core_eia923__monthly_fuel_receipts_costs

package: pudl

Monthly time series of fuel deliveries to power plants.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

EIA Form 923 -- Power Plant Operations Report (Schedule 2 - Part A)

Primary key:

This table has no primary key. Each record describes an individual fuel delivery. There can be multiple deliveries of the same type of fuel from the same supplier to the same plant in a single month, so the table has no natural primary key.

Usage Warnings

  • Date column arbitrarily uses the first of the month.

  • Some values have been redacted.

  • Time of fuel deliveries is not necessarily connected with time of fuel consumption.

Additional Details

There can be a significant delay between the receipt of fuel and its consumption, so using this table to infer monthly attributes associated with power generation may not be entirely accurate. However, this is the most granular data we have describing fuel costs, and we use it in calculating the marginal cost of electricity for individual generation units.

Under some circumstances utilities are allowed to treat the price of fuel as proprietary business data, meaning it is redacted from the publicly available spreadsheets. It's still reported to EIA and influences the aggregated (state, region, annual, etc.) fuel prices they publish. From 2009-2021 about 1/3 of all prices are redacted. The missing data is not randomly distributed. Deregulated markets dominated by merchant generators (independent power producers) redact much more data, and natural gas is by far the most likely fuel to have its price redacted. This means, for instance, that the entire Northeastern US reports essentially no fine-grained data about its natural gas prices.

Additional data which we haven't yet integrated is available in a similar format from 2002-2008 via the EIA-423, and going back as far as 1972 from the FERC-423.
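Because each record is a single delivery and many prices are redacted, a per-plant monthly fuel price has to be aggregated with some care. The sketch below is one possible approach, not PUDL's own method: it weights fuel_cost_per_mmbtu by the heat content of each delivery, skips deliveries with no price, and reports what share of the delivered heat content actually had a price. The function name and example rows are hypothetical.

```python
from collections import defaultdict


def weighted_fuel_cost(deliveries):
    """Heat-weighted average cost per MMBtu by (plant, month, energy source).

    Returns {key: (average_cost_or_None, share_of_mmbtu_with_a_price)}.
    """
    totals = defaultdict(lambda: {"cost": 0.0, "priced_mmbtu": 0.0, "all_mmbtu": 0.0})
    for d in deliveries:
        if d["fuel_received_units"] is None or d["fuel_mmbtu_per_unit"] is None:
            continue  # heat content of this delivery is unknown
        mmbtu = d["fuel_received_units"] * d["fuel_mmbtu_per_unit"]
        key = (d["plant_id_eia"], d["report_date"], d["energy_source_code"])
        totals[key]["all_mmbtu"] += mmbtu
        if d["fuel_cost_per_mmbtu"] is not None:  # None stands in for a redacted price
            totals[key]["cost"] += d["fuel_cost_per_mmbtu"] * mmbtu
            totals[key]["priced_mmbtu"] += mmbtu
    result = {}
    for key, t in totals.items():
        average = t["cost"] / t["priced_mmbtu"] if t["priced_mmbtu"] > 0 else None
        share = t["priced_mmbtu"] / t["all_mmbtu"] if t["all_mmbtu"] > 0 else None
        result[key] = (average, share)
    return result


# Made-up deliveries to one plant in one month; the last price is redacted.
base = {"plant_id_eia": 1, "report_date": "2020-01-01", "energy_source_code": "NG",
        "fuel_mmbtu_per_unit": 1.0}
deliveries = [
    {**base, "fuel_received_units": 1000.0, "fuel_cost_per_mmbtu": 2.0},
    {**base, "fuel_received_units": 3000.0, "fuel_cost_per_mmbtu": 4.0},
    {**base, "fuel_received_units": 4000.0, "fuel_cost_per_mmbtu": None},
]
costs = weighted_fuel_cost(deliveries)
```

Reporting the priced share alongside the average makes it visible when an apparent monthly price rests on only a fraction of the fuel delivered, which matters given that redaction is not randomly distributed.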

Columns
plant_id_eia

The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.

report_date

Date reported.

contract_type_code

Purchase type under which receipts occurred in the reporting month. C: Contract, NC: New Contract, S: Spot Purchase, T: Tolling Agreement.

contract_expiration_date

Date contract expires. Format: MMYY.

energy_source_code

A 2-3 letter code indicating the energy source (e.g. fuel type) associated with the record.

fuel_type_code_pudl

Simplified fuel type code used in PUDL

fuel_group_code

Fuel groups used in the Electric Power Monthly

mine_id_pudl

Dynamically assigned PUDL mine identifier.

supplier_name

Company that sold the fuel to the plant or, in the case of Natural Gas, pipeline owner.

fuel_received_units

Quantity of fuel received in tons, barrels, or Mcf.

fuel_mmbtu_per_unit

Heat content of the fuel in millions of Btus per physical unit.

sulfur_content_pct

Sulfur content percentage by weight to the nearest 0.01 percent.

ash_content_pct

Ash content percentage by weight to the nearest 0.1 percent.

mercury_content_ppm

Mercury content in parts per million (ppm) to the nearest 0.001 ppm.

fuel_cost_per_mmbtu

Average fuel cost per mmBTU of heat content in nominal USD.

primary_transportation_mode_code

Transportation mode for the longest distance transported.

secondary_transportation_mode_code

Transportation mode for the second longest distance transported.

natural_gas_transport_code

Contract type for natural gas transportation service.

natural_gas_delivery_contract_type_code

Contract type for natural gas delivery service:

moisture_content_pct

For coal only: the moisture content of the fuel in terms of moisture percentage by weight. Reported to the nearest 0.01 percent.

chlorine_content_ppm

For coal only: the chlorine content in parts per million (ppm) to the nearest 0.001 ppm. If lab tests of the coal do not include the chlorine content, this field contains the amount specified in the contract with the supplier.

data_maturity

Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.

core_eia923__monthly_generation

package: pudl

Monthly time series of net electricity generated by power plant generators.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

EIA Form 923 -- Power Plant Operations Report (Schedule 3)

Primary key:

plant_id_eia, generator_id, report_date

Usage Warnings

  • Date column arbitrarily uses the first of the month.

  • Not all IDs are present.

  • Table has known low coverage - either geographic or temporal or otherwise.

  • A small number of respondents only report annual fuel consumption, and all of it is reported in December.

Additional Details

This is the most granular information we have about how much electricity individual generators are producing, but only about half of all the generation reported in the core_eia923__monthly_generation_fuel table appears in this table due to the different reporting requirements imposed on different types and sizes of generators.

Whenever possible, we use this generator-level net generation to estimate the heat rates of generation units and the marginal cost of electricity on a per-generator basis, but those calculations depend on plant-level fuel costs and sometimes uncertain or incomplete boiler-generator associations.
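The coverage gap described above can be estimated by comparing total net generation in this table with the total in core_eia923__monthly_generation_fuel. The sketch below assumes both tables have already been loaded as lists of rows; the function name and the example values are made up. Note that generator_id is kept as a string throughout.

```python
def generator_coverage_share(generation_rows, generation_fuel_rows):
    """Share of generation_fuel net generation that is also reported by generator."""
    by_generator = sum(
        r["net_generation_mwh"] for r in generation_rows
        if r["net_generation_mwh"] is not None
    )
    by_fuel = sum(
        r["net_generation_mwh"] for r in generation_fuel_rows
        if r["net_generation_mwh"] is not None
    )
    if by_fuel == 0:
        return None  # nothing to compare against
    return by_generator / by_fuel


# Made-up rows, not real EIA data.
generation_rows = [
    {"plant_id_eia": 1, "generator_id": "1", "net_generation_mwh": 30.0},
    {"plant_id_eia": 1, "generator_id": "2A", "net_generation_mwh": 20.0},
]
generation_fuel_rows = [
    {"plant_id_eia": 1, "prime_mover_code": "ST", "energy_source_code": "BIT",
     "net_generation_mwh": 50.0},
    {"plant_id_eia": 2, "prime_mover_code": "CT", "energy_source_code": "NG",
     "net_generation_mwh": 50.0},
]
share = generator_coverage_share(generation_rows, generation_fuel_rows)
```

The same comparison could be grouped by year or by plant to see where generator-level reporting is missing.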

Columns
plant_id_eia

The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.

generator_id

Generator ID is usually numeric, but sometimes includes letters. Make sure you treat it as a string!
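A minimal sketch of why the string treatment matters, assuming the table has been exported to CSV. The sample rows are invented for illustration and the helper name load_generation is hypothetical.

```python
import csv
import io

# Hypothetical CSV extract; these rows are made up for illustration.
SAMPLE = """plant_id_eia,generator_id,report_date,net_generation_mwh
3,1,2022-01-01,1200.5
3,01,2022-01-01,800.0
3,GT1,2022-01-01,50.25
"""


def load_generation(text):
    """Parse rows, converting numeric columns but leaving generator_id as text."""
    rows = []
    for rec in csv.DictReader(io.StringIO(text)):
        rows.append(
            {
                "plant_id_eia": int(rec["plant_id_eia"]),
                # Keep as a string: int("01") would collapse into 1,
                # and int("GT1") would raise ValueError.
                "generator_id": rec["generator_id"],
                "report_date": rec["report_date"],
                "net_generation_mwh": float(rec["net_generation_mwh"]),
            }
        )
    return rows


rows = load_generation(SAMPLE)
generator_ids = [r["generator_id"] for r in rows]
```

If generator_id were cast to an integer, the first two sample rows would share a primary key and the third would fail to parse.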

report_date

Date reported.

net_generation_mwh

Net electricity generation for the specified period in megawatt-hours (MWh).

data_maturity

Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.

core_eia923__monthly_generation_fuel

package: pudl

Monthly time series of generation and fuel consumption by fuel and prime mover.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

EIA Form 923 -- Power Plant Operations Report (Schedule 3)

Primary key:

plant_id_eia, report_date, prime_mover_code, energy_source_code

Usage Warnings

  • Date column arbitrarily uses the first of the month.

  • A small number of respondents only report annual fuel consumption, and all of it is reported in December.

Additional Details

This data can't be easily linked to individual boilers, generators, and generation units, but it provides the most complete coverage of fuel consumption and electricity generation for the entire generation fleet. When fuel consumption and generation aren't directly reported in the core_eia923__monthly_generation and core_eia923__monthly_boiler_fuel tables, we use the primary fuels and prime movers reported for each generator, along with their capacities, to attribute them to generators in order to calculate capacity factors, heat rates, and the marginal cost of electricity.

The table makes a distinction between all fuel consumed and fuel consumed for electricity generation because some units are also combined heat and power (CHP) units, and also provide high temperature process heat at the expense of net electricity generation.
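To make the distinction between total fuel and fuel for electricity concrete, the sketch below computes the share of a record's fuel heat content that went to electricity generation. The helper name electricity_fuel_share and the sample values are hypothetical.

```python
def electricity_fuel_share(row):
    """Fraction of total fuel heat content that was used to generate electricity."""
    total = row["fuel_consumed_mmbtu"]
    for_electricity = row["fuel_consumed_for_electricity_mmbtu"]
    if total is None or for_electricity is None or total == 0:
        return None
    return for_electricity / total


# Made-up records: a CHP unit that diverts heat to process steam,
# and a power-only unit that uses all of its fuel for electricity.
chp_row = {
    "fuel_consumed_mmbtu": 1000.0,
    "fuel_consumed_for_electricity_mmbtu": 600.0,
}
power_only_row = {
    "fuel_consumed_mmbtu": 500.0,
    "fuel_consumed_for_electricity_mmbtu": 500.0,
}

chp_share = electricity_fuel_share(chp_row)
power_only_share = electricity_fuel_share(power_only_row)
```

A share well below 1.0 suggests a combined heat and power record, though this is only a heuristic and not a flag provided by the table.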

Columns
report_date

Date reported.

plant_id_eia

The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.

energy_source_code

A 2-3 letter code indicating the energy source (e.g. fuel type) associated with the record.

fuel_type_code_pudl

Simplified fuel type code used in PUDL

fuel_type_code_agg

A partial aggregation of the reported fuel type codes into larger categories used by EIA in, for example, the Annual Energy Review (AER) or Monthly Energy Review (MER). Two or three letter alphanumeric.

prime_mover_code

Code for the type of prime mover (e.g. CT, CG)

fuel_consumed_units

Consumption of the fuel type in physical units. Note: this is the total quantity consumed for both electricity and, in the case of combined heat and power plants, process steam production.

fuel_consumed_for_electricity_units

Consumption of the fuel type for electricity generation, in physical units.

fuel_mmbtu_per_unit

Heat content of the fuel in millions of Btus per physical unit.
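The sketch below shows how heat content links the physical-unit and MMBtu columns, under the assumption (not stated in this listing) that MMBtu consumed is approximately physical units multiplied by heat content per unit. The helper names and sample numbers are hypothetical.

```python
def implied_mmbtu(fuel_consumed_units, fuel_mmbtu_per_unit):
    """Heat content implied by physical consumption and per-unit heat content."""
    if fuel_consumed_units is None or fuel_mmbtu_per_unit is None:
        return None
    return fuel_consumed_units * fuel_mmbtu_per_unit


def relative_gap(reported_mmbtu, implied):
    """Relative difference between reported and implied MMBtu, if both exist."""
    if reported_mmbtu is None or implied is None or reported_mmbtu == 0:
        return None
    return (implied - reported_mmbtu) / reported_mmbtu


# Made-up record: 1,000 physical units at 20.5 MMBtu per unit.
row = {
    "fuel_consumed_units": 1000.0,
    "fuel_mmbtu_per_unit": 20.5,
    "fuel_consumed_mmbtu": 20500.0,
}

implied = implied_mmbtu(row["fuel_consumed_units"], row["fuel_mmbtu_per_unit"])
gap = relative_gap(row["fuel_consumed_mmbtu"], implied)
```

A large relative gap on real data could indicate rounding, a unit mismatch, or a reporting error, and would need to be checked against the raw EIA-923 values.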

fuel_consumed_mmbtu

Total consumption of fuel in millions of Btus (MMBtu), year to date. Note: this is the total quantity consumed for both electricity and, in the case of combined heat and power plants, process steam production.

fuel_consumed_for_electricity_mmbtu

Total consumption of fuel to produce electricity, in millions of Btus (MMBtu), year to date.

net_generation_mwh

Net electricity generation for the specified period in megawatt-hours (MWh).

data_maturity

Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.

core_eia923__monthly_generation_fuel_nuclear

package: pudl

Monthly time series of generation and fuel consumption of nuclear generation units by fuel and prime mover.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

EIA Form 923 -- Power Plant Operations Report (Schedule 3)

Primary key:

plant_id_eia, report_date, nuclear_unit_id, energy_source_code, prime_mover_code

Usage Warnings

  • Date column arbitrarily uses the first of the month.

  • A small number of respondents only report annual fuel consumption, and all of it is reported in December.

Additional Details

This data is originally reported alongside similar information for fossil fuel plants, but the nuclear data is reported by (nuclear) generation unit rather than fuel type and prime mover, and so has a different primary key.

Columns
plant_id_eia

The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.

report_date

Date reported.

nuclear_unit_id

For nuclear plants only, the unit number. One-digit numeric. Nuclear plants are the only type of plant for which data are shown explicitly at the generating unit level.

energy_source_code

A 2-3 letter code indicating the energy source (e.g. fuel type) associated with the record.

fuel_type_code_pudl

Simplified fuel type code used in PUDL

fuel_type_code_agg

A partial aggregation of the reported fuel type codes into larger categories used by EIA in, for example, the Annual Energy Review (AER) or Monthly Energy Review (MER). Two or three letter alphanumeric.

prime_mover_code

Code for the type of prime mover (e.g. CT, CG)

fuel_consumed_units

Consumption of the fuel type in physical units. Note: this is the total quantity consumed for both electricity and, in the case of combined heat and power plants, process steam production.

fuel_consumed_for_electricity_units

Consumption of the fuel type for electricity generation, in physical units.

fuel_mmbtu_per_unit

Heat content of the fuel in millions of Btus per physical unit.

fuel_consumed_mmbtu

Total consumption of fuel in millions of Btus (MMBtu), year to date. Note: this is the total quantity consumed for both electricity and, in the case of combined heat and power plants, process steam production.

fuel_consumed_for_electricity_mmbtu

Total consumption of fuel to produce electricity, in millions of Btus (MMBtu), year to date.

net_generation_mwh

Net electricity generation for the specified period in megawatt-hours (MWh).

data_maturity

Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.

core_eia930__hourly_interchange

package: pudl

Hourly time series of balancing authority interchange.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

EIA Form 930 -- Hourly and Daily Balancing Authority Operations Report

Primary key:

datetime_utc, balancing_authority_code_eia, balancing_authority_code_adjacent_eia

Additional Details

Interchange is the net metered tie line flow from one BA to another directly interconnected BA. Total net interchange is the net sum of all interchange occurring between a BA and its directly interconnected neighboring BAs. Negative interchange values indicate net inflows, and positive interchange values indicate net outflows.
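The sign convention and the definition of total net interchange can be sketched as follows. The balancing authority codes BA1, BA2 and BA3 are placeholders rather than real EIA codes, and the helper name total_net_interchange is hypothetical.

```python
from collections import defaultdict


def total_net_interchange(rows):
    """Sum pairwise interchange over all adjacent BAs, per BA and hour."""
    totals = defaultdict(float)
    for r in rows:
        if r["interchange_reported_mwh"] is None:
            continue
        key = (r["datetime_utc"], r["balancing_authority_code_eia"])
        totals[key] += r["interchange_reported_mwh"]
    return dict(totals)


def flow_direction(mwh):
    """Negative values are net inflows, positive values are net outflows."""
    if mwh < 0:
        return "net inflow"
    if mwh > 0:
        return "net outflow"
    return "balanced"


HOUR = "2023-01-01T01:00:00"
rows = [
    {"datetime_utc": HOUR, "balancing_authority_code_eia": "BA1",
     "balancing_authority_code_adjacent_eia": "BA2", "interchange_reported_mwh": 120.0},
    {"datetime_utc": HOUR, "balancing_authority_code_eia": "BA1",
     "balancing_authority_code_adjacent_eia": "BA3", "interchange_reported_mwh": -200.0},
    {"datetime_utc": HOUR, "balancing_authority_code_eia": "BA2",
     "balancing_authority_code_adjacent_eia": "BA1", "interchange_reported_mwh": -120.0},
]

totals = total_net_interchange(rows)
ba1_direction = flow_direction(totals[(HOUR, "BA1")])
```

In this invented hour, BA1 exports 120 MWh to BA2 and imports 200 MWh from BA3, so its total net interchange is negative, meaning a net inflow.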

Columns
datetime_utc

Timestamp at the end of the hour for which the data is reported.
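Because the timestamp marks the end of the hour, grouping by calendar month or day can misplace the last hour of each period. A minimal sketch of shifting to the start of the interval before grouping, where the helper name hour_start is hypothetical:

```python
from datetime import datetime, timedelta, timezone


def hour_start(datetime_utc_end):
    """Return the start of the one-hour interval that ends at the given timestamp."""
    return datetime_utc_end - timedelta(hours=1)


# An hour-ending timestamp that falls exactly on a month boundary.
end = datetime(2023, 2, 1, 0, 0, tzinfo=timezone.utc)
start = hour_start(end)

# The record covers 23:00 to 24:00 UTC on 31 January, so in UTC terms
# it belongs to January even though its timestamp is in February.
month_for_grouping = (start.year, start.month)
```

Whether to group by interval start or interval end is an analysis choice. The point is only to make the choice explicit.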

balancing_authority_code_eia

EIA short code identifying a balancing authority. May include Canadian and Mexican BAs.

balancing_authority_code_adjacent_eia

EIA short code for the other adjacent balancing authority, with which interchange is occurring. Includes Canadian and Mexican BAs.

interchange_reported_mwh

Original reported energy interchange between adjacent balancing authorities.

core_eia930__hourly_net_generation_by_energy_source

package: pudl

Hourly time series of balancing authority net generation by energy source.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

EIA Form 930 -- Hourly and Daily Balancing Authority Operations Report

Primary key:

datetime_utc, balancing_authority_code_eia, generation_energy_source

Usage Warnings

  • Contains known calculation discrepancies.

Additional Details

BAs report generation from dual-fuel (switching from one fuel to another) and multiple-fuel (using multiple fuels simultaneously) generators under the actual energy source used, if known, and under the generator's primary energy source, if not known.

To maintain generator confidentiality, generation may sometimes be reported in the Other category if too few generators are reported for a particular energy source category.

In theory, the sum of net generation across all energy sources should equal the total net generation reported in the balancing authority operations table. In practice, there are many cases in which these values diverge significantly, and these cases require further investigation.
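One way to locate such discrepancies is to sum the by-source values and compare them with the totals in core_eia930__hourly_operations. The sketch below assumes both tables are loaded as lists of dictionaries. The tolerance, the placeholder code BA1, and the helper name source_vs_total_gaps are hypothetical.

```python
from collections import defaultdict


def source_vs_total_gaps(by_source_rows, operations_rows, tolerance_mwh=1.0):
    """Return {(datetime_utc, BA): difference} where the sum by source misses the total."""
    summed = defaultdict(float)
    for r in by_source_rows:
        value = r["net_generation_reported_mwh"]
        if value is not None:
            summed[(r["datetime_utc"], r["balancing_authority_code_eia"])] += value

    gaps = {}
    for r in operations_rows:
        key = (r["datetime_utc"], r["balancing_authority_code_eia"])
        total = r["net_generation_reported_mwh"]
        # Skip hours that cannot be compared at all.
        if total is None or key not in summed:
            continue
        diff = summed[key] - total
        if abs(diff) > tolerance_mwh:
            gaps[key] = diff
    return gaps


H1, H2 = "2023-01-01T01:00:00", "2023-01-01T02:00:00"
by_source_rows = [
    {"datetime_utc": H1, "balancing_authority_code_eia": "BA1",
     "generation_energy_source": "gas", "net_generation_reported_mwh": 300.0},
    {"datetime_utc": H1, "balancing_authority_code_eia": "BA1",
     "generation_energy_source": "coal", "net_generation_reported_mwh": 700.0},
    {"datetime_utc": H2, "balancing_authority_code_eia": "BA1",
     "generation_energy_source": "gas", "net_generation_reported_mwh": 300.0},
    {"datetime_utc": H2, "balancing_authority_code_eia": "BA1",
     "generation_energy_source": "coal", "net_generation_reported_mwh": 600.0},
]
operations_rows = [
    {"datetime_utc": H1, "balancing_authority_code_eia": "BA1",
     "net_generation_reported_mwh": 1000.0},
    {"datetime_utc": H2, "balancing_authority_code_eia": "BA1",
     "net_generation_reported_mwh": 1000.0},
]

gaps = source_vs_total_gaps(by_source_rows, operations_rows)
```

Only the second invented hour is flagged, with the by-source sum falling 100 MWh short of the reported total.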

Columns
datetime_utc

Timestamp at the end of the hour for which the data is reported.

balancing_authority_code_eia

EIA short code identifying a balancing authority. May include Canadian and Mexican BAs.

generation_energy_source

High level energy source used to produce electricity.

net_generation_reported_mwh

Unaltered originally reported net generation for the specified period.

net_generation_adjusted_mwh

Reported net generation adjusted by EIA to reflect non-physical commercial transfers through pseudo-ties and dynamic scheduling.

net_generation_imputed_eia_mwh

Reported net generation with outlying values removed and missing values imputed by EIA.

core_eia930__hourly_operations

package: pudl

Hourly time series of balancing authority net generation, interchange, and demand.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

EIA Form 930 -- Hourly and Daily Balancing Authority Operations Report

Primary key:

datetime_utc, balancing_authority_code_eia

Additional Details

Net generation represents the metered output of electric generating units in a BA's electric system. This generation only includes generating units that are managed by a BA or whose operations are visible to a BA.

Generators on the distribution system—both large-scale resources and small-scale distributed resources, such as rooftop solar photovoltaic (PV) systems—are typically not included.

In some electricity publications, EIA reports generation from all utility-scale generating units in the United States. BAs meter only a subset of those utility-scale generating units. As a result, when hourly generation from the EIA-930 is aggregated to monthly or annual levels, the results will be lower than monthly and annual aggregations in other EIA electricity publications.

Interchange is the net metered tie line flow from one BA to another directly interconnected BA. This table includes the net sum of all interchange occurring between a BA and all of its directly interconnected neighboring BAs. For a detailed breakdown of interchange between each pair of adjacent BAs see core_eia930__hourly_interchange.

Negative interchange values indicate net inflows, and positive interchange values indicate net outflows.

Demand is a calculated value representing the amount of electricity load within a BA's electric system. A BA derives its demand value by taking the total metered net electricity generation within its electric system and subtracting the total metered net electricity interchange occurring between the BA and its neighboring BAs.
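The demand relationship can be written out as a small sketch. The numbers are invented and the helper name derived_demand_mwh is hypothetical. It follows the sign convention above, in which negative interchange is a net inflow.

```python
def derived_demand_mwh(net_generation_mwh, interchange_mwh):
    """Demand as net generation minus total net interchange."""
    if net_generation_mwh is None or interchange_mwh is None:
        return None
    return net_generation_mwh - interchange_mwh


# A BA that generates 1,000 MWh and imports 200 MWh (interchange = -200)
# serves 1,200 MWh of load.
importing_demand = derived_demand_mwh(1000.0, -200.0)

# A BA that generates 1,000 MWh and exports 150 MWh (interchange = +150)
# serves 850 MWh of load.
exporting_demand = derived_demand_mwh(1000.0, 150.0)
```

Comparing this derived value with demand_reported_mwh on real data is a simple consistency check, although reported values may not match exactly.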

Each BA produces a day-ahead electricity demand forecast for every hour of the next day. These forecasts help BAs plan for and coordinate the reliable operation of their electric system.

Columns
datetime_utc

Timestamp at the end of the hour for which the data is reported.

balancing_authority_code_eia

EIA short code identifying a balancing authority. May include Canadian and Mexican BAs.

net_generation_reported_mwh

Unaltered originally reported net generation for the specified period.

net_generation_adjusted_mwh

Reported net generation adjusted by EIA to reflect non-physical commercial transfers through pseudo-ties and dynamic scheduling.

net_generation_imputed_eia_mwh

Reported net generation with outlying values removed and missing values imputed by EIA.

interchange_reported_mwh

Original reported energy interchange between adjacent balancing authorities.

interchange_adjusted_mwh

Energy interchange between adjacent balancing authorities, adjusted by EIA to reflect non-physical commercial transfers through pseudo-ties and dynamic scheduling.

interchange_imputed_eia_mwh

Energy interchange between adjacent balancing authorities, with outliers and missing values imputed by EIA.

demand_reported_mwh

Originally reported electricity demand, calculated by taking the net generation within the BA and subtracting the interchange with adjacent BAs.

demand_adjusted_mwh

Electricity demand adjusted by EIA to reflect non-physical commercial transfers through pseudo-ties and dynamic scheduling.

demand_imputed_eia_mwh

Electricity demand calculated by subtracting BA interchange from net generation, with outliers and missing values imputed by EIA.

demand_forecast_mwh

Day-ahead demand forecast.

core_eia930__hourly_subregion_demand

package: pudl

Hourly time series of balancing authority subregion demand.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

EIA Form 930 -- Hourly and Daily Balancing Authority Operations Report

Primary key:

datetime_utc, balancing_authority_code_eia, balancing_authority_subregion_code_eia

Additional Details

For select BAs, balancing authority subregion demand provides a further level of geographic disaggregation (for example, load zones, weather zones, operating company areas, local BAs, etc.) within a BA's electric system.

Columns
datetime_utc

Timestamp at the end of the hour for which the data is reported.

balancing_authority_code_eia

EIA short code identifying a balancing authority. May include Canadian and Mexican BAs.

balancing_authority_subregion_code_eia

Code identifying subregions of larger balancing authorities.

demand_reported_mwh

Originally reported electricity demand for the balancing authority subregion. Note that different BAs have different methods of calculating and allocating subregion demand.

core_eia__codes_averaging_periods

package: pudl

Code table containing descriptions of categorical codes for the averaging period specified by emissions statutes and regulations for the EIA 860.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

EIA -- Mix of multiple EIA Forms

Primary key:

code

Columns
code

Originally reported short code.

label

Longer human-readable code using snake_case

description

Long human-readable description of the meaning of a code/label.
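Code tables with this code, label and description layout can be used as simple lookups when decoding a data table. The sketch below uses placeholder codes and labels, not real EIA values, and the helper names build_code_lookup and decode are hypothetical.

```python
def build_code_lookup(code_rows):
    """Index a code table by its primary key, the code column."""
    return {r["code"]: r for r in code_rows}


def decode(lookup, code, field="label"):
    """Return the label (or another field) for a code, or None if it is unknown."""
    row = lookup.get(code)
    return row[field] if row is not None else None


# Placeholder rows; real code tables contain EIA-defined codes.
code_rows = [
    {"code": "X1", "label": "example_label_one",
     "description": "Placeholder description one."},
    {"code": "X2", "label": "example_label_two",
     "description": "Placeholder description two."},
]

lookup = build_code_lookup(code_rows)
label = decode(lookup, "X1")
description = decode(lookup, "X2", field="description")
unknown = decode(lookup, "ZZ")
```

Returning None for unknown codes keeps unmatched values visible instead of silently dropping rows, which is what an inner join would do.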

core_eia__codes_balancing_authorities

package: pudl

Code table containing descriptions of categorical codes for balancing authorities in EIA 860, EIA 923, and EIA 930.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

EIA -- Mix of multiple EIA Forms

Primary key:

code

Columns
code

Originally reported short code.

label

Longer human-readable code using snake_case

description

Long human-readable description of the meaning of a code/label.

balancing_authority_region_code_eia

EIA balancing authority region code.

balancing_authority_region_name_eia

Human-readable name of the EIA balancing region.

report_timezone

Timezone used by the reporting entity. For use in localizing UTC times.

balancing_authority_retirement_date

Date on which the balancing authority ceased independent operation.

is_generation_only

Indicates whether the balancing authority is generation-only, meaning it does not serve retail customers and thus reports only net generation and interchange, but not demand.

interconnect_code_eia

EIA interconnect code.

core_eia__codes_balancing_authority_subregions

package: pudl

Code table containing descriptions of categorical codes for balancing authority subregions in EIA 930.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

EIA -- Mix of multiple EIA Forms

Primary key:

balancing_authority_code_eia, balancing_authority_subregion_code_eia

Columns
balancing_authority_code_eia

EIA short code identifying a balancing authority. May include Canadian and Mexican BAs.

balancing_authority_subregion_code_eia

Code identifying subregions of larger balancing authorities.

balancing_authority_subregion_name_eia

Name of the balancing authority subregion.

core_eia__codes_boiler_generator_assn_types

package: pudl

Code table containing descriptions of categorical codes for boiler-generator associations in the EIA 860.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

EIA -- Mix of multiple EIA Forms

Primary key:

code

Columns
code

Originally reported short code.

label

Longer human-readable code using snake_case

description

Long human-readable description of the meaning of a code/label.

core_eia__codes_boiler_status

package: pudl

Code table containing descriptions of categorical codes for boiler status in the EIA 860.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

EIA -- Mix of multiple EIA Forms

Primary key:

code

Columns
code

Originally reported short code.

label

Longer human-readable code using snake_case

description

Long human-readable description of the meaning of a code/label.

core_eia__codes_boiler_types

package: pudl

Code table containing descriptions of categorical codes for boiler regulatory types in the EIA 860.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

EIA -- Mix of multiple EIA Forms

Primary key:

code

Columns
code

Originally reported short code.

label

Longer human-readable code using snake_case

description

Long human-readable description of the meaning of a code/label.

core_eia__codes_coalmine_types

package: pudl

Code table containing descriptions of categorical codes for coalmines reported as fuel sources in the EIA-923.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

EIA -- Mix of multiple EIA Forms

Primary key:

code

Columns
code

Originally reported short code.

label

Longer human-readable code using snake_case

description

Long human-readable description of the meaning of a code/label.

core_eia__codes_contract_types

package: pudl

Code table containing descriptions of categorical codes for fuel supply contracts reported in EIA-923.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

EIA -- Mix of multiple EIA Forms

Primary key:

code

Columns
code

Originally reported short code.

label

Longer human-readable code using snake_case

description

Long human-readable description of the meaning of a code/label.

core_eia__codes_cooling_system_types

package: pudl

Code table containing descriptions of categorical codes for cooling systems.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

EIA -- Mix of multiple EIA Forms

Primary key:

code

Usage Warnings

  • Contains information from multiple raw inputs.

Columns
code

Originally reported short code.

label

Longer human-readable code using snake_case

description

Long human-readable description of the meaning of a code/label.

core_eia__codes_cooling_tower_types

package: pudl

Code table containing descriptions of categorical codes for cooling towers.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

EIA -- Mix of multiple EIA Forms

Primary key:

code

Columns
code

Originally reported short code.

label

Longer human-readable code using snake_case

description

Long human-readable description of the meaning of a code/label.

core_eia__codes_cooling_water_sources

package: pudl

Code table containing descriptions of categorical codes for cooling water sources in EIA-860.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

EIA -- Mix of multiple EIA Forms

Primary key:

code

Columns
code

Originally reported short code.

label

Longer human-readable code using snake_case

description

Long human-readable description of the meaning of a code/label.

core_eia__codes_cooling_water_types

package: pudl

Code table containing descriptions of categorical codes for cooling water.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

EIA -- Mix of multiple EIA Forms

Primary key:

code

Columns
code

Originally reported short code.

label

Longer human-readable code using snake_case

description

Long human-readable description of the meaning of a code/label.

core_eia__codes_emission_control_equipment_types

package: pudl

Code table containing descriptions of categorical codes for emissions control equipment installed on a boiler.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

EIA -- Mix of multiple EIA Forms

Primary key:

code

Columns
code

Originally reported short code.

label

Longer human-readable code using snake_case

description

Long human-readable description of the meaning of a code/label.

core_eia__codes_energy_sources

package: pudl

Code table containing descriptions of categorical codes for energy sources reported to EIA.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

EIA -- Mix of multiple EIA Forms

Primary key:

code

Usage Warnings

  • Contains information from multiple raw inputs.

Additional Details

Compiled from EIA-860 instructions and EIA-923 file layout spreadsheets.

Columns
code

Originally reported short code.

label

Longer human-readable code using snake_case

fuel_units

Reported unit of measure for fuel.

min_fuel_mmbtu_per_unit

Minimum heat content per physical unit of fuel in MMBtu.

max_fuel_mmbtu_per_unit

Maximum heat content per physical unit of fuel in MMBtu.
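One plausible use of the minimum and maximum heat content columns is a sanity check on reported fuel_mmbtu_per_unit values. This use is an assumption for illustration, not a documented PUDL procedure. The code X1, its bounds, and the helper name heat_content_in_range are hypothetical.

```python
def heat_content_in_range(fuel_mmbtu_per_unit, source_row):
    """True if the reported heat content lies within the code table's bounds."""
    low = source_row["min_fuel_mmbtu_per_unit"]
    high = source_row["max_fuel_mmbtu_per_unit"]
    if fuel_mmbtu_per_unit is None or low is None or high is None:
        return None
    return low <= fuel_mmbtu_per_unit <= high


# Placeholder energy source row with invented bounds.
source_row = {
    "code": "X1",
    "min_fuel_mmbtu_per_unit": 10.0,
    "max_fuel_mmbtu_per_unit": 30.0,
}

plausible = heat_content_in_range(20.0, source_row)
suspicious = heat_content_in_range(45.0, source_row)
missing = heat_content_in_range(None, source_row)
```

Values outside the bounds are candidates for review rather than definite errors, since units of measure can differ between reports.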

fuel_group_eia

High level fuel group defined in the 2021-2023 EIA Form 860 instructions, Table 28.

fuel_derived_from

Original fuel from which this refined fuel was derived.

fuel_phase

Physical phase of matter of the fuel.

fuel_type_code_pudl

Simplified fuel type code used in PUDL

description

Long human-readable description of the meaning of a code/label.

core_eia__codes_environmental_equipment_manufacturers

package: pudl

Code table containing descriptions of categorical codes for manufacturers of boilers and environmental control equipment in the EIA-860.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

EIA -- Mix of multiple EIA Forms

Primary key:

code

Columns
code

Originally reported short code.

label

Longer human-readable code using snake_case

description

Long human-readable description of the meaning of a code/label.

core_eia__codes_firing_types

package: pudl

Code table containing descriptions of categorical codes for boiler firing types in the EIA-860.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

EIA -- Mix of multiple EIA Forms

Primary key:

code

Columns
code

Originally reported short code.

label

Longer human-readable code using snake_case

description

Long human-readable description of the meaning of a code/label.

core_eia__codes_fuel_transportation_modes

package: pudl

Code table containing descriptions of categorical codes for fuel transportation modes reported in the EIA-860 and EIA-923.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

EIA -- Mix of multiple EIA Forms

Primary key:

code

Usage Warnings

  • Contains information from multiple raw inputs.

Columns
code

Originally reported short code.

label

Longer human-readable code using snake_case

description

Long human-readable description of the meaning of a code/label.

core_eia__codes_fuel_types_agg

package: pudl

Code table containing descriptions of categorical codes for aggregated fuel types used in the Annual Energy Review or Monthly Energy Review.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

EIA -- Mix of multiple EIA Forms

Primary key:

code

Additional Details

See EIA-923 Fuel Code table for additional information.

Columns
code

Originally reported short code.

description

Long human-readable description of the meaning of a code/label.

core_eia__codes_mercury_compliance_strategies

package: pudl

Code table containing descriptions of categorical codes for compliance strategies used to control mercury in the EIA-860.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

EIA -- Mix of multiple EIA Forms

Primary key:

code

Columns
code

Originally reported short code.

label

Longer human-readable code using snake_case

description

Long human-readable description of the meaning of a code/label.

core_eia__codes_momentary_interruptions

package: pudl

Code table containing descriptions of categorical codes for utility definitions of momentary service interruptions.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

EIA -- Mix of multiple EIA Forms

Primary key:

code

Columns
code

Originally reported short code.

label

Longer human-readable code using snake_case

description

Long human-readable description of the meaning of a code/label.

core_eia__codes_nox_compliance_strategies

package: pudl

Code table containing descriptions of categorical codes for compliance strategies used to control nitrogen oxide in the EIA-860.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

EIA -- Mix of multiple EIA Forms

Primary key:

code

Columns
code

Originally reported short code.

label

Longer human-readable code using snake_case

description

Long human-readable description of the meaning of a code/label.

core_eia__codes_nox_control_status

package: pudl

Code table containing descriptions of categorical codes for the operational status of nitrogen oxide control units associated with boilers in the EIA-860 data.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

EIA -- Mix of multiple EIA Forms

Primary key:

code

Columns
code

Originally reported short code.

label

Longer human-readable code using snake_case

description

Long human-readable description of the meaning of a code/label.

core_eia__codes_nox_units

package: pudl

Code table containing descriptions of categorical codes for units of measurement for nitrogen oxide in the EIA-860.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

EIA -- Mix of multiple EIA Forms

Primary key:

code

Columns
code

Originally reported short code.

label

Longer human-readable code using snake_case

description

Long human-readable description of the meaning of a code/label.

core_eia__codes_operational_status

package: pudl

Code table containing descriptions of categorical codes for operational status reported to EIA.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

EIA -- Mix of multiple EIA Forms

Primary key:

code

Usage Warnings

  • Contains information from multiple raw inputs.

Additional Details

Compiled from EIA-860 instructions and EIA-923 file layout spreadsheets.

Columns
code

Originally reported short code.

label

Longer human-readable code using snake_case

description

Long human-readable description of the meaning of a code/label.

operational_status

The operating status of the asset. For generators, this is based on the tab of the EIA 860 in which the generator was listed.

core_eia__codes_particulate_compliance_strategies

package: pudl

Code table containing descriptions of categorical codes for compliance strategies used to control particulate matter in the EIA-860.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

EIA -- Mix of multiple EIA Forms

Primary key:

code

Columns
code

Originally reported short code.

label

Longer human-readable code using snake_case

description

Long human-readable description of the meaning of a code/label.

core_eia__codes_particulate_units

package: pudl

Code table containing descriptions of categorical codes for units of measurement for particulate matter in the EIA-860.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

EIA -- Mix of multiple EIA Forms

Primary key:

code

Columns
code

Originally reported short code.

label

Longer human-readable code using snake_case

description

Long human-readable description of the meaning of a code/label.

core_eia__codes_prime_movers

package: pudl

Code table containing descriptions of categorical codes for prime movers reported in the EIA-860 and EIA-923.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

EIA -- Mix of multiple EIA Forms

Primary key:

code

Usage Warnings

  • Contains information from multiple raw inputs.

Columns
code

Originally reported short code.

label

Longer human-readable code using snake_case

description

Long human-readable description of the meaning of a code/label.

core_eia__codes_regulations

package: pudl

Code table containing descriptions of categorical codes for levels of statutes and codes under which boilers operate in the EIA-860.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

EIA -- Mix of multiple EIA Forms

Primary key:

code

Columns
code

Originally reported short code.

label

Longer human-readable code using snake_case

description

Long human-readable description of the meaning of a code/label.

core_eia__codes_reporting_frequencies

package: pudl

Code table containing descriptions of categorical codes for the reporting frequencies used by plants in the EIA-923.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

EIA -- Mix of multiple EIA Forms

Primary key:

code

Columns
code

Originally reported short code.

label

Longer human-readable code using snake_case

description

Long human-readable description of the meaning of a code/label.

core_eia__codes_sector_consolidated

package: pudl

Code table containing descriptions of categorical codes for EIA consolidated NAICS sectors.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

EIA -- Mix of multiple EIA Forms

Primary key:

code

Usage Warnings

  • Contains information from multiple raw inputs.

Additional Details

Codes and descriptions taken from the EIA-923 File Layout spreadsheet.

Columns
code

Originally reported short code.

label

Longer human-readable code using snake_case

description

Long human-readable description of the meaning of a code/label.

core_eia__codes_so2_compliance_strategies

package: pudl

Code table containing descriptions of categorical codes for compliance strategies used to control sulfur dioxide in the EIA-860.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

EIA -- Mix of multiple EIA Forms

Primary key:

code

Columns
code

Originally reported short code.

label

Longer human-readable code using snake_case

description

Long human-readable description of the meaning of a code/label.

core_eia__codes_so2_units

package: pudl

Code table containing descriptions of categorical codes for units of measurement for sulfur dioxide in the EIA-860.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

EIA -- Mix of multiple EIA Forms

Primary key:

code

Columns
code

Originally reported short code.

label

Longer human-readable code using snake_case

description

Long human-readable description of the meaning of a code/label.

core_eia__codes_sorbent_types

package: pudl

Code table containing descriptions of categorical codes for flue gas desulfurization sorbent types used in EIA-860.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

EIA -- Mix of multiple EIA Forms

Primary key:

code

Columns
code

Originally reported short code.

label

Longer human-readable code using snake_case

description

Long human-readable description of the meaning of a code/label.

core_eia__codes_steam_plant_types

package: pudl

Code table containing descriptions of categorical codes for steam plants in the EIA-860.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

EIA -- Mix of multiple EIA Forms

Primary key:

code

Columns
code

Originally reported short code.

label

Longer human-readable code using snake_case

description

Long human-readable description of the meaning of a code/label.

core_eia__codes_storage_enclosure_types

package: pudl

Code table containing descriptions of categorical codes for energy storage enclosures.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

EIA -- Mix of multiple EIA Forms

Primary key:

code

Columns
code

Originally reported short code.

label

Longer human-readable code using snake_case

description

Long human-readable description of the meaning of a code/label.

core_eia__codes_storage_technology_types

package: pudl

Code table containing descriptions of categorical codes for energy storage technologies.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

EIA -- Mix of multiple EIA Forms

Primary key:

code

Columns
code

Originally reported short code.

label

Longer human-readable code using snake_case

description

Long human-readable description of the meaning of a code/label.

core_eia__codes_wet_dry_bottom

package: pudl

Code table containing descriptions of categorical codes for boiler bottoms in the EIA-860.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

EIA -- Mix of multiple EIA Forms

Primary key:

code

Columns
code

Originally reported short code.

label

Longer human-readable code using snake_case

description

Long human-readable description of the meaning of a code/label.

core_eia__codes_wind_quality_class

package: pudl

Code table containing descriptions of categorical codes for wind quality classes.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

EIA -- Mix of multiple EIA Forms

Primary key:

code

Columns
code

Originally reported short code.

label

Longer human-readable code using snake_case

description

Long human-readable description of the meaning of a code/label.

wind_speed_avg_ms

Average wind speed in meters per second.

extreme_fifty_year_gust_ms

The extreme 50-year wind gusts at this generator in meters per second.

turbulence_intensity_a

The upper bounds of the turbulence intensity at the wind site (ratio of standard deviation of fluctuating wind velocity to the mean wind speed).

turbulence_intensity_b

The lower bounds of the turbulence intensity at the wind site (ratio of standard deviation of fluctuating wind velocity to the mean wind speed).
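The ratio named in the two turbulence intensity descriptions can be illustrated with a minimal sketch. The wind speed samples are made up, and the use of a population standard deviation is an assumption; the source does not say which estimator applies.

```python
from statistics import mean, pstdev


def turbulence_intensity(wind_speeds_ms):
    """Ratio of the standard deviation of wind speed to the mean wind speed."""
    avg = mean(wind_speeds_ms)
    if avg == 0:
        raise ValueError("mean wind speed is zero; the ratio is undefined")
    # Population standard deviation is an assumption made for illustration.
    return pstdev(wind_speeds_ms) / avg


# Hypothetical wind speed samples in meters per second.
samples = [8.0, 10.0, 12.0, 10.0]
ti = turbulence_intensity(samples)
```

The result is dimensionless, which is why the turbulence_intensity_a and turbulence_intensity_b columns carry no unit suffix.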

core_eia__entity_boilers

package: pudl

Entity table containing static information about boilers compiled from the EIA-860 and EIA-923.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

EIA -- Mix of multiple EIA Forms

Primary key:

plant_id_eia, boiler_id

Usage Warnings

  • Data has been drawn from several EIA sources which are not always consistent with each other, and PUDL chooses the most consistent or relevant value to facilitate cross-referencing even if that means some values will differ from the raw sources. See Harvesting for details.

  • Contains information from multiple raw inputs.

Additional Details

This is one of two tables where canonical values for boilers are set. It contains values which are expected to remain fixed, while core_eia860__scd_boilers contains those which may vary from year to year. EIA reports many attributes in many different tables across EIA-860 and EIA-923. In order to compile tidy, well-normalized database tables, PUDL collects all instances of these values and chooses a canonical value. By default, PUDL chooses the most consistently reported value of a given attribute as long as it accounts for at least 70% of the reported instances. If an attribute was reported inconsistently across the original EIA tables, then it will show up as a null value. All tables downstream of this one inherit the canonical values established here.
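The 70% rule described above can be sketched in a few lines. This is an illustration under stated assumptions, not PUDL's actual harvesting code: the function name canonical_value is hypothetical, and excluding nulls from the denominator is an assumption.

```python
from collections import Counter


def canonical_value(reported, threshold=0.7):
    """Pick the most common value if it is reported consistently enough.

    Returns None (a null) when no value reaches the threshold share.
    Assumption: null reports are ignored when computing the share.
    """
    values = [v for v in reported if v is not None]
    if not values:
        return None
    value, count = Counter(values).most_common(1)[0]
    return value if count / len(values) >= threshold else None


# Hypothetical manufacturer names reported for one boiler across tables.
consistent = canonical_value(["Acme"] * 8 + ["ACME Corp"] * 2)
inconsistent = canonical_value(["Acme", "Acme", "Acme", "Other", "Other"])
```

In the first case the leading value holds 80% of the reports and is kept; in the second it holds only 60%, so the attribute would come out null.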

Columns
plant_id_eia

The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.

boiler_id

Alphanumeric boiler ID.

boiler_manufacturer

Name of boiler manufacturer.

boiler_manufacturer_code

EIA short code for boiler manufacturer.

core_eia__entity_generators

package: pudl

Entity table containing static information about generators compiled from across the EIA-860 and EIA-923.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

EIA -- Mix of multiple EIA Forms

Primary key:

plant_id_eia, generator_id

Usage Warnings

  • Data has been drawn from several EIA sources which are not always consistent with each other, and PUDL chooses the most consistent or relevant value to facilitate cross-referencing even if that means some values will differ from the raw sources. See Harvesting for details.

  • Contains information from multiple raw inputs.

Additional Details

This is one of two tables where canonical values for generators are set. It contains values which are expected to remain fixed, while core_eia860__scd_generators contains those which may vary from year to year. EIA reports many attributes in many different tables across EIA-860 and EIA-923. In order to compile tidy, well-normalized database tables, PUDL collects all instances of these values and chooses a canonical value. By default, PUDL chooses the most consistently reported value of a given attribute as long as it accounts for at least 70% of the reported instances. If an attribute was reported inconsistently across the original EIA tables, then it will show up as a null value. All tables downstream of this one inherit the canonical values established here.

Columns
plant_id_eia

The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.

generator_id

Generator ID is usually numeric, but sometimes includes letters. Make sure you treat it as a string!

duct_burners

Indicates whether the unit has duct-burners for supplementary firing of the turbine exhaust gas

generator_operating_date

Date the generator began commercial operation. If harvested values are inconsistent, we default to using the most recently reported date.

topping_bottoming_code

If the generator is associated with a combined heat and power system, indicates whether the generator is part of a topping cycle or a bottoming cycle

solid_fuel_gasification

Indicates whether the generator is part of a solid fuel gasification system

pulverized_coal_tech

Indicates whether the generator uses pulverized coal technology

fluidized_bed_tech

Indicates whether the generator uses fluidized bed technology

subcritical_tech

Indicates whether the generator uses subcritical technology

supercritical_tech

Indicates whether the generator uses supercritical technology

ultrasupercritical_tech

Indicates whether the generator uses ultra-supercritical technology

stoker_tech

Indicates whether the generator uses stoker technology

other_combustion_tech

Indicates whether the generator uses other combustion technologies

bypass_heat_recovery

Can this generator operate while bypassing the heat recovery steam generator?

rto_iso_lmp_node_id

The designation used to identify the price node in RTO/ISO Locational Marginal Price reports

rto_iso_location_wholesale_reporting_id

The designation used to report the specific location of the wholesale sales transactions to FERC for the Electric Quarterly Report

associated_combined_heat_power

Indicates whether the generator is associated with a combined heat and power system

original_planned_generator_operating_date

The date the generator was originally scheduled to be operational

can_switch_when_operating

Indicates whether a fuel switching generator can switch fuels while operating.

previously_canceled

Indicates whether the generator was previously reported as indefinitely postponed or canceled

core_eia__entity_plants

package: pudl

Entity table containing static information about plants, compiled from across all EIA-860 and EIA-923 data.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

EIA -- Mix of multiple EIA Forms

Primary key:

plant_id_eia

Usage Warnings

  • Data has been drawn from several EIA sources which are not always consistent with each other, and PUDL chooses the most consistent or relevant value to facilitate cross-referencing even if that means some values will differ from the raw sources. See Harvesting for details.

  • Contains information from multiple raw inputs.

Additional Details

This is one of two tables where canonical values for plants are set. It contains values which are expected to remain fixed, while core_eia860__scd_plants contains those which may vary from year to year. EIA reports many attributes in many different tables across EIA-860 and EIA-923. In order to compile tidy, well-normalized database tables, PUDL collects all instances of these values and chooses a canonical value. By default, PUDL chooses the most consistently reported value of a given attribute as long as it accounts for at least 70% of the reported instances. If an attribute was reported inconsistently across the original EIA tables, then it will show up as a null value. All tables downstream of this one inherit the canonical values established here.

Columns
plant_id_eia

The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.

plant_name_eia

Plant name.

city

Name of the city.

county

County name.

latitude

Latitude of the plant's location, in degrees.

longitude

Longitude of the plant's location, in degrees.

state

Two letter US state abbreviation.

street_address

Physical street address.

zip_code

Five digit US Zip Code.

timezone

IANA timezone name

core_eia__entity_utilities

package: pudl

Entity table containing static information about utilities, compiled from all EIA data.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

EIA -- Mix of multiple EIA Forms

Primary key:

utility_id_eia

Usage Warnings

  • Data has been drawn from several EIA sources which are not always consistent with each other, and PUDL chooses the most consistent or relevant value to facilitate cross-referencing even if that means some values will differ from the raw sources. See Harvesting for details.

  • Contains information from multiple raw inputs.

Additional Details

This is one of two tables where canonical values for utilities are set. It contains values which are expected to remain fixed, while core_eia860__scd_utilities contains those which may vary from year to year. EIA reports many attributes in many different tables across EIA-860 and EIA-923. In order to compile tidy, well-normalized database tables, PUDL collects all instances of these values and chooses a canonical value. By default, PUDL chooses the most consistently reported value of a given attribute as long as it accounts for at least 70% of the reported instances. If an attribute was reported inconsistently across the original EIA tables, then it will show up as a null value. All tables downstream of this one inherit the canonical values established here.

Columns
utility_id_eia

The EIA Utility Identification number.

utility_name_eia

The name of the utility.

core_eia__yearly_fuel_receipts_costs_aggs

package: pudl

Annual time series of aggregated fuel receipts and costs from the EIA API.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

EIA -- Mix of multiple EIA Forms

Primary key:

fuel_agg, geo_agg, sector_agg, temporal_agg, report_date

Columns
fuel_agg

Category of fuel aggregation in EIA bulk electricity data.

geo_agg

Category of geographic aggregation in EIA bulk electricity data.

sector_agg

Category of sectoral aggregation in EIA bulk electricity data.

temporal_agg

Category of temporal aggregation in EIA bulk electricity data.

report_date

Date reported.

fuel_received_mmbtu

Aggregated fuel receipts, in MMBtu, in EIA bulk electricity data.

fuel_cost_per_mmbtu

Average fuel cost per mmBTU of heat content in nominal USD.

core_eiaaeo__yearly_projected_electric_sales

package: pudl

Annual time series of projected electricity sales by region and customer class.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

EIA Annual Energy Outlook (AEO)

Primary key:

report_year, electricity_market_module_region_eiaaeo, model_case_eiaaeo, projection_year, customer_class

Columns
report_year

Four-digit year in which the data was reported.

electricity_market_module_region_eiaaeo

AEO projection region.

model_case_eiaaeo

Factors such as economic growth, future oil prices, the ultimate size of domestic energy resources, and technological change are often uncertain. To illustrate some of these uncertainties, EIA runs side cases to show how the model responds to changes in key input variables compared with the Reference case. See https://www.eia.gov/outlooks/aeo/assumptions/case_descriptions.php for more details.

projection_year

The year of the projected value.

customer_class

High level categorization of customer type (e.g., commercial, residential).

sales_mwh

Quantity of electricity sold in MWh.

core_eiaaeo__yearly_projected_energy_use_by_sector_and_type

package: pudl

Annual time series of projected energy use for commercial, electric power, industrial, residential, and transportation sectors, drawn from AEO Table 2.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

EIA Annual Energy Outlook (AEO)

Primary key:

report_year, region_name_eiaaeo, model_case_eiaaeo, projection_year, energy_use_sector, energy_use_type

Usage Warnings

  • Some columns contain subtotals; use caution when choosing columns to aggregate.

Additional Details

The series in Table 2 which track energy use by sector do not always define each type of use the same way across sectors. There is detailed information about what is included or excluded in each use type for each sector in the footnotes of the EIA's online AEO data browser:

https://www.eia.gov/outlooks/aeo/data/browser/#/?id=2-AEO2023

Use caution when aggregating across use types! Energy Use has a tricky system of subtotals, and summing all types within a sector will result in double-counting. Consult the EIA's data browser for visibility into which use types are subtotals, and what they contain: subtotal series are displayed indented, and include all lines above them which are one level out, up to the next indented line. Delivered Energy and Total are special cases which include those plus all subtotals above. In this way, "Delivered Energy" includes purchased electricity, renewable energy, and an array of fuels based on sector, and explicitly excludes electricity-related losses.
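The double-counting risk can be shown with a small sketch. The rows and the energy_use_type labels below are hypothetical, and the set of subtotal types is hand-written for the example; consult the EIA data browser for the real series names.

```python
# Hypothetical rows for a single sector, region, case, and projection year.
rows = [
    {"energy_use_type": "electricity", "energy_use_mmbtu": 5.0},
    {"energy_use_type": "natural_gas", "energy_use_mmbtu": 4.0},
    {"energy_use_type": "delivered_energy", "energy_use_mmbtu": 9.0},
    {"energy_use_type": "electricity_related_losses", "energy_use_mmbtu": 6.0},
    {"energy_use_type": "total", "energy_use_mmbtu": 15.0},
]

# Hypothetical list of series that are subtotals of other rows.
SUBTOTAL_TYPES = {"delivered_energy", "total"}

# Summing every row counts the components two or three times.
naive_sum = sum(r["energy_use_mmbtu"] for r in rows)

# Dropping the subtotal series first recovers the reported total.
component_sum = sum(
    r["energy_use_mmbtu"] for r in rows if r["energy_use_type"] not in SUBTOTAL_TYPES
)
```

Here the naive sum is more than twice the component sum, which matches the "total" row.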

AEO Energy Use figures are variously referred to as delivered energy, energy consumption, energy use, and energy demand, depending on which use types are being discussed, and which org and which document is describing them. In PUDL we say energy use or energy consumption.

Columns
report_year

Four-digit year in which the data was reported.

region_name_eiaaeo

EIA AEO region for energy consumption. Includes US Census Divisions plus United States.

region_type_eiaaeo

Region type for EIA AEO energy consumption, indicating whether region_name_eiaaeo is a US Census Division or country (United States)

model_case_eiaaeo

Factors such as economic growth, future oil prices, the ultimate size of domestic energy resources, and technological change are often uncertain. To illustrate some of these uncertainties, EIA runs side cases to show how the model responds to changes in key input variables compared with the Reference case. See https://www.eia.gov/outlooks/aeo/assumptions/case_descriptions.php for more details.

projection_year

The year of the projected value.

energy_use_sector

Sector for energy use figures in AEO Table 2. Similar to customer class, but with some missing and some extra values.

energy_use_type

Type of energy use, indicating the name of the series from AEO Table 2. Includes fuels, electricity, losses, and various subtotals; consult table documentation for aggregation guidelines.

energy_use_mmbtu

Energy use, in MMBtu; also referred to as energy consumption, energy demand, or delivered energy, depending on type.

core_eiaaeo__yearly_projected_fuel_cost_in_electric_sector_by_type

package: pudl

Annual time series of projected fuel prices for the electric power sector, including electricity-only and combined-heat-and-power plants that have a regulatory status.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

EIA Annual Energy Outlook (AEO)

Primary key:

report_year, electricity_market_module_region_eiaaeo, model_case_eiaaeo, projection_year, fuel_type_eiaaeo

Columns
report_year

Four-digit year in which the data was reported.

electricity_market_module_region_eiaaeo

AEO projection region.

model_case_eiaaeo

Factors such as economic growth, future oil prices, the ultimate size of domestic energy resources, and technological change are often uncertain. To illustrate some of these uncertainties, EIA runs side cases to show how the model responds to changes in key input variables compared with the Reference case. See https://www.eia.gov/outlooks/aeo/assumptions/case_descriptions.php for more details.

projection_year

The year of the projected value.

fuel_type_eiaaeo

Fuel type reported for AEO end-use sector generation data.

fuel_cost_per_mmbtu

Average fuel cost per mmBTU of heat content in nominal USD.

fuel_cost_real_per_mmbtu_eiaaeo

Average fuel cost per mmBTU of heat content in real USD, standardized to the value of a USD in the year defined by real_cost_basis_year.

real_cost_basis_year

Four-digit year which is the basis for any 'real cost' monetary values (as opposed to nominal values).
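The relationship between the nominal and real cost columns can be sketched as a price-index rescaling. This is a generic illustration: the index values are invented, and the source does not state which deflator EIA uses.

```python
def nominal_to_real(nominal_usd, year, basis_year, price_index):
    """Express a nominal USD value in the dollars of basis_year.

    price_index maps a year to a price level; the values are assumptions.
    """
    return nominal_usd * price_index[basis_year] / price_index[year]


# Hypothetical price levels, not EIA data.
price_index = {2022: 100.0, 2030: 125.0}

# A nominal 5.00 USD per MMBtu in 2030, expressed in 2022 dollars.
real_cost = nominal_to_real(5.0, year=2030, basis_year=2022, price_index=price_index)
```

With these made-up index values, 5.00 nominal USD in 2030 corresponds to 4.00 USD in 2022 dollars.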

core_eiaaeo__yearly_projected_generation_in_electric_sector_by_technology

package: pudl

Annual time series of projected generation capacity & total generation in the electric sector, broken out by technology.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

EIA Annual Energy Outlook (AEO)

Primary key:

report_year, electricity_market_module_region_eiaaeo, model_case_eiaaeo, projection_year, technology_description_eiaaeo

Columns
report_year

Four-digit year in which the data was reported.

electricity_market_module_region_eiaaeo

AEO projection region.

model_case_eiaaeo

Factors such as economic growth, future oil prices, the ultimate size of domestic energy resources, and technological change are often uncertain. To illustrate some of these uncertainties, EIA runs side cases to show how the model responds to changes in key input variables compared with the Reference case. See https://www.eia.gov/outlooks/aeo/assumptions/case_descriptions.php for more details.

projection_year

The year of the projected value.

technology_description_eiaaeo

Generation technology reported for AEO.

summer_capacity_mw

The net summer capacity.

summer_capacity_planned_additions_mw

The total planned additions to net summer generating capacity.

summer_capacity_unplanned_additions_mw

The total unplanned additions to net summer generating capacity.

summer_capacity_retirements_mw

The total retirements from net summer generating capacity.

gross_generation_mwh

Gross electricity generation for the specified period in megawatt-hours (MWh).

core_eiaaeo__yearly_projected_generation_in_end_use_sectors_by_fuel_type

package: pudl

Annual time series of projected generation capacity and total generation in the end-use sector, broken out by fuel type.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

EIA Annual Energy Outlook (AEO)

Primary key:

report_year, electricity_market_module_region_eiaaeo, model_case_eiaaeo, projection_year, fuel_type_eiaaeo

Usage Warnings

  • The solar photovoltaic capacity portion of Renewable Sources is reported in megawatts direct current; other technologies are reported in megawatts of alternating current.

Additional Details

Includes combined-heat-and-power plants and electricity-only plants in the commercial and industrial sectors; and small on-site generating systems in the residential, commercial, and industrial sectors used primarily for own-use generation, but which may also sell some power to the grid.

Columns
report_year

Four-digit year in which the data was reported.

electricity_market_module_region_eiaaeo

AEO projection region.

model_case_eiaaeo

Factors such as economic growth, future oil prices, the ultimate size of domestic energy resources, and technological change are often uncertain. To illustrate some of these uncertainties, EIA runs side cases to show how the model responds to changes in key input variables compared with the Reference case. See https://www.eia.gov/outlooks/aeo/assumptions/case_descriptions.php for more details.

projection_year

The year of the projected value.

fuel_type_eiaaeo

Fuel type reported for AEO end-use sector generation data.

summer_capacity_mw

The net summer capacity.

gross_generation_mwh

Gross electricity generation for the specified period in megawatt-hours (MWh).

core_epa__assn_eia_epacamd

package: pudl

Association table providing connections between EPA units and EIA plants, boilers, and generators.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

EPA -- Mix of multiple EPA sources

Primary key:

This table has no primary key.

Additional Details

This crosswalk table comes from the PUDL fork of the EPA camd-eia-crosswalk Github repo: https://github.com/catalyst-cooperative/camd-eia-crosswalk-latest.

The camd-eia-crosswalk README and our Data Source documentation page on EPA CEMS (../data_sources/epacems) depict the complicated relationship between EIA and EPA data, specifically the nature of EPA vs. EIA "units" and the level of granularity at which the two sources can be connected.

The original EPA crosswalk runs on 2018 EIA data. We adapted the crosswalk code to run on each new year of EIA data, capturing changes in plant information over time.

Our version of the crosswalk clarifies some of the column names and removes unmatched rows. The pudl.etl.glue_assets.core_epa__assn_eia_epacamd function doc strings explain what changes are made from the EPA's version.

Columns
report_year

Four-digit year in which the data was reported.

plant_id_epa

The ORISPL ID used by EPA to refer to the plant. Usually but not always the same as plant_id_eia.

emissions_unit_id_epa

Emissions (smokestack) unit monitored by EPA CEMS.

generator_id_epa

Generator ID used by the EPA.

plant_id_eia

The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.

boiler_id

Alphanumeric boiler ID.

generator_id

Generator ID is usually numeric, but sometimes includes letters. Make sure you treat it as a string!
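A short sketch of why generator_id should stay a string. The IDs below are hypothetical; the point is only that numeric coercion can merge or lose IDs.

```python
import csv
import io

# Hypothetical extract; real IDs may differ.
raw = "plant_id_eia,generator_id\n3,1\n3,01\n3,GT1\n"
rows = list(csv.DictReader(io.StringIO(raw)))

# Kept as strings, all three IDs stay distinct.
as_strings = {row["generator_id"] for row in rows}


def to_int_or_none(text):
    """Mimic a numeric cast: non-numeric IDs become null."""
    try:
        return int(text)
    except ValueError:
        return None


# Cast to numbers, "1" and "01" collapse and "GT1" is lost.
as_numbers = {to_int_or_none(row["generator_id"]) for row in rows}
```

When loading these tables with a dataframe library, the equivalent step is to declare the column's type as a string explicitly instead of relying on type inference.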

core_epa__assn_eia_epacamd_subplant_ids

package: pudl

Association table providing connections between EPA units and EIA units/generators, at the subplant level.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

EPA -- Mix of multiple EPA sources

Primary key:

This table has no primary key. The primary keys would have been: plant_id_eia, generator_id, subplant_id and emissions_unit_id_epa, but there are some null records in the generator_id column. ~2 percent of all EPA CAMD records are not successfully mapped to EIA generators.
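A minimal sketch of how one might check the candidate key described above. The records are hypothetical, and the helper name key_problems is not part of PUDL.

```python
from collections import Counter

# Candidate key columns named in the text above.
CANDIDATE_KEY = ("plant_id_eia", "generator_id", "subplant_id", "emissions_unit_id_epa")


def key_problems(rows, key=CANDIDATE_KEY):
    """Return (rows with a null in any key column, key tuples that repeat)."""
    with_nulls = [row for row in rows if any(row.get(col) is None for col in key)]
    counts = Counter(tuple(row.get(col) for col in key) for row in rows)
    repeated = [combo for combo, n in counts.items() if n > 1]
    return with_nulls, repeated


# Hypothetical records, not real PUDL data.
records = [
    {"plant_id_eia": 3, "generator_id": "1", "subplant_id": 0, "emissions_unit_id_epa": "1"},
    {"plant_id_eia": 3, "generator_id": None, "subplant_id": 1, "emissions_unit_id_epa": "2"},
]
with_nulls, repeated = key_problems(records)
```

The second record has no generator_id, so it would violate a primary key constraint even though no key combination repeats.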

Additional Details

This table is an augmented version of the core_epa__assn_eia_epacamd crosswalk table which initially comes from the EPA's Github repo camd-eia-crosswalk: https://github.com/USEPA/camd-eia-crosswalk.

This table identifies subplants, the smallest coherent units for aggregation, within each plant_id. A plant_id refers to a legal entity that often contains multiple distinct power plants, even of different technology or fuel types.

EPA CEMS data combines information from several parts of a power plant:

  • emissions from smokestacks

  • fuel use from combustors

  • electricity production from generators

But smokestacks, combustors, and generators can be connected in complex, many-to-many relationships. This complexity makes attribution difficult when, for example, allocating pollution to energy producers. Furthermore, heterogeneity within plant_ids makes aggregation to the parent entity difficult or inappropriate.

This table draws on four inputs: the EPA's crosswalk, the IDs in the EPA CAMD core_epacems__hourly_emissions table itself, the core_eia860__assn_boiler_generator table, and the core_eia860__scd_generators table. The core_epa__assn_eia_epacamd table is the backbone of this table. Adding the EPA CAMD IDs ensures complete coverage of EPA CAMD reporting units, and adding the EIA-860 tables ensures complete coverage of the units reported there as well.

For more information about how this subplant_id is made, see the documentation for pudl.etl.glue_assets.make_subplant_ids and pudl.etl.glue_assets.update_subplant_ids.

By analyzing the relationships between combustors and generators, as provided in the core_epa__assn_eia_epacamd crosswalk, we can identify distinct power plants. These are the smallest coherent units of aggregation.
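One way to picture the grouping of connected combustors and generators is as connected components of a graph. The sketch below uses a small union-find over hypothetical links within a single plant. It illustrates the idea only and is not confirmed to match the logic in pudl.etl.glue_assets; the function name assign_subplants is hypothetical.

```python
def assign_subplants(links):
    """Group emissions units and generators that are linked, directly or not.

    links: iterable of (emissions_unit_id_epa, generator_id) pairs for one plant.
    Returns a dict mapping ("unit", id) or ("gen", id) to a group number.
    """
    parent = {}

    def find(node):
        parent.setdefault(node, node)
        while parent[node] != node:
            parent[node] = parent[parent[node]]  # path halving
            node = parent[node]
        return node

    for unit, gen in links:
        root_unit, root_gen = find(("unit", unit)), find(("gen", gen))
        if root_unit != root_gen:
            parent[root_gen] = root_unit

    labels = {}
    groups = {}
    for node in list(parent):
        root = find(node)
        # Number the groups in order of first appearance.
        groups[node] = labels.setdefault(root, len(labels))
    return groups


# Hypothetical many-to-many links: units 1 and 2 share generator GT1.
links = [("1", "GT1"), ("2", "GT1"), ("2", "ST1"), ("3", "GT2")]
groups = assign_subplants(links)
```

Units 1 and 2 and generators GT1 and ST1 land in one group because they are all reachable from each other, while unit 3 and GT2 form a second, separate group.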

Columns
plant_id_eia

The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.

plant_id_epa

The ORISPL ID used by EPA to refer to the plant. Usually but not always the same as plant_id_eia.

subplant_id

Sub-plant ID links EPA CEMS emissions units to EIA units.

unit_id_pudl

Dynamically assigned PUDL unit id. WARNING: This ID is not guaranteed to be static long term as the input data and algorithm may evolve over time.

emissions_unit_id_epa

Emissions (smokestack) unit monitored by EPA CEMS.

generator_id

Generator ID is usually numeric, but sometimes includes letters. Make sure you treat it as a string!

core_epacems__hourly_emissions

package: pudl

Hourly time series of emissions and plant operational data as required by 40 CFR Part 75.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

EPA Hourly Continuous Emission Monitoring System (CEMS)

Primary key:

plant_id_epa, emissions_unit_id_epa, operating_datetime_utc

Usage Warnings

  • Large table; do not attempt to open with Excel.

  • Not all IDs are present.

  • Contains information from multiple raw inputs.

Additional Details

Continuous Emissions Monitoring Systems (CEMS) are used to determine the rate of gas or particulate matter exiting a point source of emissions. The EPA Clean Air Markets Division (CAMD) has collected data on power plant emissions from CEMS units stretching back to 1995. The CEMS dataset includes hourly gross load, SO2, CO2, and NOx emissions associated with a given emissions-unit during startup, shutdown, and instances of malfunction. An EPA CEMS emissions-unit or smokestack unit is not the same as an EIA unit. See core_epa__assn_eia_epacamd for details about how to connect CEMS data to corresponding EIA units.

Only fossil-combustion units over 25 MW are required to install and use CEMS, so some units that report in EIA-860 or EIA-923 have no CEMS data.

Columns
plant_id_eia

The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.

plant_id_epa

The ORISPL ID used by EPA to refer to the plant. Usually but not always the same as plant_id_eia.

emissions_unit_id_epa

Emissions (smokestack) unit monitored by EPA CEMS.

operating_datetime_utc

Date and time measurement began (UTC).

year

Year the data was reported in, used for partitioning EPA CEMS.

state

Two letter US state abbreviation.

operating_time_hours

Length of time interval measured.

gross_load_mw

Average power in megawatts delivered during time interval measured.

heat_content_mmbtu

The energy contained in fuel burned, measured in million BTU.

steam_load_lbs

Total steam pressure produced by a unit during the reported hour.

so2_mass_lbs

Sulfur dioxide emissions in pounds.

so2_mass_measurement_code

Identifies whether the reported value of emissions was measured, calculated, or measured and substitute.

nox_mass_lbs

NOx emissions in pounds.

nox_mass_measurement_code

Identifies whether the reported value of emissions was measured, calculated, or measured and substitute.

co2_mass_tons

Carbon dioxide emissions in short tons.

co2_mass_measurement_code

Identifies whether the reported value of emissions was measured, calculated, or measured and substitute.
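
Because gross_load_mw is an average power and operating_time_hours is the length of the measured interval, their product gives an estimate of gross energy for that record. The sketch below uses that product to compute a CO2 rate. It assumes this reading of the two column descriptions, skips records with nulls, and uses made-up values; the function names are hypothetical.

```python
def gross_generation_mwh(gross_load_mw, operating_time_hours):
    """Average MW times hours operated; None if either input is missing."""
    if gross_load_mw is None or operating_time_hours is None:
        return None
    return gross_load_mw * operating_time_hours


def co2_tons_per_mwh(records):
    """Total CO2 (short tons) divided by total gross MWh across records."""
    total_co2 = 0.0
    total_mwh = 0.0
    for rec in records:
        mwh = gross_generation_mwh(rec["gross_load_mw"], rec["operating_time_hours"])
        co2 = rec["co2_mass_tons"]
        if mwh is None or co2 is None:
            continue  # skip records with missing inputs
        total_co2 += co2
        total_mwh += mwh
    return total_co2 / total_mwh if total_mwh > 0 else None


# Hypothetical hourly records for one emissions unit.
hours = [
    {"gross_load_mw": 100.0, "operating_time_hours": 1.0, "co2_mass_tons": 90.0},
    {"gross_load_mw": 80.0, "operating_time_hours": 0.5, "co2_mass_tons": 40.0},
    {"gross_load_mw": None, "operating_time_hours": 0.0, "co2_mass_tons": None},
]
rate = co2_tons_per_mwh(hours)
```

Note that this is a rate per gross MWh, which is not directly comparable to rates built on the net generation reported to EIA.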

core_ferc1__codes_power_purchase_types

package: pudl

Code table containing descriptions of categorical codes for electric power purchase types.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

FERC Form 1 -- Annual Report of Major Electric Utilities

Primary key:

code

Usage Warnings

  • FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.

Columns
code

Originally reported short code.

label

Longer human-readable code using snake_case.

description

Long human-readable description of the meaning of a code/label.

core_ferc1__yearly_balance_sheet_assets_sched110

package: pudl

Annual time series of utility assets and other debits.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

FERC Form 1 -- Annual Report of Major Electric Utilities (Schedule 110)

Primary key:

utility_id_ferc1, report_year, asset_type

Usage Warnings

  • Some columns contain subtotals; use caution when choosing columns to aggregate.

  • FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.

Columns
utility_id_ferc1

PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.

report_year

Four-digit year in which the data was reported.

utility_type

Listing of utility plant types. Examples include Electric Utility, Gas Utility, and Other Utility.

record_id

Identifier indicating original FERC Form 1 source record. format: {table_name}_{report_year}_{report_prd}_{respondent_id}_{spplmnt_num}_{row_number}. Unique within FERC Form 1 DB tables which are not row-mapped.

asset_type

Type of asset being reported to the core_ferc1__yearly_balance_sheet_assets_sched110 table.

ending_balance

Account balance at end of year.

starting_balance

Account balance at beginning of year.

ferc_account

Actual FERC Account number (e.g. '359.1') if available, or a PUDL assigned ID when FERC accounts have been split or combined in reporting.

balance

Indication of whether a column is a credit or debit, as reported in the XBRL taxonomy.

row_type_xbrl

Indicates whether the value reported in the row is calculated, or uniquely reported within the table.
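
The record_id format given above can be split back into its parts. The sketch below assumes the ID follows that documented pattern exactly and that only the table name contains extra underscores; the sample ID is invented and the helper name is hypothetical.

```python
FIELDS = ("report_year", "report_prd", "respondent_id", "spplmnt_num", "row_number")


def split_record_id(record_id):
    """Split a record_id into its documented components.

    Splitting from the right keeps underscores inside the table name intact.
    """
    table_name, *rest = record_id.rsplit("_", len(FIELDS))
    if len(rest) != len(FIELDS):
        raise ValueError(f"unexpected record_id format: {record_id!r}")
    parts = {"table_name": table_name}
    parts.update(zip(FIELDS, rest))
    return parts


# Invented ID for illustration; not taken from the real data.
example = split_record_id("f1_some_table_2020_12_123_0_4")
```

All components come back as strings; convert report_year and the other numeric parts only if you need them as numbers.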

core_ferc1__yearly_balance_sheet_liabilities_sched110

package: pudl

Annual time series of utility liabilities and other credits.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

FERC Form 1 -- Annual Report of Major Electric Utilities (Schedule 110)

Primary key:

utility_id_ferc1, report_year, liability_type

Usage Warnings

  • Some columns contain subtotals; use caution when choosing columns to aggregate.

  • FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.

Columns
record_id

Identifier indicating original FERC Form 1 source record. format: {table_name}_{report_year}_{report_prd}_{respondent_id}_{spplmnt_num}_{row_number}. Unique within FERC Form 1 DB tables which are not row-mapped.

report_year

Four-digit year in which the data was reported.

utility_id_ferc1

PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.

starting_balance

Account balance at beginning of year.

ending_balance

Account balance at end of year.

liability_type

Type of liability being reported to the core_ferc1__yearly_balance_sheet_liabilities_sched110 table.

balance

Indication of whether a column is a credit or debit, as reported in the XBRL taxonomy.

ferc_account

Actual FERC Account number (e.g. '359.1') if available, or a PUDL assigned ID when FERC accounts have been split or combined in reporting.

row_type_xbrl

Indicates whether the value reported in the row is calculated, or uniquely reported within the table.

utility_type

Listing of utility plant types. Examples include Electric Utility, Gas Utility, and Other Utility.

core_ferc1__yearly_cash_flows_sched120

package: pudl

Annual time series of utility cash flow.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

FERC Form 1 -- Annual Report of Major Electric Utilities (Schedule 120)

Primary key:

utility_id_ferc1, report_year, amount_type

Usage Warnings

  • Some columns contain subtotals; use caution when choosing columns to aggregate.

  • FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.

Columns
record_id

Identifier indicating original FERC Form 1 source record. format: {table_name}_{report_year}_{report_prd}_{respondent_id}_{spplmnt_num}_{row_number}. Unique within FERC Form 1 DB tables which are not row-mapped.

report_year

Four-digit year in which the data was reported.

utility_id_ferc1

PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.

amount_type

Label describing the type of amount being reported. This could be a balance or a change in value.

amount

Reported amount of dollars. This could be a balance or a change in value.

balance

Indication of whether a column is a credit or debit, as reported in the XBRL taxonomy.

row_type_xbrl

Indicates whether the value reported in the row is calculated, or uniquely reported within the table.

core_ferc1__yearly_depreciation_by_function_sched219

package: pudl

Annual time series of ending balances in accumulated provision for depreciation of electric utility plant.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

FERC Form 1 -- Annual Report of Major Electric Utilities (Schedule 219 - Section B)

Primary key:

utility_id_ferc1, report_year, depreciation_type, plant_function, plant_status, utility_type

Usage Warnings

  • Some columns contain subtotals; use caution when choosing columns to aggregate.

  • FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.

Additional Details

Electric utility plant refers to FERC Account 108. Section B includes the Balances at End of Year According to Functional Classification (plant_function).

Columns
utility_id_ferc1

PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.

report_year

Four-digit year in which the data was reported.

depreciation_type

Type of depreciation provision within FERC Account 108, including cost of removal, depreciation expenses, salvage, cost of retired plant, etc.

plant_function

Functional role played by utility plant (steam production, nuclear production, distribution, transmission, etc.).

plant_status

Utility plant financial status (in service, future, leased, total).

utility_type

Listing of utility plant types. Examples include Electric Utility, Gas Utility, and Other Utility.

ending_balance

Account balance at end of year.

record_id

Identifier indicating original FERC Form 1 source record. format: {table_name}_{report_year}_{report_prd}_{respondent_id}_{spplmnt_num}_{row_number}. Unique within FERC Form 1 DB tables which are not row-mapped.

balance

Indication of whether a column is a credit or debit, as reported in the XBRL taxonomy.

row_type_xbrl

Indicates whether the value reported in the row is calculated, or uniquely reported within the table.

core_ferc1__yearly_depreciation_changes_sched219

package: pudl

Annual time series of changes in accumulated provision for depreciation of electric utility plant.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

FERC Form 1 -- Annual Report of Major Electric Utilities (Schedule 219 - Section A)

Primary key:

utility_id_ferc1, report_year, depreciation_type, plant_status, utility_type

Usage Warnings

  • Some columns contain subtotals; use caution when choosing columns to aggregate.

  • FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.

Additional Details

Electric utility plant refers to FERC Account 108.

Columns
utility_id_ferc1

PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.

report_year

Four-digit year in which the data was reported.

depreciation_type

Type of depreciation provision within FERC Account 108, including cost of removal, depreciation expenses, salvage, cost of retired plant, etc.

plant_status

Utility plant financial status (in service, future, leased, total).

utility_type

Listing of utility plant types. Examples include Electric Utility, Gas Utility, and Other Utility.

dollar_value

Dollar value of reported income, expense, asset, or liability.

record_id

Identifier indicating original FERC Form 1 source record. format: {table_name}_{report_year}_{report_prd}_{respondent_id}_{spplmnt_num}_{row_number}. Unique within FERC Form 1 DB tables which are not row-mapped.

balance

Indication of whether a column is a credit or debit, as reported in the XBRL taxonomy.

ferc_account

Actual FERC Account number (e.g. '359.1') if available, or a PUDL assigned ID when FERC accounts have been split or combined in reporting.

row_type_xbrl

Indicates whether the value reported in the row is calculated, or uniquely reported within the table.

core_ferc1__yearly_depreciation_summary_sched336

package: pudl

Annual time series of depreciation and amortization of electric plant.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

FERC Form 1 -- Annual Report of Major Electric Utilities (Schedule 336 - Section A)

Primary key:

utility_id_ferc1, report_year, plant_function, ferc_account_label

Usage Warnings

  • Some columns contain subtotals; use caution when choosing columns to aggregate.

  • FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.

Additional Details

Electric Plant refers to FERC Accounts 403, 404, and 405. This table only contains information from Section A: Summary of depreciation and amortization changes.

Columns
record_id

Identifier indicating original FERC Form 1 source record. format: {table_name}_{report_year}_{report_prd}_{respondent_id}_{spplmnt_num}_{row_number}. Unique within FERC Form 1 DB tables which are not row-mapped.

report_year

Four-digit year in which the data was reported.

utility_id_ferc1

PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.

plant_function

Functional role played by utility plant (steam production, nuclear production, distribution, transmission, etc.).

ferc_account_label

Long FERC account identifier derived from values reported in the XBRL taxonomies. May also refer to aggregations of individual FERC accounts.

ferc_account

Actual FERC Account number (e.g. '359.1') if available, or a PUDL assigned ID when FERC accounts have been split or combined in reporting.

dollar_value

Dollar value of reported income, expense, asset, or liability.

utility_type

Listing of utility plant types. Examples include Electric Utility, Gas Utility, and Other Utility.

row_type_xbrl

Indicates whether the value reported in the row is calculated, or uniquely reported within the table.

core_ferc1__yearly_energy_dispositions_sched401

package: pudl

Annual time series of dispositions of electric energy sold, exchanged, or stored.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

FERC Form 1 -- Annual Report of Major Electric Utilities (Schedule 401a)

Primary key:

utility_id_ferc1, report_year, energy_disposition_type

Usage Warnings

  • Some columns contain subtotals; use caution when choosing columns to aggregate.

  • FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.

Additional Details

Electric Energy Account, dispositions only (Schedule 401a): electricity that utilities delivered to end users, internal losses, etc.

Columns
utility_id_ferc1

PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.

report_year

Four-digit year in which the data was reported.

energy_disposition_type

Type of energy disposition reported in the core_ferc1__yearly_energy_dispositions_sched401 table. Dispositions include sales to customers, re-sales of energy, energy used internally, losses, etc.

row_type_xbrl

Indicates whether the value reported in the row is calculated, or uniquely reported within the table.

energy_mwh

Sources and uses of energy in MWh.

record_id

Identifier indicating original FERC Form 1 source record. format: {table_name}_{report_year}_{report_prd}_{respondent_id}_{spplmnt_num}_{row_number}. Unique within FERC Form 1 DB tables which are not row-mapped.

core_ferc1__yearly_energy_sources_sched401

package: pudl

Annual time series of sources of electric energy generated or purchased, exchanged and wheeled.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

FERC Form 1 -- Annual Report of Major Electric Utilities (Schedule 401a)

Primary key:

utility_id_ferc1, report_year, energy_source_type

Usage Warnings

  • Some columns contain subtotals; use caution when choosing columns to aggregate.

  • FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.

Additional Details

Electric Energy Account, sources only. Schedule 401a. Amount of electricity the utility obtained from each of several sources.

Columns
utility_id_ferc1

PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.

report_year

Four-digit year in which the data was reported.

energy_source_type

Type of energy source reported in the core_ferc1__yearly_energy_sources_sched401 table. There are three groups of energy sources: generation, power exchanges and transmission.

row_type_xbrl

Indicates whether the value reported in the row is calculated, or uniquely reported within the table.

energy_mwh

Sources and uses of energy in MWh.

record_id

Identifier indicating original FERC Form 1 source record. format: {table_name}_{report_year}_{report_prd}_{respondent_id}_{spplmnt_num}_{row_number}. Unique within FERC Form 1 DB tables which are not row-mapped.

core_ferc1__yearly_hydroelectric_plants_sched406

package: pudl

Annual time series of plant statistics for large hydroelectric generating plants.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

FERC Form 1 -- Annual Report of Major Electric Utilities (Schedule 406)

Primary key:

This table has no primary key. The best approximation for primary keys for this table would be: report_year, utility_id_ferc1, plant_name_ferc1. FERC does not publish plant IDs. The main identifying column is plant_name_ferc1 but that is a free-form string field and there are duplicate records.

Usage Warnings

  • The plant_type and construction_type are standardized into categorical values from free-form strings.

  • FERC does not require respondents to report unique, non-duplicative plant records. There are sporadic instances of respondents reporting portions of a plant and then the plant total (e.g. unit 1, unit 2 and total). Use caution when aggregating.

  • FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.

Additional Details

Large plants have an installed nameplate capacity of more than 10 MW.
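
Since the table has no primary key, it can help to check how often the approximate key (report_year, utility_id_ferc1, plant_name_ferc1) repeats before aggregating. This is a minimal sketch over plain dictionaries; the records and the helper name are hypothetical.

```python
from collections import Counter

APPROX_KEY = ("report_year", "utility_id_ferc1", "plant_name_ferc1")


def find_duplicate_keys(records):
    """Return approximate-key tuples that occur more than once, with counts."""
    counts = Counter(tuple(rec[col] for col in APPROX_KEY) for rec in records)
    return {key: n for key, n in counts.items() if n > 1}


# Hypothetical records; names and values are made up.
records = [
    {"report_year": 2020, "utility_id_ferc1": 7, "plant_name_ferc1": "example dam", "capacity_mw": 30.0},
    {"report_year": 2020, "utility_id_ferc1": 7, "plant_name_ferc1": "example dam", "capacity_mw": 60.0},
    {"report_year": 2020, "utility_id_ferc1": 7, "plant_name_ferc1": "other dam", "capacity_mw": 15.0},
]
dupes = find_duplicate_keys(records)
```

A repeated key does not say which row is a partial record and which is a total, so the flagged rows still need a manual look.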

Columns
record_id

Identifier indicating original FERC Form 1 source record. format: {table_name}_{report_year}_{report_prd}_{respondent_id}_{spplmnt_num}_{row_number}. Unique within FERC Form 1 DB tables which are not row-mapped.

utility_id_ferc1

PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.

report_year

Four-digit year in which the data was reported.

plant_name_ferc1

Name of the plant, as reported to FERC. This is a freeform string, not guaranteed to be consistent across references to the same plant.

project_num

FERC Licensed Project Number.

plant_type

Type of plant.

construction_type

Type of plant construction ('outdoor', 'semioutdoor', or 'conventional'). Categorized by PUDL based on our best guess of intended value in FERC1 freeform strings.

construction_year

Year the plant's oldest still operational unit was built.

installation_year

Year the plant's most recently built unit was installed.

capacity_mw

Total installed (nameplate) capacity, in megawatts.

peak_demand_mw

Net peak demand for 60 minutes. Note: in some cases peak demand for other time periods may have been reported instead, if hourly peak demand was unavailable.

plant_hours_connected_while_generating

Hours the plant was connected to load while generating in the report year.

net_capacity_favorable_conditions_mw

Net plant capability under the most favorable operating conditions, in megawatts.

net_capacity_adverse_conditions_mw

Net plant capability under the least favorable operating conditions, in megawatts.

avg_num_employees

The average number of employees assigned to each plant.

net_generation_mwh

Net electricity generation for the specified period in megawatt-hours (MWh).

capex_land

Cost of plant: land and land rights (USD).

capex_structures

Cost of plant: structures and improvements (USD).

capex_facilities

Cost of plant: reservoirs, dams, and waterways (USD).

capex_equipment

Cost of plant: equipment (USD).

capex_roads

Cost of plant: roads, railroads, and bridges (USD).

asset_retirement_cost

Asset retirement cost (USD).

capex_total

Total cost of plant (USD).

capex_per_mw

Cost of plant per megawatt of installed (nameplate) capacity. Nominal USD.

opex_operations

Production expenses: operations, supervision, and engineering (USD).

opex_water_for_power

Production expenses: water for power (USD).

opex_hydraulic

Production expenses: hydraulic expenses (USD).

opex_electric

Production expenses: electric expenses (USD).

opex_generation_misc

Production expenses: miscellaneous power generation expenses (USD).

opex_rents

Production expenses: rents (USD).

opex_engineering

Production expenses: maintenance, supervision, and engineering (USD).

opex_structures

Production expenses: maintenance of structures (USD).

opex_dams

Production expenses: maintenance of reservoirs, dams, and waterways (USD).

opex_plant

Production expenses: maintenance of electric plant (USD).

opex_misc_plant

Production expenses: maintenance of miscellaneous hydraulic plant (USD).

opex_total

Total production expenses, excluding fuel (USD).

opex_per_mwh

Total production expenses (USD per MWh generated).
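
The per-unit columns read as simple ratios of the totals above them. The sketch below assumes capex_per_mw is capex_total divided by capacity_mw and opex_per_mwh is opex_total divided by net_generation_mwh; that derivation is inferred from the column descriptions rather than confirmed, and the plant values are invented.

```python
def safe_ratio(numerator, denominator):
    """Divide, returning None when an input is missing or the denominator is zero."""
    if numerator is None or denominator is None or denominator == 0:
        return None
    return numerator / denominator


# Hypothetical plant record.
plant = {
    "capex_total": 50_000_000.0,
    "capacity_mw": 25.0,
    "opex_total": 1_200_000.0,
    "net_generation_mwh": 60_000.0,
}
capex_per_mw = safe_ratio(plant["capex_total"], plant["capacity_mw"])
opex_per_mwh = safe_ratio(plant["opex_total"], plant["net_generation_mwh"])
```

Returning None for a zero or missing denominator keeps plants with no reported generation from producing infinite or misleading ratios.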

core_ferc1__yearly_income_statements_sched114

package: pudl

Annual time series of utility income statements.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

FERC Form 1 -- Annual Report of Major Electric Utilities (Schedule 114)

Primary key:

utility_id_ferc1, report_year, utility_type, income_type

Usage Warnings

  • Some columns contain subtotals; use caution when choosing columns to aggregate.

  • FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.

Columns
record_id

Identifier indicating original FERC Form 1 source record. format: {table_name}_{report_year}_{report_prd}_{respondent_id}_{spplmnt_num}_{row_number}. Unique within FERC Form 1 DB tables which are not row-mapped.

utility_id_ferc1

PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.

report_year

Four-digit year in which the data was reported.

utility_type

Listing of utility plant types. Examples include Electric Utility, Gas Utility, and Other Utility.

income_type

Type of income reported in core_ferc1__yearly_income_statements_sched114 table.

dollar_value

Dollar value of reported income, expense, asset, or liability.

balance

Indication of whether a column is a credit or debit, as reported in the XBRL taxonomy.

ferc_account

Actual FERC Account number (e.g. '359.1') if available, or a PUDL assigned ID when FERC accounts have been split or combined in reporting.

row_type_xbrl

Indicates whether the value reported in the row is calculated, or uniquely reported within the table.
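
The subtotal warning and the row_type_xbrl column can be combined to avoid double counting. The sketch below keeps only rows marked as reported before summing. The labels "reported_value" and "calculated_value" are assumptions about how row_type_xbrl is coded, and the rows are invented.

```python
REPORTED = "reported_value"  # assumed label for directly reported rows


def sum_reported(rows, value_col="dollar_value"):
    """Sum a value column over rows that are not calculated subtotals."""
    return sum(
        row[value_col]
        for row in rows
        if row["row_type_xbrl"] == REPORTED and row[value_col] is not None
    )


# Hypothetical rows: two reported items and one calculated subtotal of them.
rows = [
    {"income_type": "item_a", "dollar_value": 100.0, "row_type_xbrl": "reported_value"},
    {"income_type": "item_b", "dollar_value": 50.0, "row_type_xbrl": "reported_value"},
    {"income_type": "total_of_a_and_b", "dollar_value": 150.0, "row_type_xbrl": "calculated_value"},
]
naive_total = sum(row["dollar_value"] for row in rows)
safe_total = sum_reported(rows)
```

The naive sum counts the two items twice. Whether dropping every calculated row is appropriate depends on the table, so check the actual values of row_type_xbrl first.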

core_ferc1__yearly_operating_expenses_sched320

package: pudl

Annual time series of operating and maintenance costs associated with producing electricity.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

FERC Form 1 -- Annual Report of Major Electric Utilities (Schedule 320)

Primary key:

utility_id_ferc1, report_year, expense_type

Usage Warnings

  • Some columns contain subtotals; use caution when choosing columns to aggregate.

  • FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.

Columns
utility_id_ferc1

PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.

report_year

Four-digit year in which the data was reported.

dollar_value

Dollar value of reported income, expense, asset, or liability.

expense_type

The type of expense.

record_id

Identifier indicating original FERC Form 1 source record. format: {table_name}_{report_year}_{report_prd}_{respondent_id}_{spplmnt_num}_{row_number}. Unique within FERC Form 1 DB tables which are not row-mapped.

utility_type

Listing of utility plant types. Examples include Electric Utility, Gas Utility, and Other Utility.

ferc_account

Actual FERC Account number (e.g. '359.1') if available, or a PUDL assigned ID when FERC accounts have been split or combined in reporting.

row_type_xbrl

Indicates whether the value reported in the row is calculated, or uniquely reported within the table.

core_ferc1__yearly_operating_revenues_sched300

package: pudl

Annual time series of utilities' electric operating revenues.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

FERC Form 1 -- Annual Report of Major Electric Utilities (Schedule 300)

Primary key:

utility_id_ferc1, report_year, revenue_type

Usage Warnings

  • Some columns contain subtotals; use caution when choosing columns to aggregate.

  • FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.

Additional Details

This table includes only the structured part of Schedule 300. A number of revenue_type values do not have sales_mwh or avg_customers_per_month provided, in which case these columns will be NULL.

Columns
utility_id_ferc1

PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.

report_year

Four-digit year in which the data was reported.

record_id

Identifier indicating original FERC Form 1 source record. format: {table_name}_{report_year}_{report_prd}_{respondent_id}_{spplmnt_num}_{row_number}. Unique within FERC Form 1 DB tables which are not row-mapped.

revenue_type

Label describing types of revenues.

dollar_value

Dollar value of reported income, expense, asset, or liability.

sales_mwh

Quantity of electricity sold in MWh.

avg_customers_per_month

Average number of customers per month.

ferc_account

Actual FERC Account number (e.g. '359.1') if available, or a PUDL assigned ID when FERC accounts have been split or combined in reporting.

utility_type

Listing of utility plant types. Examples include Electric Utility, Gas Utility, and Other Utility.

row_type_xbrl

Indicates whether the value reported in the row is calculated, or uniquely reported within the table.

core_ferc1__yearly_other_regulatory_liabilities_sched278

package: pudl

Annual time series of utilities' other regulatory liabilities, including rate order docket number.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

FERC Form 1 -- Annual Report of Major Electric Utilities (Schedule 278)

Primary key:

This table has no primary key. Respondents are able to enter any number of liabilities across many rows. There are no IDs or set fields enforced in the original table.

Usage Warnings

  • The description column is a free-form string.

  • FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.

Columns
utility_id_ferc1

PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.

report_year

Four-digit year in which the data was reported.

description

Long human-readable description of the meaning of a code/label.

ending_balance

Account balance at end of year.

starting_balance

Account balance at beginning of year.

increase_in_other_regulatory_liabilities

The increase during the reporting period of other regulatory liabilities.

account_detail

Description of the account number credited from making debit adjustment to other regulatory liabilities.

decrease_in_other_regulatory_liabilities

The decrease during the reporting period of other regulatory liabilities.

core_ferc1__yearly_plant_in_service_sched204

package: pudl

Annual time series of utilities' balances and changes to FERC Electric Plant in Service accounts.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

FERC Form 1 -- Annual Report of Major Electric Utilities (Schedule 204)

Primary key:

utility_id_ferc1, report_year, ferc_account_label

Usage Warnings

  • Some columns contain subtotals; use caution when choosing columns to aggregate.

  • FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.

Additional Details

Account numbers correspond to the FERC Uniform System of Accounts for Electric Plant, which is defined in Code of Federal Regulations (CFR) Title 18, Chapter I, Subchapter C, Part 101. (See e.g. https://www.law.cornell.edu/cfr/text/18/part-101). Each FERC respondent reports starting and ending balances for each account annually. Balances are organization wide, and are not broken down on a per-plant basis. End of year balance should equal beginning year balance plus the sum of additions, retirements, adjustments, and transfers.
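
The balance identity described above can be checked row by row. The sketch below assumes the components are stored with signs that let them be summed directly (so a retirement appears as a negative number), as the wording "plus the sum of" suggests. The row values and the function name are hypothetical.

```python
import math

COMPONENTS = ("additions", "retirements", "adjustments", "transfers")


def balance_gap(row):
    """Ending balance minus (starting balance + components); None if any input is null."""
    needed = ("starting_balance", "ending_balance", *COMPONENTS)
    if any(row[col] is None for col in needed):
        return None
    expected = row["starting_balance"] + sum(row[col] for col in COMPONENTS)
    return row["ending_balance"] - expected


# Hypothetical account row; retirements carried as a negative value (assumption).
row = {
    "starting_balance": 1000.0,
    "additions": 200.0,
    "retirements": -50.0,
    "adjustments": 0.0,
    "transfers": 10.0,
    "ending_balance": 1160.0,
}
gap = balance_gap(row)
balanced = gap is not None and math.isclose(gap, 0.0, abs_tol=0.5)
```

A small tolerance absorbs rounding in reported dollars. Rows with a large gap are worth inspecting before they are used in any aggregation.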

Columns
utility_id_ferc1

PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.

report_year

Four-digit year in which the data was reported.

ferc_account_label

Long FERC account identifier derived from values reported in the XBRL taxonomies. May also refer to aggregations of individual FERC accounts.

ferc_account

Actual FERC Account number (e.g. '359.1') if available, or a PUDL assigned ID when FERC accounts have been split or combined in reporting.

row_type_xbrl

Indicates whether the value reported in the row is calculated, or uniquely reported within the table.

starting_balance

Account balance at beginning of year.

additions

Cost of acquisition of items classified within the account.

retirements

Cost of disposal of items classified within the account.

adjustments

Cost of adjustments to the account.

transfers

Cost of transfers into (out of) the account.

ending_balance

Account balance at end of year.

record_id

Identifier indicating original FERC Form 1 source record. format: {table_name}_{report_year}_{report_prd}_{respondent_id}_{spplmnt_num}_{row_number}. Unique within FERC Form 1 DB tables which are not row-mapped.

utility_type

Listing of utility plant types. Examples include Electric Utility, Gas Utility, and Other Utility.

plant_status

Utility plant financial status (in service, future, leased, total).

core_ferc1__yearly_pumped_storage_plants_sched408

package: pudl

Annual time series of plant statistics for hydroelectric pumped storage plants with an installed nameplate capacity of 10+ MW.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

FERC Form 1 -- Annual Report of Major Electric Utilities (Schedule 408)

Primary key:

This table has no primary key. The best approximation for primary keys for this table would be: report_year, utility_id_ferc1, plant_name_ferc1. FERC does not publish plant IDs. The main identifying column is plant_name_ferc1 but that is a free-form string field and there are duplicate records.

Usage Warnings

  • FERC does not restrict respondents to report unique and non-duplicative plant records. There are sporadic instances of respondents reporting portions of plants and then the total plant (ex: unit 1, unit 2 and total). Use caution when aggregating.

  • FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.
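One rough way to spot the "unit 1, unit 2 and total" pattern described above is a keyword search on plant_name_ferc1. The sketch below is a heuristic only, with a hypothetical function name and made-up plant names; it will miss totals that are not labeled and is not how PUDL itself handles these rows.

```python
import re

# Match "total" as a whole word, ignoring case.
TOTAL_PATTERN = re.compile(r"\btotal\b", re.IGNORECASE)


def looks_like_total(plant_name):
    """True if the free-form plant name contains the word 'total'."""
    return bool(plant_name) and TOTAL_PATTERN.search(plant_name) is not None


# Made-up plant names, for illustration only.
names = ["Bear Creek unit 1", "Bear Creek unit 2", "Bear Creek Total", "Totality Station"]
flags = [looks_like_total(name) for name in names]
print(flags)
```

Flagged rows are candidates for exclusion before summing, not confirmed duplicates.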

Additional Details

As reported in Schedule 408 of FERC Form 1.

Columns
record_id

Identifier indicating original FERC Form 1 source record. format: {table_name}_{report_year}_{report_prd}_{respondent_id}_{spplmnt_num}_{row_number}. Unique within FERC Form 1 DB tables which are not row-mapped.

utility_id_ferc1

PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.

report_year

Four-digit year in which the data was reported.

plant_name_ferc1

Name of the plant, as reported to FERC. This is a freeform string, not guaranteed to be consistent across references to the same plant.

project_num

FERC Licensed Project Number.

construction_type

Type of plant construction ('outdoor', 'semioutdoor', or 'conventional'). Categorized by PUDL based on our best guess of intended value in FERC1 freeform strings.

construction_year

Year the plant's oldest still operational unit was built.

installation_year

Year the plant's most recently built unit was installed.

capacity_mw

Total installed (nameplate) capacity, in megawatts.

peak_demand_mw

Net peak demand for 60 minutes. Note: in some cases peak demand for other time periods may have been reported instead, if hourly peak demand was unavailable.

plant_hours_connected_while_generating

Hours the plant was connected to load while generating in the report year.

plant_capability_mw

Net plant capability in megawatts.

avg_num_employees

The average number of employees assigned to each plant.

net_generation_mwh

Net electricity generation for the specified period in megawatt-hours (MWh).

energy_used_for_pumping_mwh

Energy used for pumping, in megawatt-hours.

net_load_mwh

Net output for load (net generation - energy used for pumping) in megawatt-hours.
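The relationship stated in this description is simple arithmetic. As a small illustration (the function name and the numbers are hypothetical):

```python
def net_load_mwh(net_generation_mwh, energy_used_for_pumping_mwh):
    """Net output for load: net generation minus energy used for pumping."""
    # Propagate missing values rather than guessing.
    if net_generation_mwh is None or energy_used_for_pumping_mwh is None:
        return None
    return net_generation_mwh - energy_used_for_pumping_mwh


# A plant that used more energy for pumping than it generated.
print(net_load_mwh(120_000.0, 150_000.0))  # -30000.0
```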

capex_land

Cost of plant: land and land rights (USD).

capex_structures

Cost of plant: structures and improvements (USD).

capex_facilities

Cost of plant: reservoirs, dams, and waterways (USD).

capex_wheels_turbines_generators

Cost of plant: water wheels, turbines, and generators (USD).

capex_equipment_electric

Cost of plant: accessory electric equipment (USD).

capex_equipment_misc

Cost of plant: miscellaneous power plant equipment (USD).

capex_roads

Cost of plant: roads, railroads, and bridges (USD).

asset_retirement_cost

Asset retirement cost (USD).

capex_total

Total cost of plant (USD).

capex_per_mw

Cost of plant per megawatt of installed (nameplate) capacity. Nominal USD.

opex_operations

Production expenses: operations, supervision, and engineering (USD).

opex_water_for_power

Production expenses: water for power (USD).

opex_pumped_storage

Production expenses: pumped storage (USD).

opex_electric

Production expenses: electric expenses (USD).

opex_generation_misc

Production expenses: miscellaneous power generation expenses (USD).

opex_rents

Production expenses: rents (USD).

opex_engineering

Production expenses: maintenance, supervision, and engineering (USD).

opex_structures

Production expenses: maintenance of structures (USD).

opex_dams

Production expenses: maintenance of reservoirs, dams, and waterways (USD).

opex_plant

Production expenses: maintenance of electric plant (USD).

opex_misc_plant

Production expenses: maintenance of miscellaneous hydraulic plant (USD).

opex_production_before_pumping

Total production expenses before pumping (USD).

opex_pumping

Production expenses: pumping expenses (USD).

opex_total

Total production expenses, excluding fuel (USD).

opex_per_mwh

Total production expenses (USD per MWh generated).

core_ferc1__yearly_purchased_power_and_exchanges_sched326

package: pudl

Annual time series of purchased power (Account 555) including power exchanges (transactions involving a balancing of debits and credits for energy, capacity, etc.) and any settlements for imbalanced exchanges.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

FERC Form 1 -- Annual Report of Major Electric Utilities (Schedule 326)

Primary key:

This table has no primary key. The primary key for this table would be record_id; however, a small number of these values are duplicated in 2021, 2022, and 2024.
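Before treating record_id as a key, it can help to list the values that occur more than once. A minimal sketch, assuming the rows have been loaded as dictionaries; the function name and the sample IDs are hypothetical:

```python
from collections import Counter


def duplicated_record_ids(rows):
    """Map each record_id that appears more than once to its count."""
    counts = Counter(row["record_id"] for row in rows)
    return {record_id: n for record_id, n in counts.items() if n > 1}


# Made-up rows, for illustration only.
rows = [
    {"record_id": "a_2021_1", "report_year": 2021},
    {"record_id": "a_2021_2", "report_year": 2021},
    {"record_id": "a_2021_2", "report_year": 2021},
]
print(duplicated_record_ids(rows))
```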

Usage Warnings

  • Contains columns which may appear categorical, but are actually free text.

  • FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.

Additional Details

This table has data about inter-utility power purchases. This includes how much electricity was purchased, how much it cost, and who it was purchased from. Unfortunately the field describing which other utility the power was being bought from (seller_name) is poorly standardized, making it difficult to correlate with other data.

Purchased Power is considered FERC Account 555 according to FERC's Uniform System of Accounts. Reported on pages 326-327 of FERC Form 1.

Columns
record_id

Identifier indicating original FERC Form 1 source record. format: {table_name}_{report_year}_{report_prd}_{respondent_id}_{spplmnt_num}_{row_number}. Unique within FERC Form 1 DB tables which are not row-mapped.

utility_id_ferc1

PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.

report_year

Four-digit year in which the data was reported.

seller_name

Name of the seller, or the other party in an exchange transaction.

purchase_type_code

Categorization based on the original contractual terms and conditions of the service. Must be one of 'requirements', 'long_firm', 'intermediate_firm', 'short_firm', 'long_unit', 'intermediate_unit', 'electricity_exchange', 'other_service', or 'adjustment'. Requirements service is ongoing high reliability service, with load integrated into system resource planning. 'Long term' means 5+ years. 'Intermediate term' is 1-5 years. 'Short term' is less than 1 year. 'Firm' means not interruptible for economic reasons. 'unit' indicates service from a particular designated generating unit. 'exchange' is an in-kind transaction.
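The codes listed above combine a contract term with a service type. The sketch below separates the two; the function name is hypothetical and the split is based only on the code strings in this description.

```python
VALID_PURCHASE_TYPES = {
    "requirements",
    "long_firm",
    "intermediate_firm",
    "short_firm",
    "long_unit",
    "intermediate_unit",
    "electricity_exchange",
    "other_service",
    "adjustment",
}
TERMS = {"long", "intermediate", "short"}


def split_purchase_type(code):
    """Return (term, service); term is None for codes without a term prefix."""
    if code not in VALID_PURCHASE_TYPES:
        raise ValueError(f"unknown purchase_type_code: {code!r}")
    prefix, _, rest = code.partition("_")
    if prefix in TERMS:
        return prefix, rest
    return None, code


print(split_purchase_type("long_firm"))
print(split_purchase_type("electricity_exchange"))
```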

tariff

FERC Rate Schedule Number or Tariff. (Note: may be incomplete if originally reported on multiple lines.)

billing_demand_mw

Monthly average billing demand (for requirements purchases, and any transactions involving demand charges). In megawatts.

non_coincident_peak_demand_mw

Average monthly non-coincident peak (NCP) demand (for requirements purchases, and any transactions involving demand charges). Monthly NCP demand is the maximum metered hourly (60-minute integration) demand in a month. In megawatts.

coincident_peak_demand_mw

Average monthly coincident peak (CP) demand (for requirements purchases, and any transactions involving demand charges). Monthly CP demand is the metered demand during the hour (60-minute integration) in which the supplier's system reaches its monthly peak. In megawatts.

purchased_mwh

Megawatt-hours shown on bills rendered to the respondent. Includes both electricity purchased for storage and non-storage purposes, which were lumped together prior to 2021.

purchased_storage_mwh

Number of megawatt hours purchased during the period for energy storage.

purchased_other_than_storage_mwh

Number of megawatt hours purchased during the period for other than energy storage.

received_mwh

Gross megawatt-hours received in power exchanges and used as the basis for settlement.

delivered_mwh

Gross megawatt-hours delivered in power exchanges and used as the basis for settlement.

demand_charges

Demand charges (USD).

energy_charges

Energy charges (USD).

other_charges

Other charges, including out-of-period adjustments (USD).

total_settlement

Sum of demand, energy, and other charges (USD). For power exchanges, the settlement amount for the net receipt of energy. If more energy was delivered than received, this amount is negative.
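For purchase rows, the first sentence of this description implies a simple consistency check. The sketch below is illustrative only: the function names and the record are hypothetical, missing charges are assumed to count as zero, and exchange rows (where total_settlement is a net settlement amount) are not expected to satisfy it.

```python
CHARGE_COLUMNS = ("demand_charges", "energy_charges", "other_charges")


def summed_charges(row):
    """Sum of demand, energy, and other charges, treating None as zero."""
    return sum(row.get(col) or 0.0 for col in CHARGE_COLUMNS)


def settlement_gap(row):
    """Reported total_settlement minus the sum of the three charge columns."""
    return row["total_settlement"] - summed_charges(row)


# Made-up purchase record, for illustration only.
example = {
    "demand_charges": 1000.0,
    "energy_charges": 2500.0,
    "other_charges": None,
    "total_settlement": 3500.0,
}
print(settlement_gap(example))
```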

core_ferc1__yearly_retained_earnings_sched118

package: pudl

Annual time series of utilities' statements of retained earnings.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

FERC Form 1 -- Annual Report of Major Electric Utilities (Schedule 118)

Primary key:

utility_id_ferc1, report_year, earnings_type

Usage Warnings

  • Some columns contain subtotals; use caution when choosing columns to aggregate.

  • FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.

Columns
utility_id_ferc1

PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.

report_year

Four-digit year in which the data was reported.

record_id

Identifier indicating original FERC Form 1 source record. format: {table_name}_{report_year}_{report_prd}_{respondent_id}_{spplmnt_num}_{row_number}. Unique within FERC Form 1 DB tables which are not row-mapped.

earnings_type

Label describing types of earnings.

starting_balance

Account balance at beginning of year.

ending_balance

Account balance at end of year.

balance

Indication of whether a column is a credit or debit, as reported in the XBRL taxonomy.

ferc_account

Actual FERC Account number (e.g. '359.1') if available, or a PUDL assigned ID when FERC accounts have been split or combined in reporting.

row_type_xbrl

Indicates whether the value reported in the row is calculated, or uniquely reported within the table.

utility_type

Listing of utility plant types. Examples include Electric Utility, Gas Utility, and Other Utility.

core_ferc1__yearly_sales_by_rate_schedules_sched304

package: pudl

Annual time series of utilities' electric sales from all rate schedules in effect throughout the year.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

FERC Form 1 -- Annual Report of Major Electric Utilities (Schedule 304)

Primary key:

This table has no primary key.

Usage Warnings

  • Some columns contain subtotals; use caution when choosing columns to aggregate.

  • Values in rate_schedule_description are free-form strings.

  • Data prior to 2021 does not include information in columns: rate_schedule_type and billing_status.

  • Units of revenue_per_kwh are suspected to include a mix of dollars and possibly cents.

  • FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.

Additional Details

The pre-2021 data in this table (extracted from FoxProDB rather than XBRL) is extremely unstructured. The post-2020 data (from XBRL) sorts records into rate schedule types (residential, industrial, commercial, public_lighting, public_authorities, railroads, interdepartmental, provision_for_rate_refund, commercial_and_industrial, total) and billing statuses (billed, unbilled, total). The pre-2021 data instead stuffs all of that information (if you're lucky) into the rate_schedule_description column. There's no point trying to parse the pre-2021 rate_schedule_description column en masse because it's just too messy. Its contents often include numbers and acronyms that have little to no meaning out of context. The table is structured somewhat like the FERC1 small generators table, with headings about rate structure type also embedded in the rate_schedule_description column. To all who dare, beware.

This table is a combination of one pre-2021 (DBF) table and nine post-2020 (XBRL) tables--one for each rate schedule type plus totals--hence the increase in data clarity post-2020. The rate_schedule_type and billing_status columns are only relevant for post-2020 data, as they can be reliably parsed from each of the tables and incorporated into columns. The rate_schedule_description column is supposed to contain sub-rate_schedule_type names for charges (Ex: Residential 1, Residential 2, etc.). However, the pre-2021 data contains a little bit of everything (or nothing), and the post-2020 data has some totals or wonky data thrown in. That is to say, even when working with post-2020 data, be wary of aggregating the data. That's what the "total" rows are for.

The values that come from the totals table are marked with the string "total" in the rate_schedule_description column. The totals table is a product of the transition to XBRL, so these distinguishable totals are only available for data post-2020 (otherwise you could try keyword searching for "total" in rate_schedule_description). The totals table contains two types of totals: the utility totals across all rate schedules in a given year (marked with rate_schedule_description = "total" and rate_schedule_type = "total") and each of the utility's individual rate schedule totals in a given year (marked with rate_schedule_description = "total" and rate_schedule_type = "residential" or any other rate schedule type).
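The two kinds of total rows described above can be told apart from the two marker columns. A minimal sketch for post-2020 rows, with a hypothetical function name and made-up records:

```python
def classify_total_row(row):
    """Label a post-2020 row as 'utility_total', 'rate_schedule_total', or 'detail'."""
    if row.get("rate_schedule_description") != "total":
        return "detail"
    if row.get("rate_schedule_type") == "total":
        return "utility_total"
    return "rate_schedule_total"


# Made-up rows, for illustration only.
rows = [
    {"rate_schedule_type": "residential", "rate_schedule_description": "Residential 1"},
    {"rate_schedule_type": "residential", "rate_schedule_description": "total"},
    {"rate_schedule_type": "total", "rate_schedule_description": "total"},
]
labels = [classify_total_row(row) for row in rows]
print(labels)
```

Keeping only the "detail" rows before summing avoids double counting, subject to the caveat that some subtotals are not labeled.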

The rate schedule based XBRL tables only report billed values whereas the total tables report billed, unbilled, and total values. (See the column description for more info on the difference between billed and unbilled). This is important to consider if you're endeavoring to compare the subtotal values with the total values. We have not attempted to fix or verify any subtotals or totals that don't add up.

Another important note is the possibility of unit discrepancies in certain columns. The revenue_per_kwh column does not specify reporting units, and closer inspection of the data reveals two clear peaks approximately two orders of magnitude apart. This indicates that values may be reported in both dollars and cents. However, because the price of energy per kWh varies so much regionally, we cannot guarantee which is which and have not put any cleaning mechanisms in place to account for this.
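One way a user might guess the units of a given revenue_per_kwh value is to compare it with the revenue per kWh implied by dollar_value and sales_mwh in the same row. This is a heuristic sketch only, not a cleaning step applied in this table: the function name and tolerance are hypothetical, and it assumes dollar_value is in USD and sales_mwh is reliable, which is not guaranteed.

```python
import math


def guess_revenue_units(dollar_value, sales_mwh, revenue_per_kwh, tolerance=0.5):
    """Guess 'dollars', 'cents', or 'unknown' for a reported revenue_per_kwh."""
    values = (dollar_value, sales_mwh, revenue_per_kwh)
    if any(v is None or v <= 0 for v in values):
        return "unknown"
    implied_usd_per_kwh = dollar_value / (sales_mwh * 1000.0)
    # Roughly 0 if reported in dollars, roughly 2 if reported in cents.
    log_ratio = math.log10(revenue_per_kwh / implied_usd_per_kwh)
    if abs(log_ratio) <= tolerance:
        return "dollars"
    if abs(log_ratio - 2.0) <= tolerance:
        return "cents"
    return "unknown"


# Made-up values: 1,200,000 USD for 10,000 MWh implies 0.12 USD per kWh.
print(guess_revenue_units(1_200_000.0, 10_000.0, 0.12))
print(guess_revenue_units(1_200_000.0, 10_000.0, 12.0))
```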

Columns
utility_id_ferc1

PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.

report_year

Four-digit year in which the data was reported.

rate_schedule_type

Categorization of rate schedule type.

billing_status

Whether an amount is billed, unbilled, or both. Billed amounts pertain to the exchange of energy and unbilled amounts pertain to other sources of revenue such as contracts with peaker plants to keep them on standby or charging rent to host cell antennas on transmission towers.

rate_schedule_description

Free-form description of what the rate schedule name is. Not standardized. Often a sub-category of rate_schedule_type.

sales_mwh

Quantity of electricity sold in MWh.

dollar_value

Dollar value of reported income, expense, asset, or liability.

avg_customers_per_month

Average number of customers per month.

kwh_per_customer

kWh per customer.

revenue_per_kwh

The amount of revenue per kWh by rate schedule acquired in the given report year.

record_id

Identifier indicating original FERC Form 1 source record. format: {table_name}_{report_year}_{report_prd}_{respondent_id}_{spplmnt_num}_{row_number}. Unique within FERC Form 1 DB tables which are not row-mapped.

core_ferc1__yearly_small_plants_sched410

package: pudl

Annual time series of plant statistics for internal combustion plants, gas turbine-plants, conventional hydro plants, and pumped storage plants with less than 10 MW installed nameplate capacity and steam plants with less than 25 MW installed nameplate capacity.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

FERC Form 1 -- Annual Report of Major Electric Utilities (Schedule 410)

Primary key:

This table has no primary key. The best approximation for primary keys for this table would be: report_year, utility_id_ferc1, plant_name_ferc1. FERC does not publish plant IDs. The main identifying column is plant_name_ferc1 but that is a free-form string field and there are duplicate records.

Usage Warnings

  • FERC does not restrict respondents to report unique and non-duplicative plant records. There are sporadic instances of respondents reporting portions of plants and then the total plant (ex: unit 1, unit 2 and total). Use caution when aggregating.

  • FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.

Additional Details

As reported on FERC Form 1 Schedule 410 (pages 410-411) and extracted from the FERC Visual FoxPro and XBRL. See our pudl.extract.ferc1.TABLE_NAME_MAP_FERC1 for links to the raw tables.

The raw version of this table is more like a digitized PDF than an actual data table. The rows contain lots of information in addition to what the columns might suggest. For instance, a single column may contain header rows, note rows, and total rows. This extraneous information is useful, but it prevents proper analysis when mixed in with the rest of the values data in the column. We employ a couple of data transformations to extract these rows from the data and preserve some of the information they contain (fuel type, plant type, FERC license, or general notes about the plant) in separate columns.

Columns
record_id

Identifier indicating original FERC Form 1 source record. format: {table_name}_{report_year}_{report_prd}_{respondent_id}_{spplmnt_num}_{row_number}. Unique within FERC Form 1 DB tables which are not row-mapped.

utility_id_ferc1

PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.

report_year

Four-digit year in which the data was reported.

plant_name_ferc1

Name of the plant, as reported to FERC. This is a freeform string, not guaranteed to be consistent across references to the same plant.

plant_type

Type of plant.

license_id_ferc1

FERC issued operating license ID for the facility, if available. This value is extracted from the original plant name where possible.

construction_year

Year the plant's oldest still operational unit was built.

capacity_mw

Total installed (nameplate) capacity, in megawatts.

peak_demand_mw

Net peak demand for 60 minutes. Note: in some cases peak demand for other time periods may have been reported instead, if hourly peak demand was unavailable.

net_generation_mwh

Net electricity generation for the specified period in megawatt-hours (MWh).

capex_total

Total cost of plant (USD).

capex_per_mw

Cost of plant per megawatt of installed (nameplate) capacity. Nominal USD.

opex_operations

Production expenses: operations, supervision, and engineering (USD).

opex_fuel

Production expenses: fuel (USD).

opex_maintenance

Production expenses: Maintenance (USD).

fuel_type

Type of fuel.

fuel_cost_per_mmbtu

Average fuel cost per mmBTU of heat content in nominal USD.

core_ferc1__yearly_steam_plants_fuel_sched402

package: pudl

Annual time series of fuel cost and quantity for steam plants with a capacity of 25+ MW, internal combustion and gas-turbine plants of 10+ MW, and all nuclear plants.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

FERC Form 1 -- Annual Report of Major Electric Utilities (Schedule 402)

Primary key:

This table has no primary key.

Usage Warnings

  • The fuel_type_code_pudl is inferred from a free-form string field.

  • FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.

Additional Details

This table is a subset of the steam plant table reported on page 402 of FERC Form 1.

Columns
record_id

Identifier indicating original FERC Form 1 source record. format: {table_name}_{report_year}_{report_prd}_{respondent_id}_{spplmnt_num}_{row_number}. Unique within FERC Form 1 DB tables which are not row-mapped.

utility_id_ferc1

PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.

report_year

Four-digit year in which the data was reported.

plant_name_ferc1

Name of the plant, as reported to FERC. This is a freeform string, not guaranteed to be consistent across references to the same plant.

fuel_type_code_pudl

Simplified fuel type code used in PUDL.

fuel_units

Reported unit of measure for fuel.

fuel_consumed_units

Consumption of the fuel type in physical unit. Note: this is the total quantity consumed for both electricity and, in the case of combined heat and power plants, process steam production.

fuel_mmbtu_per_unit

Heat content of the fuel in millions of Btus per physical unit.

fuel_cost_per_unit_burned

Average cost of fuel consumed in the report year per reported fuel unit (USD).

fuel_cost_per_unit_delivered

Average cost of fuel delivered in the report year per reported fuel unit (USD).

fuel_cost_per_mmbtu

Average fuel cost per mmBTU of heat content in nominal USD.
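Dimensionally, a cost per fuel unit divided by the heat content per fuel unit gives a cost per mmBTU. The sketch below uses that relationship as a rough cross-check between columns. It is an assumption that fuel_cost_per_unit_burned is the appropriate numerator, the function name and numbers are hypothetical, and reported values may not satisfy the relationship.

```python
def implied_cost_per_mmbtu(fuel_cost_per_unit_burned, fuel_mmbtu_per_unit):
    """USD per fuel unit divided by mmBTU per fuel unit gives USD per mmBTU."""
    # Return None when the inputs are missing or the heat content is zero.
    if fuel_cost_per_unit_burned is None or not fuel_mmbtu_per_unit:
        return None
    return fuel_cost_per_unit_burned / fuel_mmbtu_per_unit


# Made-up values: 50 USD per ton of fuel at 20 mmBTU per ton.
print(implied_cost_per_mmbtu(50.0, 20.0))  # 2.5
```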

core_ferc1__yearly_steam_plants_sched402

package: pudl

Annual time series of plant statistics for steam plants with a capacity of 25+ MW, internal combustion and gas-turbine plants of 10+ MW, and all nuclear plants.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

FERC Form 1 -- Annual Report of Major Electric Utilities (Schedule 402)

Primary key:

This table has no primary key. The best approximation for primary keys for this table would be: report_year, utility_id_ferc1, plant_name_ferc1. FERC does not publish plant IDs. The main identifying column is plant_name_ferc1 but that is a free-form string field and there are duplicate records.

Usage Warnings

  • FERC does not restrict respondents to report unique and non-duplicative plant records. There are sporadic instances of respondents reporting portions of plants and then the total plant (ex: unit 1, unit 2 and total). Use caution when aggregating.

  • FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.

Columns
record_id

Identifier indicating original FERC Form 1 source record. format: {table_name}_{report_year}_{report_prd}_{respondent_id}_{spplmnt_num}_{row_number}. Unique within FERC Form 1 DB tables which are not row-mapped.

utility_id_ferc1

PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.

report_year

Four-digit year in which the data was reported.

plant_name_ferc1

Name of the plant, as reported to FERC. This is a freeform string, not guaranteed to be consistent across references to the same plant.

plant_type

Type of plant.

construction_type

Type of plant construction ('outdoor', 'semioutdoor', or 'conventional'). Categorized by PUDL based on our best guess of intended value in FERC1 freeform strings.

construction_year

Year the plant's oldest still operational unit was built.

installation_year

Year the plant's most recently built unit was installed.

capacity_mw

Total installed (nameplate) capacity, in megawatts.

peak_demand_mw

Net peak demand for 60 minutes. Note: in some cases peak demand for other time periods may have been reported instead, if hourly peak demand was unavailable.

plant_hours_connected_while_generating

Hours the plant was connected to load while generating in the report year.

plant_capability_mw

Net plant capability in megawatts.

water_limited_capacity_mw

Plant capacity in MW when limited by condenser water.

not_water_limited_capacity_mw

Plant capacity in MW when not limited by condenser water.

avg_num_employees

The average number of employees assigned to each plant.

net_generation_mwh

Net electricity generation for the specified period in megawatt-hours (MWh).

capex_land

Cost of plant: land and land rights (USD).

capex_structures

Cost of plant: structures and improvements (USD).

capex_equipment

Cost of plant: equipment (USD).

capex_total

Total cost of plant (USD).

capex_per_mw

Cost of plant per megawatt of installed (nameplate) capacity. Nominal USD.

opex_operations

Production expenses: operations, supervision, and engineering (USD).

opex_fuel

Production expenses: fuel (USD).

opex_coolants

Cost of coolants and water (nuclear plants only).

opex_steam

Steam expenses.

opex_steam_other

Steam from other sources.

opex_transfer

Steam transferred (Credit).

opex_electric

Production expenses: electric expenses (USD).

opex_misc_power

Miscellaneous steam (or nuclear) expenses.

opex_rents

Production expenses: rents (USD).

opex_allowances

Allowances.

opex_engineering

Production expenses: maintenance, supervision, and engineering (USD).

opex_structures

Production expenses: maintenance of structures (USD).

opex_boiler

Maintenance of boiler (or reactor) plant.

opex_plants

Maintenance of electrical plant.

opex_misc_steam

Maintenance of miscellaneous steam (or nuclear) plant.

opex_production_total

Total operating expenses.

opex_per_mwh

Total production expenses (USD per MWh generated).

asset_retirement_cost

Asset retirement cost (USD).

core_ferc1__yearly_transmission_lines_sched422

package: pudl

Annual time series of statistics about transmission lines.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

FERC Form 1 -- Annual Report of Major Electric Utilities (Schedule 422)

Primary key:

This table has no primary key. Each record of this table is supposed to represent one stretch of a transmission line, but there are no IDs and many nulls in the fields which would nominally distinguish unique transmission lines.

Usage Warnings

  • Contains columns which may appear categorical, but are actually free text.

  • Some columns contain subtotals; use caution when choosing columns to aggregate.

  • FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.

Additional Details

Information describing transmission lines, the cost of lines, annual operating and capital expenses, etc. This table includes transmission lines with a nominal voltage of 132 kilovolts or greater. Transmission lines below this voltage are required to be reported in group totals only for each voltage.

Columns
record_id

Identifier indicating original FERC Form 1 source record. format: {table_name}_{report_year}_{report_prd}_{respondent_id}_{spplmnt_num}_{row_number}. Unique within FERC Form 1 DB tables which are not row-mapped.

utility_id_ferc1

PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.

report_year

Four-digit year in which the data was reported.

start_point

The starting point of a transmission line.

end_point

The end point of a transmission line.

operating_voltage_kv

The operating voltage, expressed in kilovolts, for three-phase 60-cycle alternating current transmission lines.

designed_voltage_kv

Manufactured (designed) voltage, expressed in kilovolts, for three-phase 60-cycle alternating current transmission lines.

supporting_structure_type

Supporting structure of the transmission line.

transmission_line_length_miles

Length (in pole miles, or circuit miles if the transmission lines are underground) for lines that are stand-alone structures (whose costs are reported on a stand-alone basis).

transmission_line_and_structures_length_miles

Length (in pole miles, or circuit miles if the transmission lines are underground) for lines that are aggregated with other lines / structures (whose costs are aggregated and combined with other structures).

num_transmission_circuits

Number of circuits in a transmission line.

conductor_size_and_material

Size of transmission conductor and material of the transmission line.

capex_land

Cost of Land and land rights for the transmission line.

capex_other

Construction and other costs for the transmission line.

capex_total

Total costs for the transmission line.

opex_operations

Operating expenses for the transmission line.

opex_maintenance

Maintenance expenses for the transmission line.

opex_rents

Rent expenses for the transmission line.

opex_total

Overall expenses for the transmission line.

core_ferc1__yearly_utility_plant_summary_sched200

package: pudl

Annual time series of utility plant and accumulated provisions for depreciation, amortization and depletion of utility plant assets.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

FERC Form 1 -- Annual Report of Major Electric Utilities (Schedule 200)

Primary key:

utility_id_ferc1, report_year, utility_type, utility_plant_asset_type

Usage Warnings

  • Some columns contain subtotals; use caution when choosing columns to aggregate.

  • FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.

Columns
utility_id_ferc1

PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.

report_year

Four-digit year in which the data was reported.

utility_type

Listing of utility plant types. Examples include Electric Utility, Gas Utility, and Other Utility.

utility_type_other

Freeform description of the type of utility reported in one of the three other utility_type sections in the core_ferc1__yearly_utility_plant_summary_sched200 table. This field is reported only in the DBF reporting years (1994-2020).

utility_plant_asset_type

Type of utility plant asset reported in the core_ferc1__yearly_utility_plant_summary_sched200 table. Assets include those leased to others, held for future use, construction work-in-progress and details of accumulated depreciation.

row_type_xbrl

Indicates whether the value reported in the row is calculated, or uniquely reported within the table.

ending_balance

Account balance at end of year.

record_id

Identifier indicating original FERC Form 1 source record. format: {table_name}_{report_year}_{report_prd}_{respondent_id}_{spplmnt_num}_{row_number}. Unique within FERC Form 1 DB tables which are not row-mapped.

core_ferc714__hourly_planning_area_demand

package: pudl

Hourly time series of electricity demand by planning area.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

FERC Form 714 -- Annual Electric Balancing Authority Area and Planning Area Report (Part III, Schedule 2a)

Primary key:

respondent_id_ferc714, datetime_utc

Usage Warnings

  • The datetime_utc timestamps have been cleaned due to inconsistent datetime reporting. See below for additional details.

  • FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.

Additional Details

This table includes data from the pre-2021 CSV raw source as well as the newer 2021 through present XBRL raw source.

This table includes three respondent ID columns: one from the CSV raw source, one from the XBRL raw source, and a PUDL-derived ID that links the two source IDs together. The source IDs have been filled in for all records, so you can select the full time series for a given respondent using any of these three IDs.

An important caveat is that the datetime_utc timestamps have been cleaned. Form 714 gives respondents little guidance on how to interpret timestamps: it asks for 24 hourly demand values for each day, labeled hour 1 through 24, with no indication of whether hour 1 begins at midnight.

The XBRL data contained several formats of timestamps. Most records corresponding to hour 1 of the Form have a timestamp with hour 1 as T1. About two thirds of the records in the hour 24 location of the form have a timestamp with an hour reported as T24 while the remaining third report this as T00 of the next day. T24 is not a valid format for the hour of a datetime, so we convert these T24 hours into T00 of the next day. A smaller subset of the respondents reports the 24th hour as the last second of the day - we also convert these records to the T00 of the next day.
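As a rough illustration of the T24 handling described above, the sketch below rolls both T24 timestamps and last-second-of-day timestamps forward to T00 of the next day. It is not PUDL's actual implementation: the function name normalize_hour_24 and the YYYY-MM-DDTHH:MM:SS input format are assumptions made for this example.

```python
from datetime import datetime, timedelta


def normalize_hour_24(timestamp: str) -> datetime:
    """Parse a timestamp, mapping hour 24 and 23:59:59 to next-day midnight.

    Assumes (hypothetically) that raw timestamps look like
    'YYYY-MM-DDTHH:MM:SS'.
    """
    date_part, time_part = timestamp.split("T")
    day = datetime.strptime(date_part, "%Y-%m-%d")

    # 'T24' is not a valid hour, so treat it as T00 of the following day.
    if time_part.startswith("24"):
        return day + timedelta(days=1)

    parsed = datetime.strptime(timestamp, "%Y-%m-%dT%H:%M:%S")

    # Some respondents report the 24th hour as the last second of the day.
    if (parsed.hour, parsed.minute, parsed.second) == (23, 59, 59):
        return day + timedelta(days=1)

    return parsed
```

For example, under these assumptions normalize_hour_24("2021-01-01T24:00:00") and normalize_hour_24("2021-01-01T23:59:59") both resolve to midnight on 2021-01-02, while ordinary hours pass through unchanged.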

Columns
respondent_id_ferc714

PUDL-assigned ID identifying a respondent to FERC Form 714. This ID associates natively reported respondent IDs from the original CSV and XBRL data sources.

respondent_id_ferc714_csv

FERC Form 714 respondent ID from CSV reported data - published from years: 2006-2020. This ID is linked to the newer years of reported XBRL data through the PUDL-assigned respondent_id_ferc714 ID. This ID was originally reported as respondent_id. Note that this ID does not correspond to FERC respondent IDs from other forms.

respondent_id_ferc714_xbrl

FERC Form 714 respondent ID from XBRL reported data - published from years: 2021-present. This ID is linked to the older years of reported CSV data through the PUDL-assigned respondent_id_ferc714 ID. This ID was originally reported as entity_id. Note that this ID does not correspond to FERC respondent IDs from other forms.

report_date

Date reported.

datetime_utc

Date and time converted to Coordinated Universal Time (UTC).

timezone

IANA timezone name.

demand_mwh

Electricity demand (energy) within a given timeframe.

core_ferc714__respondent_id

package: pudl

Respondent identification.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

FERC Form 714 -- Annual Electric Balancing Authority Area and Planning Area Report (Part I, Schedule 1)

Primary key:

respondent_id_ferc714

Usage Warnings

  • FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.

Columns
respondent_id_ferc714

PUDL-assigned ID identifying a respondent to FERC Form 714. This ID associates natively reported respondent IDs from the original CSV and XBRL data sources.

respondent_id_ferc714_csv

FERC Form 714 respondent ID from CSV reported data - published from years: 2006-2020. This ID is linked to the newer years of reported XBRL data through the PUDL-assigned respondent_id_ferc714 ID. This ID was originally reported as respondent_id. Note that this ID does not correspond to FERC respondent IDs from other forms.

respondent_id_ferc714_xbrl

FERC Form 714 respondent ID from XBRL reported data - published from years: 2021-present. This ID is linked to the older years of reported CSV data through the PUDL-assigned respondent_id_ferc714 ID. This ID was originally reported as entity_id. Note that this ID does not correspond to FERC respondent IDs from other forms.

respondent_name_ferc714

Name of the utility, balancing area authority, or planning authority responding to FERC Form 714.

eia_code

EIA utility or balancing area authority ID associated with this FERC Form 714 respondent. Note that many utilities are also balancing authorities and in many cases EIA uses the same integer ID to identify a utility in its role as a balancing authority AND as a utility, but there is no requirement that these IDs be the same, and in a number of cases they are different.

core_ferc714__yearly_planning_area_demand_forecast

package: pudl

Annual time series of 10-year forecasted summer and winter peak demand and annual net energy per planning area.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

FERC Form 714 -- Annual Electric Balancing Authority Area and Planning Area Report (Part III, Schedule 2b)

Primary key:

respondent_id_ferc714, report_year, forecast_year

Usage Warnings

  • This table includes data from the pre-2021 CSV raw source as well as the newer 2021 through present XBRL raw source. We created the respondent_id_ferc714 field to blend disparate IDs from the CSV and XBRL data over time. See the core_ferc714__respondent_id table for links to the original source IDs.

  • A small handful of respondents (~11) report more than 10 years, and an even smaller handful (~9) report fewer than 10.

  • FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.

Additional Details

This table contains forecasted net demand (MWh) as well as summer and winter peak demand (MW) for the ten years after the report_year.
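Because a few respondents report more or fewer than ten forecast years, it can be useful to check the forecast horizon before assuming a fixed length. The sketch below counts distinct forecast_year values per (respondent_id_ferc714, report_year) pair using plain tuples; the function names and the sample rows in the usage note are hypothetical and are not part of PUDL.

```python
from collections import defaultdict


def forecast_horizons(rows):
    """Count distinct forecast years per (respondent, report_year).

    `rows` is an iterable of
    (respondent_id_ferc714, report_year, forecast_year) tuples.
    """
    years = defaultdict(set)
    for respondent_id, report_year, forecast_year in rows:
        years[(respondent_id, report_year)].add(forecast_year)
    return {key: len(values) for key, values in years.items()}


def unusual_horizons(rows, expected=10):
    """Return only the filings whose horizon differs from `expected` years."""
    return {
        key: count
        for key, count in forecast_horizons(rows).items()
        if count != expected
    }
```

With made-up rows in which respondent 1 reports ten forecast years and respondent 2 reports twelve, unusual_horizons returns only the entry for respondent 2.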

Columns
respondent_id_ferc714

PUDL-assigned ID identifying a respondent to FERC Form 714. This ID associates natively reported respondent IDs from the original CSV and XBRL data sources.

report_year

Four-digit year in which the data was reported.

forecast_year

Four-digit year that applies to a particular forecasted value.

summer_peak_demand_forecast_mw

The maximum forecasted hourly summer load (for the months of June through September).

winter_peak_demand_forecast_mw

The maximum forecasted hourly winter load (for the months of January through March).

net_demand_forecast_mwh

Net forecasted electricity demand for the specific period in megawatt-hours (MWh).

core_ferc__codes_accounts

package: pudl

Code table containing descriptions of categorical codes for account numbers from the FERC Uniform System of Accounts for Electric Plant.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

FERC -- Mix of multiple FERC Forms

Primary key:

ferc_account_id

Usage Warnings

  • FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.

Additional Details

These codes are defined in Code of Federal Regulations (CFR) Title 18, Chapter I, Subchapter C, Part 101. (See e.g. https://www.law.cornell.edu/cfr/text/18/part-101).

Columns
ferc_account_id

Account identifier from FERC's Uniform System of Accounts for Electric Plant. Includes higher level labeled categories.

ferc_account_description

Description of the FERC account.

core_ferceqr__contracts

package: pudl

Contains information about contracts between companies selling and buying electricity market products.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

FERC Form 920 -- Electric Quarterly Report (EQR)

Primary key:

This table has no primary key. The FERC EQR contracts table has no natural primary key.

Usage Warnings

  • This table is experimental and/or a work in progress and may change in the future.

  • FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.

Columns
year_quarter

Year-quarter corresponding to record. Formatted like YYYYq{1-4}.
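The YYYYq{1-4} format can be split into numeric parts with a small parser. This is a minimal sketch that assumes a lowercase "q" separator exactly as shown in the format string above; parse_year_quarter is a hypothetical helper name, not a PUDL function.

```python
import re


def parse_year_quarter(value: str) -> tuple[int, int]:
    """Split a 'YYYYq{1-4}' string into (year, quarter) integers."""
    match = re.fullmatch(r"(\d{4})q([1-4])", value)
    if match is None:
        raise ValueError(f"not a YYYYq{{1-4}} value: {value!r}")
    return int(match.group(1)), int(match.group(2))
```

Sorting on the resulting (year, quarter) tuples gives chronological order without any date parsing.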

seller_company_id_ferc

The Company Identifier (CID) obtained through FERC's Company Registration system corresponding to the selling company.

contract_unique_id

An identifier beginning with the letter “C” and followed by a number (e.g., 'C1', 'C2') used to designate a record containing contract information. Note that these contract IDs may only be unique within the context of a particular seller, seller-buyer pair, or timeframe. FERC documentation of the field is limited.

seller_company_name

The name of the company that is authorized to make sales as indicated in the company’s FERC tariff(s) or that is required to file the EQR under section 220 of the Federal Power Act. This name must match the name provided as seller_company_name in the core_ferceqr__quarterly_identity table. There are a handful of cases in which this requirement is violated, so any joins between tables should rely on company_id_ferc, not the company names.

customer_company_name

The name of the purchaser of contract products and services. Unlike the seller_company_name, this name is not guaranteed to match a name in the core_ferceqr__quarterly_identity table. In addition, the same customer company may appear with different names in different contracts and transactions, since this field is an unconstrained string chosen by the seller.

contract_affiliate

If True, this field indicates the customer is an affiliate of the seller. The customer is an affiliate if it controls, is controlled by, or is under common control with the seller. This includes a division that operates as a functional unit. A customer of a seller who is an Exempt Wholesale Generator may be defined as an affiliate under the Public Utility Holding Company Act and the FPA.

ferc_tariff_reference

The FERC tariff reference cites the document that specifies the terms and conditions under which a Seller is authorized to make transmission sales, power sales or sales of related jurisdictional services at cost-based rates or at market-based rates. If the sales are market-based, the tariff that is specified in the FERC order granting the Seller Market Based Rate Authority must be listed. If a non-public utility does not have a FERC Tariff Reference, it should enter “NPU” for the FERC Tariff Reference.

contract_service_agreement_id

Unique identifier given to each service agreement that can be used by the Seller to produce the agreement, if requested. The identifier may be the number assigned by FERC for those service agreements that have been filed with and accepted by the Commission, or it may be generated as part of an internal identification system.

contract_execution_date

The date the contract was signed. If the parties signed on different dates, use the most recent date signed.

commencement_date_of_contract_term

The date the terms of the contract reported in fields 18, 23 and 25 through 44 (as defined in the data dictionary) became effective. If those terms became effective on multiple dates (i.e., due to one or more amendments), the date to be reported in this field is the date the most recent amendment became effective. If the contract or the most recent reported amendment does not have an effective date, the date when service began pursuant to the contract or most recent reported amendment may be used. If the terms reported in fields 18, 23 and 25 through 44 have not been amended since January 1, 2009, the initial date the contract became effective (or absent an effective date the initial date when service began) may be used.

contract_termination_date

The date that the contract expires.

actual_termination_date

The date the contract actually terminates.

extension_provision_description

Description of terms that provide for the continuation of the contract.

class_name

F - Firm: For transmission sales, a service or product that always has priority over non-firm service. For power sales, a service or product that is not interruptible for economic reasons. NF - Non-firm: For transmission sales, a service that is reserved and/or scheduled on an as-available basis and is subject to curtailment or interruption at a lesser priority compared to Firm service. For an energy sale, a service or product for which delivery or receipt of the energy may be interrupted for any reason or no reason, without liability on the part of either the buyer or seller. UP - Unit Power Sale: Designates a dedicated sale of energy and capacity from one or more than one specified generation unit(s). N/A: To be used only when the other available Class Names do not apply.

term_name

Contracts with durations of one year or greater are long-term (LT). Contracts with shorter durations are short-term (ST).

increment_name

5: Terms of the contract (if specifically noted in the contract) set for more than 0 minutes and less than or equal to 5 minutes (> 0 and ≤ 5 minutes). 15: Terms of the contract (if specifically noted in the contract) set for more than 5 minutes and less than or equal to 15 minutes (> 5 and ≤ 15 minutes). H: Terms of the contract (if specifically noted in the contract) set for more than 15 minutes and less than or equal to 6 hours (> 15 minutes and ≤ 6 hours). D: Terms of the contract (if specifically noted in the contract) set for more than 6 and up to 60 hours (> 6 and ≤ 60 hours). W: Terms of the contract (if specifically noted in the contract) set for over 60 hours and up to 168 hours (> 60 and ≤ 168 hours). M: Terms of the contract (if specifically noted in the contract) set for more than 168 hours up to, but not including, one year (> 168 hours and < 1 year). Y: Terms of the contract (if specifically noted in the contract) set for one year or more (≥ 1 year). N/A: Terms of the contract do not specify an increment.
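The increment boundaries above can be read as a chain of upper bounds. The sketch below maps a duration to the corresponding code, purely to make the thresholds concrete: in the data the code is reported by the filer from the contract terms, not computed. The function name increment_code and the treatment of one year as 365 days are assumptions for this example, and the N/A case (no increment specified) is not modeled.

```python
from datetime import timedelta

# Assumption for illustration: one year is taken to be 365 days.
ONE_YEAR = timedelta(days=365)


def increment_code(duration: timedelta) -> str:
    """Return the increment_name code whose range contains `duration`."""
    if duration <= timedelta(0):
        raise ValueError("duration must be positive")
    if duration <= timedelta(minutes=5):
        return "5"   # > 0 and <= 5 minutes
    if duration <= timedelta(minutes=15):
        return "15"  # > 5 and <= 15 minutes
    if duration <= timedelta(hours=6):
        return "H"   # > 15 minutes and <= 6 hours
    if duration <= timedelta(hours=60):
        return "D"   # > 6 and <= 60 hours
    if duration <= timedelta(hours=168):
        return "W"   # > 60 and <= 168 hours
    if duration < ONE_YEAR:
        return "M"   # > 168 hours and < 1 year
    return "Y"       # >= 1 year
```

Note that the upper bounds are inclusive for every code except M, which stops just short of one year.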

increment_peaking_name

FP: The product described may be sold during those hours designated as on-peak and off-peak at the point of delivery. OP: The product described may be sold only during those hours designated as off-peak at the point of delivery. P: The product described may be sold only during those hours designated as on-peak at the point of delivery. N/A: To be used only when the increment peaking name is not specified in the contract.

product_type_name

CB: Energy, capacity or ancillary services sold under a FERC-approved cost-based rate tariff. CR: An agreement under which a transmission provider sells, assigns or transfers all or portion of its rights to an eligible customer. CR-AD: Transmission capacity reassignments reported in Atlantic Daylight time. CR-AP: Transmission capacity reassignments reported in Atlantic Prevailing time. CR-AS: Transmission capacity reassignments reported in Atlantic Standard time. CR-CD: Transmission capacity reassignments reported in Central Daylight time. CR-CP: Transmission capacity reassignments reported in Central Prevailing time. CR-CS: Transmission capacity reassignments reported in Central Standard time. CR-ED: Transmission capacity reassignments reported in Eastern Daylight time. CR-EP: Transmission capacity reassignments reported in Eastern Prevailing time. CR-ES: Transmission capacity reassignments reported in Eastern Standard time. CR-MD: Transmission capacity reassignments reported in Mountain Daylight time. CR-MP: Transmission capacity reassignments reported in Mountain Prevailing time. CR-MS: Transmission capacity reassignments reported in Mountain Standard time. CR-PD: Transmission capacity reassignments reported in Pacific Daylight time. CR-PP: Transmission capacity reassignments reported in Pacific Prevailing time. CR-PS: Transmission capacity reassignments reported in Pacific Standard time. MB: Energy, capacity or ancillary services sold under the seller’s FERC-approved market-based rate tariff. T: The product is sold under a FERC-approved transmission tariff. NPU: The product is sold by a non-public utility that is required to file the EQR under section 220 of the Federal Power Act. OTHER: The product cannot be characterized by the other product type names.

product_name

Description of product being offered.

quantity

Quantity for the contract product identified.

units

Measure stated in the contract for the product sold.

rate

The charge for the product per unit as stated in the contract.

rate_minimum

Minimum rate to be charged per the contract, if a range is specified.

rate_maximum

Maximum rate to be charged per the contract, if a range is specified.

rate_description

Text description of rate. If the rate is currently available on the FERC website, a citation of the FERC Accession Number and the relevant FERC tariff including page number or section may be included instead of providing the entire rate algorithm. If the rate is not available on the FERC website, include the rate algorithm, if rate is calculated. If the algorithm would exceed the 300 character field limit, it may be provided in a descriptive summary (including bases and methods of calculations) with a detailed citation of the relevant FERC tariff including page number and section.

rate_units

Measure stated in the contract for the product sold. FERC EQR.

point_of_receipt_balancing_authority

The registered Balancing Authority (formerly called NERC Control Area) where service begins for a transmission or transmission-related jurisdictional sale. The Balancing Authority will be identified with the abbreviation used in OASIS applications. If receipt occurs at a trading hub, the term 'Hub' should be used.

point_of_receipt_specific_location

The specific location at which the product is received if designated in the contract. If receipt occurs at a trading hub, a standardized hub name must be used. If more points of receipt are listed in the contract than can fit into the 50 character space, a description of the collection of points may be used. 'Various' alone, is unacceptable unless the contract itself uses that terminology.

point_of_delivery_balancing_authority

The registered Balancing Authority (formerly called NERC Control Area) where a jurisdictional product is delivered and/or service ends for a transmission or transmission-related jurisdictional sale. The Balancing Authority will be identified with the abbreviation used in OASIS applications. If delivery occurs at the interconnection of two control areas, the control area that the product is entering should be used. If delivery occurs at a trading hub, the term 'Hub' should be used.

point_of_delivery_specific_location

The specific location at which the product is delivered if designated in the contract. If receipt occurs at a trading hub, a standardized hub name must be used.

begin_date

First date and time for the sale of the product at the rate specified.

end_date

Last date and time for the sale of the product at the rate specified.

core_ferceqr__quarterly_identity

package: pudl

Quarterly time series of individuals who filed FERC EQR for a company in a given quarter.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

FERC Form 920 -- Electric Quarterly Report (EQR)

Primary key:

This table has no primary key. The primary key ought to be ['year_quarter', 'company_id_ferc', 'filer_unique_id'], where filer_unique_id is an employee-level ID. However, a handful of companies have erroneously reported the same filer_unique_id for multiple employees, resulting in duplicate records. In other cases, there appear to be multiple filings in a given quarter for the same company and filer, resulting in additional duplicates. Thus, there is no reliable natural primary key for the identity table.
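One way to see why the candidate key fails is to count how often each key combination occurs. The following sketch does this for records held as dictionaries; duplicated_keys is a hypothetical helper, and the sample values in the usage note are invented for illustration.

```python
from collections import Counter

# The columns that "ought" to form the primary key, per the text above.
CANDIDATE_KEY = ("year_quarter", "company_id_ferc", "filer_unique_id")


def duplicated_keys(records, key_columns=CANDIDATE_KEY):
    """Return {key_tuple: count} for every key that appears more than once."""
    counts = Counter(
        tuple(record[column] for column in key_columns) for record in records
    )
    return {key: count for key, count in counts.items() if count > 1}
```

If two invented records share ("2020q1", "C000001", "FS1"), that tuple is returned with a count of 2. An empty result would mean the candidate key is unique within the records checked.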

Usage Warnings

  • This table is experimental and/or a work in progress and may change in the future.

  • FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.

Columns
year_quarter

Year-quarter corresponding to record. Formatted like YYYYq{1-4}.

company_id_ferc

The Company Identifier (CID) obtained through FERC's Company Registration system.

filer_unique_id

(Seller) An identifier (e.g., “FS1”, “FS2”) used to designate a record containing Seller identification information. One record for each seller company must be included in an EQR for a given quarter. (Agent) – An identifier (i.e., “FA1”) used to designate a record containing Agent identification information. One record with the FA1 identifier must be included in an EQR for a given quarter.

company_name

Name of company submitting SEC 10k filing.

contact_name

(Seller) – The name of the contact for the company authorized to make sales as indicated in the company’s FERC tariff(s) or that is required to file the EQR under section 220 of the Federal Power Act. (Agent) – Name of the person who prepared the filing.

contact_title

Title of utility contact 1.

contact_address

Street address for contact identified in contact_name.

contact_city

City for contact identified in contact_name.

contact_state

State for contact identified in contact_name.

contact_zip

Zip code for contact identified in contact_name.

contact_country_name

Country (USA, Canada, Mexico, or United Kingdom) for contact identified in contact_name.

contact_phone

Phone number for contact identified in contact_name.

contact_email

Email for contact identified in contact_name.

transactions_reported_to_index_price_publishers

Sellers should indicate whether they have reported their sales transactions to index price publisher(s). If they have, Sellers should indicate specifically which index publisher(s) in Field Number 73.

core_ferceqr__quarterly_index_pub

package: pudl

Quarterly time series of electricity market price indices that individual EQR filers reported transactions to.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

FERC Form 920 -- Electric Quarterly Report (EQR)

Primary key:

year_quarter, company_id_ferc, filer_unique_id

Usage Warnings

  • This table is experimental and/or a work in progress and may change in the future.

  • FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.

Columns
year_quarter

Year-quarter corresponding to record. Formatted like YYYYq{1-4}.

company_id_ferc

The Company Identifier (CID) obtained through FERC's Company Registration system.

filer_unique_id

(Seller) An identifier (e.g., “FS1”, “FS2”) used to designate a record containing Seller identification information. One record for each seller company must be included in an EQR for a given quarter. (Agent) – An identifier (i.e., “FA1”) used to designate a record containing Agent identification information. One record with the FA1 identifier must be included in an EQR for a given quarter.

seller_company_name

The name of the company that is authorized to make sales as indicated in the company’s FERC tariff(s) or that is required to file the EQR under section 220 of the Federal Power Act. This name must match the name provided as seller_company_name in the core_ferceqr__quarterly_identity table. There are a handful of cases in which this requirement is violated, so any joins between tables should rely on company_id_ferc, not the company names.

index_price_publisher_name

Name of index price publisher, which can be one of the following: AM - Argus Media, EIG - Energy Intelligence Group, Inc., IP - Intelligence Press, P - Platts, B - Bloomberg, PDX - Powerdex, SNL - SNL Energy

transactions_reported

Description of the types of transactions reported to the index publisher identified in this record.

core_ferceqr__transactions

package: pudl

Contains information about individual electricity market transactions that took place during a given reporting quarter. Reported by the seller.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

FERC Form 920 -- Electric Quarterly Report (EQR)

Primary key:

year_quarter, seller_company_id_ferc, transaction_unique_id

Usage Warnings

  • This table is experimental and/or a work in progress and may change in the future.

  • FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.

Columns
year_quarter

Year-quarter corresponding to record. Formatted like YYYYq{1-4}.

seller_company_id_ferc

The Company Identifier (CID) obtained through FERC's Company Registration system corresponding to the selling company.

transaction_unique_id

An identifier beginning with the letter “T” and followed by a number (e.g., “T1”, “T2”) used to designate a record containing transaction information. One record for each transaction record must be included in an EQR for a given quarter. A new transaction record must be used every time a price changes in a sale. Note that these IDs are only unique within a single seller_company_id_ferc and year_quarter.

seller_company_name

The name of the company that is authorized to make sales as indicated in the company’s FERC tariff(s) or that is required to file the EQR under section 220 of the Federal Power Act. This name must match the name provided as seller_company_name in the core_ferceqr__quarterly_identity table. There are a handful of cases in which this requirement is violated, so any joins between tables should rely on company_id_ferc, not the company names.
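Since company names do not always match across tables, a lookup keyed on the company ID is safer than one keyed on the name. The sketch below attaches the identity table's company_name to transaction records held as dictionaries. It assumes that seller_company_id_ferc here corresponds to company_id_ferc in core_ferceqr__quarterly_identity (both are described as the FERC CID); attach_identity_names and the output key identity_company_name are hypothetical names.

```python
def attach_identity_names(transactions, identities):
    """Add the identity table's company name to each transaction by ID.

    Matches on (year_quarter, company ID) rather than on company name.
    The identity table has no reliable primary key, so when several
    identity rows share an ID the first one seen is kept.
    """
    names = {}
    for row in identities:
        key = (row["year_quarter"], row["company_id_ferc"])
        names.setdefault(key, row["company_name"])

    return [
        {
            **transaction,
            "identity_company_name": names.get(
                (transaction["year_quarter"], transaction["seller_company_id_ferc"])
            ),
        }
        for transaction in transactions
    ]
```

Transactions with no matching identity row get None for identity_company_name, which makes unmatched IDs easy to find.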

customer_company_name

The name of the purchaser of contract products and services. Unlike the seller_company_name, this name is not guaranteed to match a name in the core_ferceqr__quarterly_identity table. In addition, the same customer company may appear with different names in different contracts and transactions, since this field is an unconstrained string chosen by the seller.

ferc_tariff_reference

The FERC tariff reference cites the document that specifies the terms and conditions under which a Seller is authorized to make transmission sales, power sales or sales of related jurisdictional services at cost-based rates or at market-based rates. If the sales are market-based, the tariff that is specified in the FERC order granting the Seller Market Based Rate Authority must be listed. If a non-public utility does not have a FERC Tariff Reference, it should enter “NPU” for the FERC Tariff Reference.

contract_service_agreement_id

Unique identifier given to each service agreement that can be used by the Seller to produce the agreement, if requested. The identifier may be the number assigned by FERC for those service agreements that have been filed with and accepted by the Commission, or it may be generated as part of an internal identification system.

seller_transaction_id

Unique reference number assigned by the Seller for each transaction. May only be unique in the context of the seller's internal record keeping. This is an unrestricted text field.

transaction_begin_date

First date and time the product is sold during the quarter.

transaction_end_date

Last date and time the product is sold during the quarter.

trade_date

The date upon which the parties made the legally binding agreement on the price of a transaction.

exchange_brokerage_service

If a broker service is used to consummate or effectuate a transaction, the term “Broker” shall be provided. If an exchange is used, the specific exchange that is used shall be selected from the Commission-provided list. Allowed values include BROKER, ICE, NODAL, and NYMEX.

type_of_rate

FIXED: A fixed charge per unit of consumption. No variables are used to determine this rate. FORMULA: A calculation of a rate based upon a formula that does not contain an electric index component. ELECTRIC INDEX: A calculation of a rate based upon an index or a formula that contains an electric index component. An electric index includes an index published by an index publisher such as those required to be listed in Field Number 73 or a price published by an RTO/ISO (e.g., PJM West or Illinois Hub). RTO/ISO: If the price is the result of an RTO/ISO market or the sale is made to the RTO/ISO.

timezone

A code representing the so-called 'time zone' in which the sale was made. However, these codes do not actually correspond to time zones. Rather, they indicate an offset from UTC, which changes according to daylight saving vs. standard time. The codes are as follows: AD: Atlantic Daylight Time (UTC-3), AP: Atlantic Prevailing Time, AS: Atlantic Standard Time (UTC-4), CD: Central Daylight Time (UTC-5), CP: Central Prevailing Time, CS: Central Standard Time (UTC-6), ED: Eastern Daylight Time (UTC-4), EP: Eastern Prevailing Time, ES: Eastern Standard Time (UTC-5), MD: Mountain Daylight Time (UTC-6), MP: Mountain Prevailing Time, MS: Mountain Standard Time (UTC-7), PD: Pacific Daylight Time (UTC-7), PP: Pacific Prevailing Time, PS: Pacific Standard Time (UTC-8), UT: Coordinated Universal Time (UTC+0).
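The code list above translates directly into a lookup of fixed UTC offsets. In this sketch the daylight and standard codes map to the offsets listed, while the "prevailing" codes return None because the list gives no single offset for them. The names FIXED_OFFSET_HOURS, PREVAILING_CODES, and fixed_offset are hypothetical.

```python
from datetime import timedelta, timezone

# Offsets in hours, copied from the code list above.
FIXED_OFFSET_HOURS = {
    "AD": -3, "AS": -4,
    "CD": -5, "CS": -6,
    "ED": -4, "ES": -5,
    "MD": -6, "MS": -7,
    "PD": -7, "PS": -8,
    "UT": 0,
}

# Prevailing-time codes have no single offset in the list above.
PREVAILING_CODES = {"AP", "CP", "EP", "MP", "PP"}


def fixed_offset(code: str):
    """Return a datetime.timezone for fixed-offset codes, else None.

    Raises KeyError for codes that are not in either set.
    """
    if code in PREVAILING_CODES:
        return None
    return timezone(timedelta(hours=FIXED_OFFSET_HOURS[code]))
```

Resolving a prevailing code to an actual offset would require knowing the date of the sale and the applicable daylight saving rules, which this sketch deliberately leaves out.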

class_name

F - Firm: For transmission sales, a service or product that always has priority over non-firm service. For power sales, a service or product that is not interruptible for economic reasons. NF - Non-firm: For transmission sales, a service that is reserved and/or scheduled on an as-available basis and is subject to curtailment or interruption at a lesser priority compared to Firm service. For an energy sale, a service or product for which delivery or receipt of the energy may be interrupted for any reason or no reason, without liability on the part of either the buyer or seller. UP - Unit Power Sale: Designates a dedicated sale of energy and capacity from one or more than one specified generation unit(s). N/A: To be used only when the other available Class Names do not apply.

term_name

Contracts with durations of one year or greater are long-term (LT). Contracts with shorter durations are short-term (ST).

increment_name

5: Terms of the contract (if specifically noted in the contract) set for more than 0 minutes and less than or equal to 5 minutes (> 0 and ≤ 5 minutes). 15: Terms of the contract (if specifically noted in the contract) set for more than 5 minutes and less than or equal to 15 minutes (> 5 and ≤ 15 minutes). H: Terms of the contract (if specifically noted in the contract) set for more than 15 minutes and less than or equal to 6 hours (> 15 minutes and ≤ 6 hours). D: Terms of the contract (if specifically noted in the contract) set for more than 6 and up to 60 hours (> 6 and ≤ 60 hours). W: Terms of the contract (if specifically noted in the contract) set for over 60 hours and up to 168 hours (> 60 and ≤ 168 hours). M: Terms of the contract (if specifically noted in the contract) set for more than 168 hours up to, but not including, one year (> 168 hours and < 1 year). Y: Terms of the contract (if specifically noted in the contract) set for one year or more (≥ 1 year). N/A: Terms of the contract do not specify an increment.

increment_peaking_name

FP: The product described may be sold during those hours designated as on-peak and off-peak at the point of delivery. OP: The product described may be sold only during those hours designated as off-peak at the point of delivery. P: The product described may be sold only during those hours designated as on-peak at the point of delivery. N/A: To be used only when the increment peaking name is not specified in the contract.

product_name

Description of product being offered.

rate_units

Measure stated in the contract for the product sold. FERC EQR.

point_of_delivery_balancing_authority

The registered Balancing Authority (formerly called NERC Control Area) where a jurisdictional product is delivered and/or service ends for a transmission or transmission-related jurisdictional sale. The Balancing Authority will be identified with the abbreviation used in OASIS applications. If delivery occurs at the interconnection of two control areas, the control area that the product is entering should be used. If delivery occurs at a trading hub, the term 'Hub' should be used.

point_of_delivery_specific_location

The specific location at which the product is delivered if designated in the contract. If receipt occurs at a trading hub, a standardized hub name must be used.

transaction_quantity

The quantity of the product in this transaction record.

price

Actual price charged for the product per unit. The price reported cannot be averaged or otherwise aggregated.

standardized_quantity

For product names energy, capacity, and booked out power only. Specify the quantity in MWh if the product is energy or booked out power and specify the quantity in MW-month if the product is capacity or booked out power.

standardized_price

For product names energy, capacity, and booked out power only. Specify the price in $/MWh if the product is energy or booked out power and specify the price in $/MW-month if the product is capacity or booked out power.

total_transmission_charge

Payments received for transmission services when explicitly identified.

total_transaction_charge

transaction_quantity * price + total_transmission_charge.
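
For illustration, the formula above can be written as a short function. The sample values are hypothetical, and the use of Decimal to avoid floating-point rounding in currency arithmetic is a design choice of this sketch, not something the source specifies.

```python
from decimal import Decimal


def total_transaction_charge(transaction_quantity, price, total_transmission_charge):
    """Apply the formula: transaction_quantity * price + total_transmission_charge."""
    return transaction_quantity * price + total_transmission_charge


# Hypothetical record: 100 units at 25.50 per unit plus a 40.00 transmission charge.
example_charge = total_transaction_charge(
    Decimal("100"), Decimal("25.50"), Decimal("40.00")
)
```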

core_gridpathratoolkit__assn_generator_aggregation_group

package: pudl

Association table providing connections between individual generator IDs and their corresponding aggregation group when creating aggregated profiles.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

GridPath Resource Adequacy Toolkit Data

Primary key:

plant_id_eia, generator_id

Usage Warnings

  • The data is no longer being collected or reported in this way.

Additional Details

This table defines which individual generator profiles are combined when creating aggregated capacity factors / generation profiles. Generator capacity is used to weight the contribution of each generator in the resulting aggregated profiles, and is made available in this table for convenience and legibility. The resulting aggregated profiles are stored in out_gridpathratoolkit__hourly_available_capacity_factor.

Columns
plant_id_eia

The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.

generator_id

Generator ID is usually numeric, but sometimes includes letters. Make sure you treat it as a string!
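
A brief sketch of why the string treatment matters, using hypothetical sample rows: reading the IDs as text preserves leading zeros and letters, while coercing them to integers merges distinct IDs and fails on alphanumeric ones.

```python
import csv
import io

# Hypothetical sample data; real generator IDs will differ.
raw = "plant_id_eia,generator_id\n3,0001\n3,1\n3,GT1\n"

# csv.DictReader returns every field as a string, so IDs are preserved as written.
rows = list(csv.DictReader(io.StringIO(raw)))
generator_ids = [row["generator_id"] for row in rows]


def as_int_or_none(value):
    """Show what integer coercion would do to an ID (for comparison only)."""
    try:
        return int(value)
    except ValueError:
        return None


# "0001" and "1" collapse to the same integer, and "GT1" cannot be converted.
coerced = [as_int_or_none(g) for g in generator_ids]
```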

aggregation_group

A label identifying a group of aggregated generator capacity factors.

capacity_mw

Total installed (nameplate) capacity, in megawatts.

include_generator

Every row in the aggregation table describes a single generator. Groups of rows with the same aggregation are combined using a capacity weighted average to produce an aggregate generation profile. A few generators are not included in that aggregation process. This column determines whether a generator is included.
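
The capacity-weighted average described above might look like the following sketch. The "profile" key holding each generator's capacity factors is a hypothetical stand-in for profile data stored elsewhere, and the function name is illustrative; this is not necessarily how the toolkit itself performs the aggregation.

```python
def aggregate_profile(generators):
    """Capacity-weighted average of generator profiles within one aggregation group.

    Each generator is a dict with capacity_mw, include_generator, and a
    hypothetical "profile" list of capacity factors of equal length.
    """
    included = [g for g in generators if g["include_generator"]]
    if not included:
        raise ValueError("no generators are included in this group")
    total_capacity = sum(g["capacity_mw"] for g in included)
    n_steps = len(included[0]["profile"])
    return [
        sum(g["capacity_mw"] * g["profile"][t] for g in included) / total_capacity
        for t in range(n_steps)
    ]


# Hypothetical group: the third generator is excluded from the aggregation.
group = [
    {"capacity_mw": 100.0, "include_generator": True, "profile": [0.5, 1.0]},
    {"capacity_mw": 300.0, "include_generator": True, "profile": [0.25, 0.5]},
    {"capacity_mw": 50.0, "include_generator": False, "profile": [1.0, 1.0]},
]
aggregated = aggregate_profile(group)
```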

core_nrelatb__yearly_projected_cost_performance

package: pudl

Annual time series of projections of costs and performance for various technologies.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

NREL Annual Technology Baseline (ATB) for Electricity

Primary key:

This table has no primary key. The fields technology_description_detail_1 and technology_description_detail_2 occasionally contain nulls, preventing this table from having a primary key.

Additional Details

This table includes NREL's projections for capital and operational costs as well as operational attributes for a variety of technologies for each of the NREL ATB scenarios. Each new year of ATB data includes projections for the next 25 years.

All of the NREL tables are related. If you want to understand the financial assumptions of a given scenario in this table, see core_nrelatb__yearly_projected_financial_cases. The other two core_nrelatb tables include attributes about technologies in these projections.

Columns
report_year

Four-digit year in which the data was reported.

model_case_nrelatb

NREL's financial assumption cases. There are two cases which affect project financial assumptions: R&D Only Case and Market + Policies Case. The R&D Only Case includes only projected R&D improvements, while the Market + Policies Case includes policy and tax incentives. https://atb.nrel.gov/electricity/2024/financial_cases_&_methods

model_tax_credit_case_nrelatb

NREL's tax credit assumption cases. There are two types of tax credits: production tax credit (PTC) and investment tax credit (ITC). For more detail, see: https://atb.nrel.gov/electricity/2024/financial_cases_&_methods

projection_year

The year of the projected value.

technology_description

High level description of the technology used by the generator to produce electricity.

cost_recovery_period_years

The period over which the initial capital investment to build a plant is recovered.

scenario_atb
technology_description_detail_1

Technology details indicate resource levels and specific technology subcategories.

technology_description_detail_2

Technology details indicate resource levels and specific technology subcategories.

capacity_factor

Fraction of potential generation that was actually reported for a plant part.

capex_per_kw

Capital cost (USD). Expenditures required to achieve commercial operation of the generation plant.

capex_overnight_per_kw

Capex if the plant could be constructed overnight (i.e., excluding construction period financing); includes on-site electrical equipment (e.g., switchyard), a nominal-distance spur line (<1 mi), and necessary upgrades at a transmission substation.

capex_overnight_additional_per_kw

Capex for retrofits if the plant could be constructed overnight (i.e., excluding construction period financing); includes on-site electrical equipment (e.g., switchyard), a nominal-distance spur line (<1 mi), and necessary upgrades at a transmission substation.

capex_grid_connection_per_kw

Overnight capital cost includes a nominal-distance spur line (<1 mi) for all technologies, and for offshore wind, it includes export cable and construction period transit costs for a 30-km distance from shore. Project-specific costs lines that are based on distance to existing transmission are not included. This only applies to offshore wind.

capex_construction_finance_factor

Portion of all-in capital cost associated with construction period financing. This factor is applied to an overnight capital cost to represent the financing costs incurred during the construction period.

fuel_cost_per_mwh

Fuel costs in USD/MWh. NREL-derived values using heat rates.

heat_rate_mmbtu_per_mwh

Fuel content per unit of electricity generated.

heat_rate_penalty

Heat rate penalty for retrofitting. This column is only populated for retrofit technologies. Values appear to be rates between 0.09 and 0.35.

levelized_cost_of_energy_per_mwh

Levelized cost of energy (LCOE) is a summary metric that combines the primary technology cost and performance parameters: capital expenditures, operations expenditures, and capacity factor.
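
As a rough sketch of how these parameters combine, the function below uses a commonly cited simplified LCOE form: annualized capital cost plus fixed O&M, spread over annual generation, plus variable O&M and fuel. Treat the formula and the function name as assumptions; it is not guaranteed to reproduce the values reported in this column. The fixed_charge_rate input corresponds to the column of that name in core_nrelatb__yearly_projected_financial_cases_by_scenario.

```python
HOURS_PER_YEAR = 8760


def simple_lcoe_per_mwh(
    fixed_charge_rate,
    capex_per_kw,
    opex_fixed_per_kw,
    capacity_factor,
    opex_variable_per_mwh=0.0,
    fuel_cost_per_mwh=0.0,
):
    """Simplified LCOE in USD/MWh (an assumed form, not NREL's exact method)."""
    if not 0 < capacity_factor <= 1:
        raise ValueError("capacity_factor must be in (0, 1]")
    # Annual fixed costs per kW of capacity.
    annual_fixed_per_kw = fixed_charge_rate * capex_per_kw + opex_fixed_per_kw
    # One kW running at the given capacity factor produces this many MWh per year.
    mwh_per_kw_year = capacity_factor * HOURS_PER_YEAR / 1000
    return annual_fixed_per_kw / mwh_per_kw_year + opex_variable_per_mwh + fuel_cost_per_mwh
```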

net_output_penalty

Net output penalty for retrofitting. This column is only populated for retrofit technologies. Values appear to be rates between -0.25 and -0.08.

opex_fixed_per_kw

Fixed operation and maintenance expenses. Annual expenditures to operate and maintain equipment that are not incurred on a per-unit-energy basis.

opex_variable_per_mwh

Operation and maintenance costs incurred on a per-unit-energy basis.

core_nrelatb__yearly_projected_financial_cases

package: pudl

Annual time series of financial assumptions for each model case (model_case_nrelatb), and sub-type of technology (technology_description).

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

NREL Annual Technology Baseline (ATB) for Electricity

Primary key:

report_year, model_case_nrelatb, projection_year, technology_description

Columns
report_year

Four-digit year in which the data was reported.

model_case_nrelatb

NREL's financial assumption cases. There are two cases which affect project financial assumptions: R&D Only Case and Market + Policies Case. The R&D Only Case includes only projected R&D improvements, while the Market + Policies Case includes policy and tax incentives. https://atb.nrel.gov/electricity/2024/financial_cases_&_methods

projection_year

The year of the projected value.

technology_description

High level description of the technology used by the generator to produce electricity.

inflation_rate

Rate of inflation. All dollar values are given in 2021 USD, using the Consumer Price Index for All Urban Consumers for dollar year conversions where the source year dollars do not match 2021.

interest_rate_during_construction_nominal

Also referred to as construction finance cost. Portion of all-in capital cost associated with construction period financing. It is a function of construction duration, capital fraction during construction, and interest during construction.

interest_rate_calculated_real

Calculated real interest rate.
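
The column description does not state how the real rate is calculated. One standard way to derive a real rate from a nominal rate and an inflation rate is the Fisher relation, sketched below as an assumption rather than as the documented method; the function name is hypothetical.

```python
def real_rate(nominal_rate, inflation_rate):
    """Fisher relation: (1 + nominal) / (1 + inflation) - 1.

    Assumed relationship between interest_rate_nominal, inflation_rate,
    and interest_rate_calculated_real; not confirmed by the source.
    """
    if inflation_rate <= -1:
        raise ValueError("inflation_rate must be greater than -1")
    return (1 + nominal_rate) / (1 + inflation_rate) - 1
```

For example, a 5 percent nominal rate with 2.5 percent inflation gives a real rate slightly below the naive difference of 2.5 percent.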

interest_rate_nominal

Nominal interest rate.

rate_of_return_on_equity_calculated_real

Calculated real rate of return on equity.

rate_of_return_on_equity_nominal

Nominal rate of return on equity.

tax_rate_federal_state

Combined federal and state tax rate. The R&D model_case_nrelatb holds tax and inflation rates constant at assumed long-term values: 21 percent federal tax rate, 6 percent state tax rate (though actual state tax rates vary), and 2.5 percent inflation rate. It excludes the effects of tax credits. The Market + Policy model_case_nrelatb applies federal tax credits and expires them as consistent with existing law and guidelines.

core_nrelatb__yearly_projected_financial_cases_by_scenario

package: pudl

Annual time series of additional financial assumptions for NREL ATB projections that also vary by technology innovation scenario (scenario_atb), tax credit case (model_tax_credit_case_nrelatb), and cost recovery period (cost_recovery_period_years).

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

NREL Annual Technology Baseline (ATB) for Electricity

Primary key:

This table has no primary key. There are a small number of records which have nulls in the cost_recovery_period_years column. Based on NREL's documentation, this seems to indicate that those records apply to any relevant cost_recovery_period_years. If those records were non-null, the primary keys of this table would be: ['report_year', 'model_case_nrelatb', 'model_tax_credit_case_nrelatb', 'projection_year', 'technology_description', 'scenario_atb', 'cost_recovery_period_years']

Columns
report_year

Four-digit year in which the data was reported.

model_case_nrelatb

NREL's financial assumption cases. There are two cases which affect project financial assumptions: R&D Only Case and Market + Policies Case. The R&D Only Case includes only projected R&D improvements, while the Market + Policies Case includes policy and tax incentives. https://atb.nrel.gov/electricity/2024/financial_cases_&_methods

model_tax_credit_case_nrelatb

NREL's tax credit assumption cases. There are two types of tax credits: production tax credit (PTC) and investment tax credit (ITC). For more detail, see: https://atb.nrel.gov/electricity/2024/financial_cases_&_methods

projection_year

The year of the projected value.

technology_description

High level description of the technology used by the generator to produce electricity.

scenario_atb
cost_recovery_period_years

The period over which the initial capital investment to build a plant is recovered.

capital_recovery_factor

Ratio of a constant annuity to the present value of receiving that annuity for a given length of time.
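
The definition above corresponds to the standard annuity formula, sketched here for a discount rate and a recovery period in years. Which rate the ATB uses as input (for example wacc_real) is an assumption of this sketch, and the function name is illustrative.

```python
def capital_recovery_factor(rate, n_years):
    """Standard capital recovery factor: rate / (1 - (1 + rate) ** -n_years).

    Returns the constant annual payment per unit of present value.
    """
    if n_years <= 0:
        raise ValueError("n_years must be positive")
    if rate == 0:
        # With no discounting, the investment is recovered in equal parts.
        return 1 / n_years
    return rate / (1 - (1 + rate) ** -n_years)
```

A quick sanity check is that discounting n_years payments of this size back to the present sums to 1.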

debt_fraction

Fraction of capital financed with debt; the remainder (1 - debt fraction) is assumed to be financed with equity. Also referred to as the leverage ratio.

fixed_charge_rate

Amount of revenue per dollar of investment required that must be collected annually from customers to pay the carrying charges on that investment.

wacc_nominal

Nominal weighted average cost of capital - average expected rate that is paid to finance assets.

wacc_real

Real weighted average cost of capital - average expected rate that is paid to finance assets.

core_nrelatb__yearly_technology_status

package: pudl

Annual time series of technology statuses, indicating whether technologies in the ATB scenarios are mature, and whether they are the default technologies.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

NREL Annual Technology Baseline (ATB) for Electricity

Primary key:

This table has no primary key.

Columns
report_year

Four-digit year in which the data was reported.

technology_description

High level description of the technology used by the generator to produce electricity.

technology_description_detail_1

Technology details indicate resource levels and specific technology subcategories.

technology_description_detail_2

Technology details indicate resource levels and specific technology subcategories.

is_technology_mature

Indicator of whether the technology is mature. Technologies are defined as mature if a representative plant is operating or under construction in the United States in the Base Year.

is_default

Indicator of whether the technology is default.

core_phmsagas__yearly_distribution_operators

package: pudl

Annual time series of distribution operator information.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

Pipelines and Hazardous Materials Safety Administration (PHMSA) Annual Natural Gas Report (Part A)

Primary key:

report_id, report_date, operator_id_phmsa

Additional Details

This table contains operator-level information, including office and headquarters locations.

Columns
report_id

Report number of the PHMSA Gas utility submission.

report_date

Date reported.

operator_id_phmsa

PHMSA unique operator ID. A value of zero represents an unknown operator ID.

operator_name_phmsa

PHMSA operator name.

office_street_address

Street address of an operator's office.

office_city

City where an operator's office is located.

office_county

County where an operator's office is located.

office_zip

ZIP code where an operator's office is located.

office_state

State where an operator's office is located.

headquarters_street_address

Street address for an operator's headquarters.

headquarters_city

City where an operator's headquarters are located.

headquarters_county

County where an operator's headquarters are located.

headquarters_state

State where an operator's headquarters are located.

headquarters_zip

ZIP code where an operator's headquarters are located.

additional_information

Any additional information which will assist in clarifying or classifying the reported data.

core_pudl__assn_eia_pudl_plants

package: pudl

Association table providing connections between EIA plant IDs and manually assigned PUDL plant IDs.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

The Public Utility Data Liberation (PUDL) Project

Primary key:

plant_id_eia

Usage Warnings

  • Contains information from multiple raw inputs.

Columns
plant_id_eia

The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.

plant_name_eia

Plant name.

plant_id_pudl

A manually assigned PUDL plant ID. May not be constant over time.

core_pudl__assn_eia_pudl_utilities

package: pudl

Association table providing connections between EIA utility IDs and manually assigned PUDL utility IDs.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

The Public Utility Data Liberation (PUDL) Project

Primary key:

utility_id_eia

Usage Warnings

  • Contains information from multiple raw inputs.

Columns
utility_id_eia

The EIA Utility Identification number.

utility_name_eia

The name of the utility.

utility_id_pudl

A manually assigned PUDL utility ID. May not be stable over time.

core_pudl__assn_ferc1_dbf_pudl_utilities

package: pudl

Association table providing connections between PUDL-assigned FERC1 utility IDs and the native FERC1 DBF utility IDs originally reported as respondent_id.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

The Public Utility Data Liberation (PUDL) Project

Primary key:

utility_id_ferc1_dbf

Usage Warnings

  • FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.

Columns
utility_id_ferc1

PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.

utility_id_ferc1_dbf

FERC-assigned respondent_id from DBF reporting years, identifying the reporting entity. Stable from year to year.

core_pudl__assn_ferc1_pudl_plants

package: pudl

Association table providing connections between FERC 1 plants and their manually assigned PUDL plant IDs.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

The Public Utility Data Liberation (PUDL) Project

Primary key:

utility_id_ferc1, plant_name_ferc1

Usage Warnings

  • FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.

Additional Details

FERC does not assign IDs to plants, so each FERC 1 plant is identified by a utility_id_ferc1 and a plant_name_ferc1.

Columns
utility_id_ferc1

PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.

plant_name_ferc1

Name of the plant, as reported to FERC. This is a freeform string, not guaranteed to be consistent across references to the same plant.

plant_id_pudl

A manually assigned PUDL plant ID. May not be constant over time.

core_pudl__assn_ferc1_pudl_utilities

package: pudl

Association table providing connections between PUDL utility IDs and PUDL-assigned FERC1 utility IDs.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

The Public Utility Data Liberation (PUDL) Project

Primary key:

utility_id_ferc1

Usage Warnings

  • FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.

Additional Details

This table maps two manually assigned utility IDs: a PUDL ID and a FERC1 ID. The PUDL IDs link EIA and FERC1 utilities. The PUDL FERC1 IDs link records from older DBF respondent IDs and new XBRL entity IDs via core_pudl__assn_ferc1_dbf_pudl_utilities and core_pudl__assn_ferc1_xbrl_pudl_utilities respectively. This table is generated from a table stored in the PUDL repository: src/package_data/glue/utility_id_pudl.csv
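
To illustrate the linkage described above, the sketch below joins the three association tables on utility_id_ferc1 using plain Python dictionaries. The sample rows, including the ID values and the utility name, are hypothetical, and the helper names are illustrative only.

```python
from collections import defaultdict

# Hypothetical sample rows; real IDs and names will differ.
utilities = [
    {"utility_id_ferc1": 1, "utility_name_ferc1": "Example Power Co", "utility_id_pudl": 42},
]
dbf_rows = [
    {"utility_id_ferc1": 1, "utility_id_ferc1_dbf": 17},
]
xbrl_rows = [
    {"utility_id_ferc1": 1, "utility_id_ferc1_xbrl": "C000001"},
]


def group_by_ferc1(rows, value_column):
    """Collect the values of value_column for each utility_id_ferc1."""
    grouped = defaultdict(list)
    for row in rows:
        grouped[row["utility_id_ferc1"]].append(row[value_column])
    return grouped


def build_crosswalk(utilities, dbf_rows, xbrl_rows):
    """Attach native DBF and XBRL IDs to each PUDL/FERC1 utility pairing."""
    dbf = group_by_ferc1(dbf_rows, "utility_id_ferc1_dbf")
    xbrl = group_by_ferc1(xbrl_rows, "utility_id_ferc1_xbrl")
    return [
        {
            "utility_id_pudl": u["utility_id_pudl"],
            "utility_id_ferc1": u["utility_id_ferc1"],
            "dbf_ids": dbf.get(u["utility_id_ferc1"], []),
            "xbrl_ids": xbrl.get(u["utility_id_ferc1"], []),
        }
        for u in utilities
    ]


crosswalk = build_crosswalk(utilities, dbf_rows, xbrl_rows)
```

Because utility_id_ferc1 is not expected to be stable from year to year, a crosswalk like this should be rebuilt from the current tables rather than cached across PUDL releases.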

Columns
utility_id_ferc1

PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.

utility_name_ferc1

Name of the responding utility, as it is reported in FERC Form 1. For human readability only.

utility_id_pudl

A manually assigned PUDL utility ID. May not be stable over time.

core_pudl__assn_ferc1_xbrl_pudl_utilities

package: pudl

Association table providing connections between PUDL-assigned FERC1 utility IDs and the native FERC1 XBRL utility IDs originally reported as entity_id.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

The Public Utility Data Liberation (PUDL) Project

Primary key:

utility_id_ferc1_xbrl

Usage Warnings

  • FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.

Columns
utility_id_ferc1

PUDL-assigned utility ID, identifying a FERC1 utility. This is an auto-incremented ID and is not expected to be stable from year to year.

utility_id_ferc1_xbrl

FERC-assigned entity_id from XBRL reporting years, identifying the reporting entity. Stable from year to year.

core_pudl__assn_ferc714_csv_pudl_respondents

package: pudl

Association table providing connections between PUDL-assigned FERC714 respondent IDs and native FERC714 CSV respondent IDs originally reported as respondent_id.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

The Public Utility Data Liberation (PUDL) Project

Primary key:

respondent_id_ferc714, respondent_id_ferc714_csv

Usage Warnings

  • Contains information from multiple raw inputs.

  • FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.

Columns
respondent_id_ferc714

PUDL-assigned ID identifying a respondent to FERC Form 714. This ID associates natively reported respondent IDs from the original CSV and XBRL data sources.

respondent_id_ferc714_csv

FERC Form 714 respondent ID from CSV reported data - published from years: 2006-2020. This ID is linked to the newer years of reported XBRL data through the PUDL-assigned respondent_id_ferc714 ID. This ID was originally reported as respondent_id. Note that this ID does not correspond to FERC respondent IDs from other forms.

core_pudl__assn_ferc714_pudl_respondents

package: pudl

Entity table containing static information about PUDL-derived FERC 714 respondent IDs.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

The Public Utility Data Liberation (PUDL) Project

Primary key:

respondent_id_ferc714

Usage Warnings

  • Contains information from multiple raw inputs.

  • FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.

Additional Details

These IDs are used to connect older CSV data which uses different respondent IDs than the newer XBRL entity IDs. These IDs are manually assigned when new FERC 714 data is integrated, and any newly found utilities are added to the list with a new ID. This table is read in from a CSV stored in the PUDL repository: src/pudl/package_data/glue/respondent_id_ferc714.xlsx

Columns
respondent_id_ferc714

PUDL-assigned ID identifying a respondent to FERC Form 714. This ID associates natively reported respondent IDs from the original CSV and XBRL data sources.

core_pudl__assn_ferc714_xbrl_pudl_respondents

package: pudl

Association table providing connections between PUDL-assigned FERC714 respondent IDs and native FERC714 XBRL respondent IDs originally reported as entity_id.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

The Public Utility Data Liberation (PUDL) Project

Primary key:

respondent_id_ferc714, respondent_id_ferc714_xbrl

Usage Warnings

  • Contains information from multiple raw inputs.

  • FERC data is notoriously difficult to extract cleanly, and often contains free-form strings, non-labeled total rows and lack of IDs. See Notable Irregularities for details.

Columns
respondent_id_ferc714

PUDL-assigned ID identifying a respondent to FERC Form 714. This ID associates natively reported respondent IDs from the original CSV and XBRL data sources.

respondent_id_ferc714_xbrl

FERC Form 714 respondent ID from XBRL reported data - published from years: 2021-present. This ID is linked to the older years of reported CSV data through the PUDL-assigned respondent_id_ferc714 ID. This ID was originally reported as entity_id. Note that this ID does not correspond to FERC respondent IDs from other forms.

core_pudl__assn_utilities_plants

package: pudl

Association table providing connections between PUDL utility IDs and PUDL plant IDs.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

The Public Utility Data Liberation (PUDL) Project

Primary key:

utility_id_pudl, plant_id_pudl

Additional Details

This table is read in from a spreadsheet stored in the PUDL repository: src/pudl/package_data/glue/pudl_id_mapping.xlsx

Columns
utility_id_pudl

A manually assigned PUDL utility ID. May not be stable over time.

plant_id_pudl

A manually assigned PUDL plant ID. May not be constant over time.

core_pudl__codes_data_maturities

package: pudl

Code table containing descriptions of categorical codes for maturity levels of data records. Some data sources report less-than-final data. PUDL sometimes includes this data, but use at your own risk.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

The Public Utility Data Liberation (PUDL) Project

Primary key:

code

Usage Warnings

  • Contains information from multiple raw inputs.

Columns
code

Originally reported short code.

description

Long human-readable description of the meaning of a code/label.

core_pudl__codes_datasources

package: pudl

Code table containing descriptions of categorical codes for the data sources that PUDL integrates.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

The Public Utility Data Liberation (PUDL) Project

Primary key:

datasource

Columns
datasource

Code identifying a dataset available within PUDL.

partitions

The data partitions used to generate this instance of the database.

doi

Unique digital object identifier of Zenodo archive.

pudl_version

The version of PUDL used to generate this database.

core_pudl__codes_imputation_reasons

package: pudl

Code table containing descriptions of categorical codes for the reasons a value might be flagged for imputation.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

The Public Utility Data Liberation (PUDL) Project

Primary key:

code

Additional Details

Any time a column is imputed, there should be a corresponding column which contains codes for every value in the column that was imputed.

Columns
code

Originally reported short code.

description

Long human-readable description of the meaning of a code/label.

core_pudl__codes_subdivisions

package: pudl

Code table containing descriptions of categorical codes for states, provinces, and other sub-national political subdivisions.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

The Public Utility Data Liberation (PUDL) Project

Primary key:

country_code, subdivision_code

Columns
country_code

Three letter ISO-3166 country code (e.g. USA or CAN).

country_name

Full country name (e.g. United States of America).

subdivision_code

Two-letter ISO-3166 political subdivision code (e.g. US state or Canadian province abbreviations like CA or AB).

subdivision_name

Full name of political subdivision (e.g. US state or Canadian province names like California or Alberta).

subdivision_type

ISO-3166 political subdivision type. E.g. state, province, outlying_area.

timezone_approx

IANA timezone name of the timezone which encompasses the largest portion of the population in the associated geographic area.

state_id_fips

Two digit state FIPS code.

division_name_us_census

Longer human readable name describing the US Census division.

division_code_us_census

Three-letter US Census division code as it appears in the bulk electricity data published by the EIA. Note that EIA splits the Pacific division into distinct contiguous (CA, OR, WA) and non-contiguous (AK, HI) states. For reference see this US Census region and division map: https://www2.census.gov/geo/pdfs/maps-data/maps/reference/us_regdiv.pdf

region_name_us_census

Human-readable name of a US Census region.

is_epacems_state

Indicates whether the associated state reports data within the EPA's Continuous Emissions Monitoring System.

core_pudl__entity_plants_pudl

package: pudl

Entity table containing static information about PUDL-assigned plant IDs.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

The Public Utility Data Liberation (PUDL) Project

Primary key:

plant_id_pudl

Additional Details

These IDs are manually generated each year when new FERC and EIA reporting is integrated, and any newly identified plants are added to the list with a new ID. Each ID maps to a power plant which is reported in at least one FERC or EIA data set. This table is read in from a spreadsheet stored in the PUDL repository: src/pudl/package_data/glue/pudl_id_mapping.xlsx

Columns
plant_id_pudl

A manually assigned PUDL plant ID. May not be constant over time.

plant_name_pudl

Plant name, chosen arbitrarily from the several possible plant names available in the plant matching process. Included for human readability only.

core_pudl__entity_utilities_pudl

package: pudl

Entity table containing static information about PUDL-assigned utility IDs.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

The Public Utility Data Liberation (PUDL) Project

Primary key:

utility_id_pudl

Additional Details

These IDs are manually generated each year when new FERC and EIA reporting is integrated, and any newly found utilities are added to the list with a new ID. Each ID maps to a power plant owning or operating entity which is reported in at least one FERC or EIA data set. This table is read in from a spreadsheet stored in the PUDL repository: src/pudl/package_data/glue/pudl_id_mapping.xlsx

Columns
utility_id_pudl

A manually assigned PUDL utility ID. May not be stable over time.

utility_name_pudl

Utility name, chosen arbitrarily from the several possible utility names available in the utility matching process. Included for human readability only.

core_sec10k__assn_exhibit_21_subsidiaries_and_eia_utilities

package: pudl

Association table providing connections between subsidiaries listed in Exhibit 21 and EIA utilities.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

U.S. Securities and Exchange Commission (SEC) Form 10-K

Primary key:

subsidiary_company_id_sec10k

Usage Warnings

  • PUDL does not currently update its copy of this data.

Additional Details

An Exhibit 21 subsidiary is considered matched to an EIA utility if their names are identical. Only subsidiaries that don't file SEC 10-K themselves are included in this table. SEC 10-K filers have much more information available and are matched to EIA utilities using probabilistic record linkage in core_sec10k__assn_sec10k_filers_and_eia_utilities.
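As a minimal sketch of the exact-name rule described above, the following pairs subsidiaries with utilities whose names are identical. The `match_by_name` helper and the example records are hypothetical illustrations, not PUDL code, and the actual pipeline may normalize names before comparing them.

```python
def match_by_name(subsidiaries, utilities):
    """Pair each subsidiary ID with the EIA utility ID that has an identical name."""
    # Index utilities by name so each subsidiary needs only one lookup.
    utility_id_by_name = {u["utility_name_eia"]: u["utility_id_eia"] for u in utilities}
    matches = {}
    for sub in subsidiaries:
        utility_id = utility_id_by_name.get(sub["subsidiary_company_name"])
        if utility_id is not None:
            matches[sub["subsidiary_company_id_sec10k"]] = utility_id
    return matches


# Hypothetical example records (not real data).
subsidiaries = [
    {"subsidiary_company_id_sec10k": "sub_a", "subsidiary_company_name": "example power co"},
    {"subsidiary_company_id_sec10k": "sub_b", "subsidiary_company_name": "unmatched holdings llc"},
]
utilities = [{"utility_id_eia": 111, "utility_name_eia": "example power co"}]

matches = match_by_name(subsidiaries, utilities)
```

Only `sub_a` is matched here; `sub_b` has no utility with an identical name and is left out.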

Columns
subsidiary_company_id_sec10k

PUDL-assigned ID for subsidiaries found in SEC 10-K Exhibit 21. The ID is created by concatenating the CIK of the company whose filing the subsidiary was found in, the subsidiary company's name, and location of incorporation. It is not guaranteed to be stable across different releases of PUDL and so should never be hard-coded in analyses.

utility_id_eia

The EIA Utility Identification number.

core_sec10k__assn_exhibit_21_subsidiaries_and_filers

package: pudl

Association table providing connections between subsidiaries listed in Exhibit 21 and their SEC central index key, if the subsidiary also files Form 10-K.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

U.S. Securities and Exchange Commission (SEC) Form 10-K

Primary key:

subsidiary_company_id_sec10k

Usage Warnings

  • PUDL does not currently update its copy of this data.

Additional Details

Exhibit 21 subsidiaries and SEC 10-K filers are considered matched if they have identical names and the same location of incorporation.

Columns
subsidiary_company_id_sec10k

PUDL-assigned ID for subsidiaries found in SEC 10-K Exhibit 21. The ID is created by concatenating the CIK of the company whose filing the subsidiary was found in, the subsidiary company's name, and location of incorporation. It is not guaranteed to be stable across different releases of PUDL and so should never be hard-coded in analyses.

central_index_key

Identifier of the company in SEC database.

core_sec10k__assn_sec10k_filers_and_eia_utilities

package: pudl

Association table providing connections between SEC 10-K filers and EIA utilities.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

U.S. Securities and Exchange Commission (SEC) Form 10-K

Primary key:

central_index_key, utility_id_eia

Usage Warnings

  • This data has been matched probabilistically using a machine learning model and contains some incompletions and errors. It should not be treated as ground truth data.

  • PUDL does not currently update its copy of this data.

Additional Details

SEC central index keys are matched to EIA utility IDs using probabilistic record linkage based on associated company information like company name, business and mailing addresses, and state of incorporation. The match between central_index_key and utility_id_eia is one to one and is not allowed to change over time. In cases where there were multiple candidate matches, the match with the highest probability is selected.
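The selection step can be sketched as follows. The source states only that the highest-probability candidate is chosen and that the result is one to one; the greedy ordering used here to enforce the one-to-one constraint is an assumption, and the candidate tuples are made-up values.

```python
def select_one_to_one(candidates):
    """Greedily keep the highest-probability pairs so that each
    central_index_key and each utility_id_eia appears at most once.

    candidates: iterable of (central_index_key, utility_id_eia, probability).
    """
    chosen = {}
    used_utilities = set()
    # Visit candidates from most to least probable.
    for cik, utility_id, _prob in sorted(candidates, key=lambda c: c[2], reverse=True):
        if cik in chosen or utility_id in used_utilities:
            continue  # One side of this pair is already matched.
        chosen[cik] = utility_id
        used_utilities.add(utility_id)
    return chosen


# Hypothetical candidate matches (not real data).
candidates = [
    ("0000000001", 10, 0.95),
    ("0000000001", 20, 0.60),
    ("0000000002", 10, 0.80),
    ("0000000002", 30, 0.70),
]
selected = select_one_to_one(candidates)
```

In this example utility 10 goes to the first filer, so the second filer falls back to its next-best candidate, utility 30.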

Columns
central_index_key

Identifier of the company in SEC database.

utility_id_eia

The EIA Utility Identification number.

core_sec10k__changelog_company_name

package: pudl

Changelog table tracking changes in the names each SEC 10-K filer has used.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

U.S. Securities and Exchange Commission (SEC) Form 10-K

Primary key:

This table has no primary key.

Usage Warnings

  • PUDL does not currently update its copy of this data.

Additional Details

This table is extracted from the same SEC 10-K filing header information as core_sec10k__quarterly_company_information. Each filing reports the full history of name changes associated with a company up to the date of that filing. Because individual companies may appear in multiple filings in the same year, and the same historical name changes will be reported in multiple years, the raw input data contains many duplicate entries, which are deduplicated to create this table. The original name change data only contains the former name and the date of the change.

Roughly 2% of all records describe multiple name changes happening on the same date (they are duplicates on the basis of central_index_key and name_change_date). This may be due to company name reporting inconsistencies or reporting errors in which the old and new company names have been swapped.
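One way to find the records described above is to count how often each (central_index_key, name_change_date) pair occurs. This is a sketch using only the Python standard library; the `find_same_day_changes` helper and the sample rows are hypothetical.

```python
from collections import Counter


def find_same_day_changes(records):
    """Return records that share central_index_key and name_change_date with another record."""
    counts = Counter((r["central_index_key"], r["name_change_date"]) for r in records)
    return [
        r for r in records
        if counts[(r["central_index_key"], r["name_change_date"])] > 1
    ]


# Hypothetical rows (not real data); the first two look like a swapped old/new name.
records = [
    {"central_index_key": "0000000001", "name_change_date": "2001-05-01",
     "company_name_old": "alpha corp", "company_name": "beta corp"},
    {"central_index_key": "0000000001", "name_change_date": "2001-05-01",
     "company_name_old": "beta corp", "company_name": "alpha corp"},
    {"central_index_key": "0000000002", "name_change_date": "2010-01-15",
     "company_name_old": "gamma inc", "company_name": "delta inc"},
]
same_day = find_same_day_changes(records)
```

Reviewing the flagged rows by hand is probably the safest way to decide which of the conflicting names to trust.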

Columns
central_index_key

Identifier of the company in SEC database.

name_change_date

Date of last name change of the company.

company_name_old

Name of company prior to name change.

company_name

Name of company submitting SEC 10-K filing.

core_sec10k__quarterly_company_information

package: pudl

Quarterly time series of company and filing information extracted from SEC 10-K filing headers.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

U.S. Securities and Exchange Commission (SEC) Form 10-K

Primary key:

filename_sec10k, central_index_key

Usage Warnings

  • PUDL does not currently update its copy of this data.

Additional Details

While the SEC 10-K filings are submitted by a single company, they often contain references to many other related companies. Information about these companies and the filing itself are contained in text headers at the beginning of the filing. This table contains data extracted from those headers. Each record in this table represents a single observation of a company in a single filing. The filer_count indicates which referenced company within a filing header the record corresponds to.

Because the same company may be referenced in a number of different filings submitted in the same reporting period or even on the same day, this table contains apparently duplicative records about many companies that may be distinguished only by the filename associated with the filing they appeared in and their filer count. Note that references to a particular company may not be perfectly consistent across all the filings in which it appears. The various company names, addresses, and other information associated with the company's unique and permanent central_index_key are later used as inputs into the probabilistic record linkage process.

Columns
filename_sec10k

Unique portion of the filename associated with the SEC 10-K filing in the EDGAR database. The full source URL can be reconstructed by prepending https://www.sec.gov/Archives/edgar/data/ and adding the .txt file type extension.
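The URL reconstruction described above can be sketched in a few lines. The helper name `sec10k_source_url` is hypothetical, and the filename used in the example is a placeholder rather than a real filing.

```python
EDGAR_ARCHIVE_PREFIX = "https://www.sec.gov/Archives/edgar/data/"


def sec10k_source_url(filename_sec10k: str) -> str:
    """Prepend the EDGAR archive prefix and append the .txt extension."""
    return f"{EDGAR_ARCHIVE_PREFIX}{filename_sec10k}.txt"


# Placeholder value, not a real filing.
url = sec10k_source_url("0000000/0000000000-00-000000")
```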

central_index_key

Identifier of the company in SEC database.

filer_count

A counter indicating which observation of company data within an SEC 10-K filing header the record pertains to.

company_name

Name of company submitting SEC 10-K filing.

fiscal_year_end

The end date of an SEC filing company's fiscal year, in MMDD format.
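A small sketch of splitting an MMDD value into month and day follows. It assumes the value is stored as a zero-padded four-character string, which the source does not state; `parse_fiscal_year_end` is a hypothetical helper.

```python
def parse_fiscal_year_end(mmdd: str) -> tuple[int, int]:
    """Split an MMDD string such as "1231" into (month, day)."""
    if len(mmdd) != 4 or not mmdd.isdigit():
        raise ValueError(f"expected four digits in MMDD format, got {mmdd!r}")
    month, day = int(mmdd[:2]), int(mmdd[2:])
    # Coarse range check only; it does not validate the day against the month.
    if not 1 <= month <= 12 or not 1 <= day <= 31:
        raise ValueError(f"month or day out of range in {mmdd!r}")
    return month, day


december_end = parse_fiscal_year_end("1231")
june_end = parse_fiscal_year_end("0630")
```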

taxpayer_id_irs

Taxpayer ID of the company with the IRS.

incorporation_state

Two letter state code where company is incorporated.

industry_name_sic

Text description of Standard Industrial Classification (SIC)

industry_id_sic

Four-digit Standard Industrial Classification (SIC) code identifying the company's primary industry. SIC codes have been replaced by NAICS codes in many applications, but are still used by the SEC. See e.g. https://www.osha.gov/data/sic-manual for code definitions.

film_number

Document control number used in the SEC EDGAR database. The first four digits can be used to access scans of the document in the SEC's Virtual Private Reference Room.

sec10k_type

Specific version of SEC 10-K that was filed. 10-k: the standard annual report. 10-k/a: an amended version of the annual report. 10-k405: filed to report insider trading that was not reported in a timely fashion. 10-k405/a: an amended version of the 10-k405. 10-kt: submitted in lieu of or in addition to a standard 10-K annual report when a company changes the end of its fiscal year (e.g. due to a merger) leaving the company with a longer or shorter reporting period. 10-kt/a: an amended version of the 10-kt. 10-ksb: the annual report for small businesses, also known as penny stocks. 10-ksb/a: an amended version of the 10-ksb.

sec_act

SEC Act through which the form was enacted, e.g. 1934 act.

filing_number_sec

Filing number used internally by the SEC to track the filing.

phone_number

Phone number for utility contact 1.

business_street_address

Street address of the company's place of business.

business_street_address_2

Second line of the street address of the company's place of business.

business_city

City where the company's place of business is located.

business_state

State where the company's place of business is located.

business_zip_code

Zip code of the company's place of business.

business_zip_code_4

Zip code suffix of the company's place of business.

business_postal_code

Non-US postal code of the company's place of business.

mail_street_address

Street portion of the company's mailing address.

mail_street_address_2

Second line of the street portion of the company's mailing address.

mail_city

City of the company's mailing address.

mail_state

State of the company's mailing address.

mail_zip_code

Zip code of the company's mailing address.

mail_zip_code_4

Zip code suffix of the company's mailing address.

mail_postal_code

Non-US postal code of the company's mailing address.

core_sec10k__quarterly_exhibit_21_company_ownership

package: pudl

Quarterly time series of subsidiary company ownership data from the SEC 10-K Exhibit 21 attachments.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

U.S. Securities and Exchange Commission (SEC) Form 10-K

Primary key:

filename_sec10k, subsidiary_company_id_sec10k

Usage Warnings

  • This data has been extracted probabilistically using a machine learning model and contains some incompletions and errors. It should not be treated as ground truth data.

  • PUDL does not currently update its copy of this data.

Additional Details

Exhibit 21 is an unstructured text or PDF attachment to the main SEC 10-K filing that is used to describe the subsidiaries owned by the filing company. It may or may not provide the percentage of the subsidiary that is owned by the filing company, or the location of the subsidiary.

Columns
filename_sec10k

Unique portion of the filename associated with the SEC 10-K filing in the EDGAR database. The full source URL can be reconstructed by prepending https://www.sec.gov/Archives/edgar/data/ and adding the .txt file type extension.

subsidiary_company_name

Name of subsidiary company.

subsidiary_company_location

Location of subsidiary company. This is the full US state name or country name and occasionally a two digit code that was not mapped to a full name during cleaning.

subsidiary_company_id_sec10k

PUDL-assigned ID for subsidiaries found in SEC 10-K Exhibit 21. The ID is created by concatenating the CIK of the company whose filing the subsidiary was found in, the subsidiary company's name, and location of incorporation. It is not guaranteed to be stable across different releases of PUDL and so should never be hard-coded in analyses.

fraction_owned

Proportion of the subsidiary company that is owned by the filing company.

core_sec10k__quarterly_filings

package: pudl

Quarterly time series of metadata describing SEC 10-K filings.

Processing:

Data has been cleaned and organized into well-modeled tables that serve as building blocks for downstream wide tables and analyses.

Source:

U.S. Securities and Exchange Commission (SEC) Form 10-K

Primary key:

filename_sec10k

Usage Warnings

  • PUDL does not currently update its copy of this data.

Additional Details

Each SEC 10-K filing is submitted by a single company, but may contain information about numerous other companies. This table indicates the company submitting the filing, as well as some information about the overall filing. Each filing is guaranteed to have a unique filename, but ~1% of all filings are one company submitting the same form multiple times on the same day, so the filename is the only available natural primary key.
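To illustrate why the filename is the only usable key, the sketch below checks whether a set of columns uniquely identifies each record. The `is_unique_key` helper and the sample filings are hypothetical.

```python
def is_unique_key(records, columns):
    """Return True if no two records share the same values in the given columns."""
    seen = set()
    for record in records:
        key = tuple(record[column] for column in columns)
        if key in seen:
            return False
        seen.add(key)
    return True


# Hypothetical filings (not real data): one company files the same form twice in a day.
filings = [
    {"filename_sec10k": "file_1", "central_index_key": "0000000001",
     "sec10k_type": "10-k", "filing_date": "2005-03-01"},
    {"filename_sec10k": "file_2", "central_index_key": "0000000001",
     "sec10k_type": "10-k", "filing_date": "2005-03-01"},
]

company_form_date_is_key = is_unique_key(
    filings, ["central_index_key", "sec10k_type", "filing_date"]
)
filename_is_key = is_unique_key(filings, ["filename_sec10k"])
```

Here the company, form type, and date together fail to separate the two filings, while the filename does.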

Columns
filename_sec10k

Unique portion of the filename associated with the SEC 10-K filing in the EDGAR database. The full source URL can be reconstructed by prepending https://www.sec.gov/Archives/edgar/data/ and adding the .txt file type extension.

central_index_key

Identifier of the company in SEC database.

company_name

Name of company submitting SEC 10-K filing.

sec10k_type

Specific version of SEC 10-K that was filed. 10-k: the standard annual report. 10-k/a: an amended version of the annual report. 10-k405: filed to report insider trading that was not reported in a timely fashion. 10-k405/a: an amended version of the 10-k405. 10-kt: submitted in lieu of or in addition to a standard 10-K annual report when a company changes the end of its fiscal year (e.g. due to a merger) leaving the company with a longer or shorter reporting period. 10-kt/a: an amended version of the 10-kt. 10-ksb: the annual report for small businesses, also known as penny stocks. 10-ksb/a: an amended version of the 10-ksb.

filing_date

Date on which the filing was submitted.

exhibit_21_version

Version of exhibit 21 submitted (if applicable).

report_date

Date reported.

_core_eia860__cooling_equipment

package: pudl

Information about cooling equipment at generation facilities.

Processing:

Data has been cleaned but not tidied/normalized. Published only temporarily and may be removed without notice.

Source:

EIA Form 860 -- Annual Electric Generator Report (Schedule 6D)

Primary key:

plant_id_eia, utility_id_eia, cooling_id_eia, report_date

Usage Warnings

  • This table has not been harvested with other EIA 923 or 860 data. The same variables present in this table may show up in other _core tables in other years.

Additional Details

A single plant can have several collections of cooling equipment, so each line in this table represents one cooling system at a given plant. You can determine how to link each set of cooling equipment to specific boilers in a plant via core_eia860__assn_boiler_cooling.

EIA-860 respondents are required to submit information about cooling equipment for plants over 100 MW - including steam, combined cycle, nuclear or solar thermal generators.

This table has been cleaned, but not tidied/normalized with other EIA 923 or 860 data. The same variables present in this table may show up in other tables and/or in other years. Once this table has been harvested, it will be converted into a core table.

Columns
report_date

Date reported.

plant_id_eia

The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.

plant_name_eia

Plant name.

cooling_id_eia

EIA Identification code for cooling system (if multiple cooling systems are not distinguished by separate IDs, the word 'PLANT' is listed to encompass the cooling system for the entire plant)

utility_id_eia

The EIA Utility Identification number.

utility_name_eia

The name of the utility.

county

County name.

state

Two letter US state abbreviation.

chlorine_equipment_cost

Actual installed cost for the existing chlorine discharge control system or the anticipated cost to bring the chlorine discharge control system into commercial operation

chlorine_equipment_operating_date

Actual or projected in-service date for chlorine discharge control structures and equipment

cooling_equipment_total_cost

Actual installed cost for the existing system or the anticipated cost to bring the total system into commercial operation

cooling_status_code

Operating status of cooling system

cooling_system_operating_date

The actual or projected in-service datetime of this cooling system

cooling_type_1

Type of cooling system

cooling_type_2

Type of cooling system

cooling_type_3

Type of cooling system

cooling_type_4

Type of cooling system

cooling_water_discharge

Name of river, lake, or water source that cooling water is discharged into

cooling_water_source

Name of river, lake, or water source that provides cooling water

intake_distance_shore_feet

Maximum distance from shore to intake

intake_distance_surface_feet

Average distance below water surface to intake

intake_rate_100pct_gallons_per_minute

Design cooling water flow rate at 100 percent load at intake

outlet_distance_shore_feet

Maximum distance from shore to outlet

outlet_distance_surface_feet

Average distance below water surface to outlet

percent_dry_cooling

Percent of cooling load served by dry cooling components

plant_summer_capacity_mw

The plant summer capacity associated with the operating generators at the plant

pond_cost

Actual installed cost for the existing cooling ponds or the anticipated cost to bring the cooling ponds into commercial operation

pond_operating_date

Cooling ponds actual or projected in-service date

pond_surface_area_acres

Total surface area of cooling pond

pond_volume_acre_feet

Total volume of water in cooling pond

power_requirement_mw

Maximum power requirement for cooling towers at 100 percent load

steam_plant_type_code

Code that describes types of steam plants from EIA 860. See steam_plant_types_eia table for more details.

tower_cost

Actual installed cost for the existing cooling towers or the anticipated cost to bring the cooling towers into commercial operation

tower_operating_date

Cooling towers actual or projected in-service date

tower_type_1

Types of cooling towers at this plant

tower_type_2

Types of cooling towers at this plant

tower_type_3

Types of cooling towers at this plant

tower_type_4

Types of cooling towers at this plant

tower_water_rate_100pct_gallons_per_minute

Maximum design rate of water flow at 100 percent load for the cooling towers

water_source_code

Type of cooling water source

water_source

Name of water source associated with the plant.

water_type_code

Type of cooling water

_core_eia860__fgd_equipment

package: pudl

Information about flue gas desulfurization equipment at generation facilities.

Processing:

Data has been cleaned but not tidied/normalized. Published only temporarily and may be removed without notice.

Source:

EIA Form 860 -- Annual Electric Generator Report (Schedule 6E)

Primary key:

plant_id_eia, so2_control_id_eia, report_date

Usage Warnings

  • This table has not been harvested with other EIA 923 or 860 data. The same variables present in this table may show up in other _core tables in other years.

Columns
report_date

Date reported.

plant_id_eia

The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.

so2_control_id_eia

Sulfur dioxide control identification number. This ID is not a unique identifier.

utility_id_eia

The EIA Utility Identification number.

utility_name_eia

The name of the utility.

state

Two letter US state abbreviation.

state_id_fips

Two digit state FIPS code.

county

County name.

county_id_fips

County ID from the Federal Information Processing Standard Publication 6-4.

fgd_operating_date

The actual or projected in-service datetime of this flue gas desulfurization system

fgd_operational_status_code

Operating status code for flue gas desulfurization equipment.

flue_gas_bypass_fgd

Indicates whether flue gas can bypass the flue gas desulfurization unit.

byproduct_recovery

Is saleable byproduct recovered by the unit?

sludge_pond

Indicates if there is a sludge pond associated with this unit.

sludge_pond_lined

Indicates whether the sludge pond is lined.

pond_landfill_requirements_acre_foot_per_year

Annual pond and land fill requirements for flue gas desulfurization equipment.

fgd_structure_cost

Actual installed costs for the existing systems or the anticipated costs of structures and equipment to bring a planned flue gas desulfurization system into commercial operation.

fgd_other_cost

Other actual installed costs for installation of a flue gas desulfurization unit or the anticipated other costs pertaining to the installation of a flue gas desulfurization unit.

sludge_disposal_cost

Actual installed costs for the existing sludge transport and disposal systems or the anticipated costs of sludge transport and disposal systems to bring a planned system into commercial operation.

total_fgd_equipment_cost

Total actual installed costs for the existing flue gas desulfurization unit or the anticipated costs to bring a planned flue gas desulfurization unit into commercial operation.

fgd_trains_100pct

Total number of flue gas desulfurization unit scrubber trains operated at 100 percent load.

fgd_trains_total

Total number of flue gas desulfurization unit scrubber trains.

flue_gas_entering_fgd_pct_of_total

Ratio of all flue gas that is entering the flue gas desulfurization unit.

flue_gas_exit_rate_cubic_feet_per_minute

Actual flue gas exit rate, in cubic feet per minute.

flue_gas_exit_temperature_fahrenheit

Flue gas exit temperature, in degrees Fahrenheit.

so2_emission_rate_lbs_per_hour

Sulfur dioxide emission rate when operating at 100 percent load (pounds per hour).

so2_equipment_type_1

Type of sulfur dioxide control equipment.

so2_equipment_type_2

Type of sulfur dioxide control equipment.

so2_equipment_type_3

Type of sulfur dioxide control equipment.

so2_equipment_type_4

Type of sulfur dioxide control equipment.

so2_removal_efficiency_design

Designed removal efficiency for sulfur dioxide when operating at 100 percent load. Reported at the nearest 0.1 percent by weight of gases removed from the flue gas.

specifications_of_coal_ash

Design fuel specifications for ash when burning coal or petroleum coke (nearest 0.1 percent by weight).

specifications_of_coal_sulfur

Design fuel specifications for sulfur when burning coal or petroleum coke (nearest 0.1 percent by weight).

sorbent_type_1

Type of sorbent used by this sulfur dioxide control equipment.

sorbent_type_2

Type of sorbent used by this sulfur dioxide control equipment.

sorbent_type_3

Type of sorbent used by this sulfur dioxide control equipment.

sorbent_type_4

Type of sorbent used by this sulfur dioxide control equipment.

fgd_manufacturer

Name of flue gas desulfurization equipment manufacturer.

fgd_manufacturer_code

Code corresponding to name of flue gas desulfurization equipment manufacturer.

steam_plant_type_code

Code that describes types of steam plants from EIA 860. See steam_plant_types_eia table for more details.

plant_summer_capacity_mw

The plant summer capacity associated with the operating generators at the plant

water_source

Name of water source associated with the plant.

data_maturity

Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.

_core_eia923__monthly_cooling_system_information

package: pudl

Monthly time series of cooling system information at generation facilities - mainly water volumes and temperatures.

Processing:

Data has been cleaned but not tidied/normalized. Published only temporarily and may be removed without notice.

Source:

EIA Form 923 -- Power Plant Operations Report (Schedule 8D)

Primary key:

plant_id_eia, report_date, cooling_id_eia

Usage Warnings

  • Some years use a slightly different data definition.

Additional Details

In 2008 and 2009, EIA only reports annual averages, but in later years all data is monthly.
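When working with this table, it may help to label each record by the reporting frequency implied by its year. The sketch below assumes report_date is available as a `datetime.date`; the `reporting_frequency` helper is hypothetical, and years before 2008 are rejected because the text above does not describe them.

```python
from datetime import date

ANNUAL_ONLY_YEARS = {2008, 2009}


def reporting_frequency(report_date: date) -> str:
    """Return "annual" for 2008-2009 records and "monthly" for later years."""
    if report_date.year in ANNUAL_ONLY_YEARS:
        return "annual"
    if report_date.year > max(ANNUAL_ONLY_YEARS):
        return "monthly"
    raise ValueError(f"no reporting frequency described for {report_date.year}")


early = reporting_frequency(date(2009, 1, 1))
later = reporting_frequency(date(2015, 7, 1))
```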

Note: This table has been cleaned, but not harvested with other EIA 923 or 860 data. The same variables present in this table may show up in other _core tables in other years. Once this table has been harvested, it will be removed from the PUDL database.

Columns
report_date

Date reported.

plant_id_eia

The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.

cooling_id_eia

EIA Identification code for cooling system (if multiple cooling systems are not distinguished by separate IDs, the word 'PLANT' is listed to encompass the cooling system for the entire plant)

cooling_status_code

Operating status of cooling system

cooling_type

Type of cooling system

monthly_total_cooling_hours_in_service

Total hours the system operated during the month

flow_rate_method

Calculation method for flow rates (actual or method of estimation)

temperature_method

Method for measurement of temperatures

annual_maximum_intake_summer_temperature_fahrenheit

Maximum cooling water temperature at intake during the summer

annual_maximum_intake_winter_temperature_fahrenheit

Maximum cooling water temperature at intake in winter

monthly_average_intake_temperature_fahrenheit

Average cooling water temperature at intake point

monthly_maximum_intake_temperature_fahrenheit

Maximum cooling water temperature at intake

annual_maximum_outlet_summer_temperature_fahrenheit

Maximum cooling water temperature at outlet in summer

annual_maximum_outlet_winter_temperature_fahrenheit

Maximum cooling water temperature at outlet in winter

monthly_average_discharge_temperature_fahrenheit

Average cooling water temperature at discharge point

monthly_maximum_discharge_temperature_fahrenheit

Maximum cooling water temperature at discharge

annual_average_consumption_rate_gallons_per_minute

Annual average consumption rate of cooling water

monthly_average_consumption_rate_gallons_per_minute

Monthly average consumption rate of cooling water

monthly_total_consumption_volume_gallons

Monthly volume of water consumed at consumption point (accurate to 0.1 million gal)

annual_average_discharge_rate_gallons_per_minute

Annual average discharge rate of cooling water

monthly_average_discharge_rate_gallons_per_minute

Monthly average discharge rate of cooling water

monthly_total_discharge_volume_gallons

Monthly volume of water discharged at discharge point (accurate to 0.1 million gal)

monthly_average_diversion_rate_gallons_per_minute

Monthly average diversion rate of cooling water

monthly_total_diversion_volume_gallons

Monthly volume of water diverted at diversion point (accurate to 0.1 million gal)

annual_average_withdrawal_rate_gallons_per_minute

Annual average withdrawal rate of cooling water

monthly_average_withdrawal_rate_gallons_per_minute

Monthly average withdrawal rate of cooling water

monthly_total_withdrawal_volume_gallons

Monthly volume of water withdrawn at withdrawal point (accurate to 0.1 million gal)

annual_total_chlorine_lbs

Amount of elemental chlorine added to cooling water annually. May be just the amount of chlorine-containing compound if schedule 9 is filled out.

monthly_total_chlorine_lbs

Amount of elemental chlorine added to cooling water monthly. May be just the amount of chlorine-containing compound if schedule 9 is filled out.

data_maturity

Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.

_core_eia923__yearly_byproduct_disposition

package: pudl

Annual time series of by-products reported by thermoelectric power plants with total steam turbine capacity of 100 megawatts or greater.

Processing:

Data has been cleaned but not tidied/normalized. Published only temporarily and may be removed without notice.

Source:

EIA Form 923 -- Power Plant Operations Report (Schedule 8A)

Primary key:

plant_id_eia, report_year, byproduct_description

Additional Details

All by-products other than steam sales are reported in thousand tons and are rounded to whole numbers. Steam sales are reported in MMBtu.

Columns
report_year

Four-digit year in which the data was reported.

plant_id_eia

The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.

byproduct_description

Description of combustion by-product.

byproduct_units

Reported unit of measure for combustion byproduct. MMBtu for steam, tons for all other byproducts.

disposal_landfill_units

Disposed by-products in landfill, to the nearest hundred tons or in MMBtu for steam sales.

disposal_offsite_units

Disposed by-products offsite, to the nearest hundred tons or in MMBtu for steam sales.

disposal_ponds_units

Disposed by-products in ponds, to the nearest hundred tons or in MMBtu for steam sales.

sold_units

Sold by-products, in tons (to the nearest 100 tons) or, for steam, in MMBtu.

stored_offsite_units

Stored by-products offsite, to the nearest hundred tons or in MMBtu for steam sales.

stored_onsite_units

Stored by-products onsite, to the nearest hundred tons or in MMBtu for steam sales.

used_offsite_units

Used offsite by-products, to the nearest hundred tons or in MMBtu for steam sales.

used_onsite_units

Used onsite by-products, to the nearest hundred tons or in MMBtu for steam sales.

total_disposal_units

Total by-product disposal, to the nearest hundred tons or in MMBtu for steam sales.

no_byproducts_to_report

Whether any combustion by-products were produced by a plant. 'Y' indicates no byproducts to report. The 'Y' and 'N' values do not align with expected values of reported byproducts. This column is messy and requires standardization.

data_maturity

Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.

_core_eia923__yearly_byproduct_expenses_and_revenues

package: pudl

Annual time series of financial information related to combustion by-products reported by thermoelectric power plants with total steam turbine capacity of 100 megawatts or greater and that produced combustion by-products during the reporting year.

Processing:

Data has been cleaned but not tidied/normalized. Published only temporarily and may be removed without notice.

Source:

EIA Form 923 -- Power Plant Operations Report (Schedule 8B)

Primary key:

plant_id_eia, report_year

Additional Details

Cost data must be entered for all entries on Schedule 8A. Financial information includes operational and maintenance expenditures (opex), capital expenditures (capex), and by-products sales revenue.

Columns
plant_id_eia

The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.

report_year

Four-digit year in which the data was reported.

data_maturity

Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.

capex_air_abatement

Cost of new structures and/or equipment purchased to reduce, monitor, or eliminate airborne pollutants.

capex_other_abatement

Other amortizable expenses and purchases of new structures and/or equipment when such purchases are not allocated to a particular unit or item.

capex_solid_waste

Cost of structures or equipment purchased to collect and dispose of objectionable solids or contained liquids.

capex_water_abatement

Cost of new structures and/or equipment purchased to reduce, monitor, or eliminate waterborne pollutants, including chlorine, phosphates, acids, bases, hydrocarbons, sewage, and other pollutants.

opex_bottom_ash_collection

Costs of materials and labor associated with the collection of bottom ash from all sources.

opex_bottom_ash_disposal

Costs of materials and labor associated with the disposal of bottom ash from all sources.

opex_bottom_ash_other

Other costs associated with the collection and disposal of bottom ash.

opex_fgd_byproduct_collection

Costs of materials and labor associated with the collection of sulfur by-product (flue gas desulfurization).

opex_fgd_byproduct_disposal

Costs of materials and labor associated with the disposal of sulfur by-product (flue gas desulfurization).

opex_fgd_byproduct_other

Other costs associated with the collection and disposal of sulfur by-product (flue gas desulfurization).

opex_fly_ash_collection

Costs of materials and labor associated with the collection of fly ash from all sources.

opex_fly_ash_disposal

Costs of materials and labor associated with the disposal of fly ash from all sources.

opex_fly_ash_other

Other costs associated with the collection and disposal of fly ash.

opex_other_abatement_collection

Abatement costs of by-product collection that are not allocated to a particular expenditure, e.g., costs of operating an environmental protection office.

opex_other_abatement_disposal

Abatement costs of by-product disposal that are not allocated to a particular expenditure.

opex_other_abatement_other

Other abatement costs that are not allocated to a particular expenditure.

opex_total_collection_abatement

Sum of abatement costs associated with by-product collection.

opex_total_disposal_abatement

Sum of abatement costs associated with by-product disposal.

opex_total_other_abatement

Sum of other abatement costs associated with the collection and disposal of byproducts.

opex_water_abatement_collection

Costs associated with the collection/abatement of water pollution, e.g., equipment operation and maintenance of pumps, pipes, and settling ponds.

opex_water_abatement_disposal

Costs associated with the disposal of water pollutants.

opex_water_abatement_other

Other abatement costs associated with water pollutants.

revenues_bottom_ash

Revenue from the sale of bottom ash by-product.

revenues_fgd_byproducts

Revenue from the sale of flue gas desulfurization by-product.

revenues_fly_ash

Revenue from the sale of fly ash by-product.

revenues_fly_bottom_ash_intermingled

Revenue from the sale of intermingled fly and bottom ash by-product.

revenues_other_byproducts

Revenue from the sale of other by-products.

revenues_total_byproduct

Total revenue from the sale of by-products.
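
The opex_total_* columns above are described as sums, so a rough consistency check is to add up the component columns and compare the result with the reported total. The sketch below assumes, which this page does not confirm, that opex_total_collection_abatement is the sum of every other opex_* column ending in _collection. The record values are invented for illustration.

```python
import math


def component_sum(record, suffix, total_column):
    """Sum every opex_* column ending in `suffix`, skipping the total and nulls."""
    return sum(
        value
        for name, value in record.items()
        if name.startswith("opex_")
        and name.endswith(suffix)
        and name != total_column
        and value is not None
    )


# Invented example record; real values come from the table itself.
record = {
    "plant_id_eia": 1,
    "report_year": 2020,
    "opex_bottom_ash_collection": 10.0,
    "opex_fgd_byproduct_collection": 20.0,
    "opex_fly_ash_collection": 30.0,
    "opex_other_abatement_collection": None,  # nulls are skipped
    "opex_water_abatement_collection": 5.0,
    "opex_total_collection_abatement": 65.0,
}

computed = component_sum(record, "_collection", "opex_total_collection_abatement")
matches = math.isclose(computed, record["opex_total_collection_abatement"])
print(computed, matches)
```

If the reported total and the computed sum disagree for many records, the assumption about which components make up the total is probably wrong and should be checked against the Form 923 instructions.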

_core_eia923__yearly_fgd_operation_maintenance

package: pudl

Annual time series of flue gas desulfurization (FGD) operation and maintenance expenses at generation facilities.

Processing:

Data has been cleaned but not tidied/normalized. Published only temporarily and may be removed without notice.

Source:

EIA Form 923 -- Power Plant Operations Report (Schedule 8C)

Primary key:

plant_id_eia, report_date, so2_control_id_eia

Usage Warnings

  • Some years use a slightly different data definition.

Additional Details

From 2008 to 2011, this table also reported operational characteristics that are now reported in 923 Air Emissions Control Info.

Note: This table has been cleaned, but not harvested with other EIA 923 or 860 data. The same variables present in this table may show up in other _core tables in other years. Once this table has been harvested, it will be removed from the PUDL database.

Columns
report_date

Date reported.

plant_id_eia

The unique six-digit facility identification number, also called an ORISPL, assigned by the Energy Information Administration.

so2_control_id_eia

Sulfur dioxide control identification number. This ID is not a unique identifier.

opex_fgd_feed_materials_chemical

Annual operation and maintenance expenditures for feed materials and chemicals for flue gas desulfurization equipment, excluding electricity.

opex_fgd_labor_supervision

Annual operation and maintenance expenditures for labor and supervision of flue gas desulfurization equipment, excluding electricity.

opex_fgd_land_acquisition

Annual operation and maintenance expenditures for land acquisition for flue gas desulfurization equipment, excluding electricity.

opex_fgd_maintenance_material_other

Annual operation and maintenance expenditures for maintenance, materials and all other costs of flue gas desulfurization equipment, excluding electricity.

opex_fgd_waste_disposal

Annual operation and maintenance expenditures for waste disposal, excluding electricity.

opex_fgd_total_cost

Annual total cost of operation and maintenance expenditures on flue gas desulfurization equipment, excluding electricity.

fgd_control_flag

Indicates whether or not a plant has a flue gas desulfurization control unit.

fgd_operational_status_code

Operating status code for flue gas desulfurization equipment.

fgd_hours_in_service

Number of hours the flue gas desulfurization equipment was in operation during the year.

fgd_electricity_consumption_mwh

Electric power consumed by the flue gas desulfurization unit (in MWh).

fgd_sorbent_consumption_tons

Quantity of flue gas desulfurization sorbent used, to the nearest 100 tons.

so2_removal_efficiency_tested

Removal efficiency for sulfur dioxide (to the nearest 0.1 percent by weight) at tested rate at 100 percent load.

so2_removal_efficiency_annual

Removal efficiency for sulfur dioxide (to the nearest 0.1 percent by weight) based on designed firing rate and hours in operation (listed as a percentage).

so2_test_date

Date of most recent test for sulfur dioxide removal efficiency.

data_maturity

Maturity of the source data published by EIA that is reflected in this record. EIA releases data incrementally over time, including monthly updates, annual year-to-date updates, provisional early releases of annual data, and final annual release data that is not expected to change further. Records sourced from multiple upstream EIA datasets may have no well defined data maturity. Records whose values have been inferred within PUDL will also have no data maturity.

_core_phmsagas__yearly_distribution_by_install_decade

package: pudl

Annual time series of miles of mains and the number of services in operation at the end of the year by install decade.

Processing:

Data has been cleaned but not tidied/normalized. Published only temporarily and may be removed without notice.

Source:

Pipelines and Hazardous Materials Safety Administration (PHMSA) Annual Natural Gas Report (Part B - System Description / Section 4)

Primary key:

report_id, report_date, operator_id_phmsa, operating_state, install_decade

Usage Warnings

  • This table has been concatenated across all years and re-organized into a logical structure, but the data has not been fully cleaned. Expect some inconsistent units, data types and values over the years of reported data. Once fully cleaned, this table will be deprecated and replaced with a core table.

  • Some columns contain subtotals; use caution when choosing columns to aggregate.

Additional Details

The records with an install decade of total_decade are a total - beware of aggregating these values.
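
To illustrate the warning above, here is a minimal sketch that drops the total_decade rows before summing mains_miles per report. The rows and the decade labels other than total_decade are invented for illustration; only the column names and the total_decade value come from this page.

```python
from collections import defaultdict

TOTAL_LABEL = "total_decade"  # value named in the table description


def mains_miles_by_report(rows):
    """Sum mains_miles per report_id, excluding the pre-computed total rows."""
    totals = defaultdict(float)
    for row in rows:
        if row["install_decade"] == TOTAL_LABEL:
            continue  # already a total; including it would double count
        if row["mains_miles"] is not None:
            totals[row["report_id"]] += row["mains_miles"]
    return dict(totals)


# Invented rows; the decade labels here are hypothetical.
rows = [
    {"report_id": 1, "install_decade": "1990s", "mains_miles": 10.0},
    {"report_id": 1, "install_decade": "2000s", "mains_miles": 15.0},
    {"report_id": 1, "install_decade": "total_decade", "mains_miles": 25.0},
]

result = mains_miles_by_report(rows)
naive = sum(row["mains_miles"] for row in rows)
print(result, naive)
```

Summing every row without the filter yields twice the real mileage in this example, which is the double counting the warning describes.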

Columns
report_id

Report number of the PHMSA Gas utility submission.

report_date

Date reported.

operator_id_phmsa

PHMSA unique operator ID. A value of zero represents an unknown operator ID.

commodity

The type of gas delivered by the distribution pipeline.

operating_state

State that the distribution utility is reporting for. Prior to 2004, this may be a list of states.

install_decade

The decade the distribution pipeline was installed.

mains_miles

The miles of mains distribution pipeline.

services

Number of services in the system at end of year.

_core_phmsagas__yearly_distribution_by_material

package: pudl

Annual time series of miles of mains and the number of services in operation at the end of the year by material for each gas distribution operator.

Processing:

Data has been cleaned but not tidied/normalized. Published only temporarily and may be removed without notice.

Source:

Pipelines and Hazardous Materials Safety Administration (PHMSA) Annual Natural Gas Report (Part B - System Description / Section 1 - General)

Primary key:

This table has no primary key. We expect the primary key for this table should be report_id, operator_id_phmsa, operating_state and material. However, there are nulls in the operating_state across several years of reporting.
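
One way to see why the expected key fails is to check a sample of rows for nulls and duplicates in the candidate key columns. The sketch below is a plain-Python illustration under that framing; the rows and the material values are invented, and only the column names come from this page.

```python
from collections import Counter

CANDIDATE_KEY = ("report_id", "operator_id_phmsa", "operating_state", "material")


def check_candidate_key(rows, key=CANDIDATE_KEY):
    """Return (rows with a null in any key column, key tuples seen more than once)."""
    with_nulls = [row for row in rows if any(row.get(col) is None for col in key)]
    counts = Counter(tuple(row.get(col) for col in key) for row in rows)
    duplicates = [k for k, n in counts.items() if n > 1]
    return with_nulls, duplicates


# Invented rows for illustration.
rows = [
    {"report_id": 1, "operator_id_phmsa": 100, "operating_state": "CO", "material": "steel"},
    {"report_id": 1, "operator_id_phmsa": 100, "operating_state": "CO", "material": "plastic"},
    {"report_id": 2, "operator_id_phmsa": 200, "operating_state": None, "material": "steel"},
]

with_nulls, duplicates = check_candidate_key(rows)
print(len(with_nulls), duplicates)
```

A non-empty with_nulls list is enough to rule the columns out as a primary key, even when no duplicates are found.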

Usage Warnings

  • This table has been concatenated across all years and re-organized into a logical structure, but the data has not been fully cleaned. Expect some inconsistent units, data types and values over the years of reported data. Once fully cleaned, this table will be deprecated and replaced with a core table.

  • Some columns contain subtotals; use caution when choosing columns to aggregate.

  • The categories of material types have changed slightly over the years (ex: cast and wrought iron were broken up in two categories before 1984).

  • Beginning in 2004, companies file one report per state. The operating_state column has not been normalized and may contain more than one state in earlier years of data.

Columns
report_id

Report number of the PHMSA Gas utility submission.

report_date

Date reported.

operator_id_phmsa

PHMSA unique operator ID. A value of zero represents an unknown operator ID.

commodity

The type of gas delivered by the distribution pipeline.

operating_state

State that the distribution utility is reporting for. Prior to 2004, this may be a list of states.

material

The material of the gas distribution pipe. The categories of material types have changed slightly over the years (ex: cast and wrought iron were broken up in two categories before 1984).

mains_miles

The miles of mains distribution pipeline.

services

Number of services in the system at end of year.

_core_phmsagas__yearly_distribution_by_material_and_size

package: pudl

Annual time series of miles of mains and the number of services in operation at the end of the year by material and size of pipe.

Processing:

Data has been cleaned but not tidied/normalized. Published only temporarily and may be removed without notice.

Source:

Pipelines and Hazardous Materials Safety Administration (PHMSA) Annual Natural Gas Report (Part B - System Description / Section 3)

Primary key:

This table has no primary key. We expect the primary key for this table should be report_id, operator_id_phmsa, operating_state, main_size and material. There are nulls in the operating_state across several years of reporting.

Usage Warnings

  • This table has been concatenated across all years and re-organized into a logical structure, but the data has not been fully cleaned. Expect some inconsistent units, data types and values over the years of reported data. Once fully cleaned, this table will be deprecated and replaced with a core table.

  • Some columns contain subtotals; use caution when choosing columns to aggregate.

  • The size ranges in main_size have changed slightly over the years (ex: before 1984 they reported 0.5_in_or_less whereas after they reported 1_in_or_less).

  • The categories of material types have changed slightly over the years (ex: cast and wrought iron were broken up in two categories before 1984).

  • Beginning in 2004, companies file one report per state. The operating_state column has not been normalized and may contain more than one state in earlier years of data.

Columns
report_date

Date reported.

report_id

Report number of the PHMSA Gas utility submission.

operator_id_phmsa

PHMSA unique operator ID. A value of zero represents an unknown operator ID.

commodity

The type of gas delivered by the distribution pipeline.

operating_state

State that the distribution utility is reporting for. Prior to 2004, this may be a list of states.

main_size

Size range of mains. The size ranges have changed slightly over the years (ex: before 1984 they reported 0.5_in_or_less whereas after they reported 1_in_or_less).

material

The material of the gas distribution pipe. The categories of material types have changed slightly over the years (ex: cast and wrought iron were broken up in two categories before 1984).

mains_miles

The miles of mains distribution pipeline.

services

Number of services in the system at end of year.

main_other_material_detail

A free-form text field containing notes about the other material type. This column should only contain values in it for rows with other as the material type listed.

_core_phmsagas__yearly_distribution_excavation_damages

package: pudl

Annual time series of excavation damages from various sources.

Processing:

Data has been cleaned but not tidied/normalized. Published only temporarily and may be removed without notice.

Source:

Pipelines and Hazardous Materials Safety Administration (PHMSA) Annual Natural Gas Report (Part D - Excavation Damage)

Primary key:

report_id, damage_type, damage_sub_type

Usage Warnings

  • This table has been concatenated across all years and re-organized into a logical structure, but the data has not been fully cleaned. Expect some inconsistent units, data types and values over the years of reported data. Once fully cleaned, this table will be deprecated and replaced with a core table.

  • Some columns contain subtotals; use caution when choosing columns to aggregate.

Columns
report_id

Report number of the PHMSA Gas utility submission.

report_date

Date reported.

operator_id_phmsa

PHMSA unique operator ID. A value of zero represents an unknown operator ID.

commodity

The type of gas delivered by the distribution pipeline.

operating_state

State that the distribution utility is reporting for. Prior to 2004, this may be a list of states.

damage_type

A high level category of excavation damage causes.

damage_sub_type

A sub-category of excavation damage causes within a damage_type.

damages

Number of instances of excavation damage.

_core_phmsagas__yearly_distribution_filings

package: pudl

Annual time series of filings (aka submissions) from gas distribution system operators.

Processing:

Data has been cleaned but not tidied/normalized. Published only temporarily and may be removed without notice.

Source:

Pipelines and Hazardous Materials Safety Administration (PHMSA) Annual Natural Gas Report

Primary key:

report_id, report_date, operator_id_phmsa

Usage Warnings

  • This table has been concatenated across all years and re-organized into a logical structure, but the data has not been fully cleaned. Expect some inconsistent units, data types and values over the years of reported data. Once fully cleaned, this table will be deprecated and replaced with a core table.

Additional Details

This table contains information about the filer and filing type. This includes information about who filed but also whether this was an original filing or a correction.

Columns
report_id

Report number of the PHMSA Gas utility submission.

operator_id_phmsa

PHMSA unique operator ID. A value of zero represents an unknown operator ID.

report_date

Date reported.

filing_date

Date on which the filing was submitted.

initial_filing_date

Initial date when filing was originally submitted.

filing_correction_date

Date when a correction filing was submitted.

report_filing_type

Type of report submitted, either Initial or Supplemental.

data_date

When the data source was last updated.

form_revision_id

PHMSA form revision identifier.

preparer_name

Name of representative who filed report.

preparer_title

Title of representative who filed report.

preparer_phone

Phone number of representative who filed report.

preparer_fax

Fax number of representative who filed report.

preparer_email

Email address of representative who filed report.

_core_phmsagas__yearly_distribution_leaks

package: pudl

Annual time series of total and hazardous leaks eliminated or repaired during the report year.

Processing:

Data has been cleaned but not tidied/normalized. Published only temporarily and may be removed without notice.

Source:

Pipelines and Hazardous Materials Safety Administration (PHMSA) Annual Natural Gas Report (Part C)

Primary key:

This table has no primary key. We expect the primary key for this table should be report_id, operator_id_phmsa, operating_state, leak_severity and leak_source. There are nulls in the operating_state across several years of reporting.

Usage Warnings

  • This table has been concatenated across all years and re-organized into a logical structure, but the data has not been fully cleaned. Expect some inconsistent units, data types and values over the years of reported data. Once fully cleaned, this table will be deprecated and replaced with a core table.

  • Some columns contain subtotals; use caution when choosing columns to aggregate.

  • Beginning in 2004, companies file one report per state. The operating_state column has not been normalized and may contain more than one state in earlier years of data.

Columns
report_id

Report number of the PHMSA Gas utility submission.

report_date

Date reported.

operator_id_phmsa

PHMSA unique operator ID. A value of zero represents an unknown operator ID.

commodity

The type of gas delivered by the distribution pipeline.

operating_state

State that the distribution utility is reporting for. Prior to 2004, this may be a list of states.

leak_severity

Whether the leaks described in this record are all leaks or only hazardous leaks.

leak_source

The cause of the leaks.

mains

The number of mains distribution pipeline.

services

Number of services in the system at end of year.

_core_phmsagas__yearly_distribution_misc

package: pudl

Annual time series of miscellaneous distribution information.

Processing:

Data has been cleaned but not tidied/normalized. Published only temporarily and may be removed without notice.

Source:

Pipelines and Hazardous Materials Safety Administration (PHMSA) Annual Natural Gas Report (Part B & C)

Primary key:

This table has no primary key. We expect the primary key for this table should be report_id, operator_id_phmsa, and operating_state. There are nulls in the operating_state across several years of reporting.

Usage Warnings

  • This table has been concatenated across all years and re-organized into a logical structure, but the data has not been fully cleaned. Expect some inconsistent units, data types and values over the years of reported data. Once fully cleaned, this table will be deprecated and replaced with a core table.

  • Beginning in 2004, companies file one report per state. The operating_state column has not been normalized and may contain more than one state in earlier years of data.
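
Because operating_state has not been normalized, it can help to flag rows that appear to list more than one state before grouping by state. The sketch below assumes states appear as two-letter uppercase abbreviations separated by punctuation or spaces; this page does not describe the actual formatting of multi-state values, so treat the pattern as a hypothetical starting point.

```python
import re

# Assumption: states are written as two-letter uppercase abbreviations.
STATE_TOKEN = re.compile(r"\b[A-Z]{2}\b")


def state_tokens(operating_state):
    """Return the state-like tokens found in a raw operating_state value."""
    if operating_state is None:
        return []
    return STATE_TOKEN.findall(operating_state)


def is_multi_state(operating_state):
    """True when the value appears to list more than one state."""
    return len(state_tokens(operating_state)) > 1


# Invented example values.
examples = ["CO", "CO, WY", "CO/WY/NE", None]
flags = [is_multi_state(value) for value in examples]
print(flags)
```

Rows flagged this way are candidates for manual review or exclusion rather than automatic splitting, since the reported quantities cannot be divided among states from this column alone.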

Columns
report_date

Date reported.

report_id

Report number of the PHMSA Gas utility submission.

operator_id_phmsa

PHMSA unique operator ID. A value of zero represents an unknown operator ID.

operating_state

State that the distribution utility is reporting for. Prior to 2004, this may be a list of states.

all_known_leaks_scheduled_for_repair

The number of known system leaks at the end of the report year scheduled for repair.

all_known_leaks_scheduled_for_repair_main

The number of known leaks on main at the end of the report year scheduled for repair.

hazardous_leaks_mechanical_joint_failure

The total number of hazardous leaks caused by a mechanical joint failure.

federal_land_leaks_repaired_or_scheduled

Total number of leaks repaired, eliminated, or scheduled for repair on federal land during the reporting year.

average_service_length_feet

The average system service length in feet.

services_efv_in_system

Estimated number of services with Excess Flow Valve in the system at end of reported year related to natural gas distribution.

services_efv_installed

Total number of services with Excess Flow Valve installed during reported year related to natural gas distribution.

services_shutoff_valve_in_system

Estimated number of services with manual service line shut-off valves installed in the system at end of report year related to natural gas distribution.

services_shutoff_valve_installed

Total number of manual service line shut-off valves installed during reported year related to natural gas distribution.

unaccounted_for_gas_fraction

Unaccounted-for gas as a fraction of total consumption for the 12 months ending June 30 of the reporting year. It is calculated by taking (purchased gas + produced gas) minus (customer use + company use + appropriate adjustments), then dividing by (customer use + company use + appropriate adjustments). Prior to 2017, this field was calculated with a different denominator (purchased gas + produced gas). For 2010-2017, this different calculation method ensured that there were no records with a negative fraction. For all other reporting years there are known and expected negative values in this column.

excavation_tickets

Number of Excavation Tickets received by the operator during the year, (i.e., receipt of information by the operator from the notification center).
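
The unaccounted_for_gas_fraction calculation described above can be sketched as a small function. The parameter names and the legacy_denominator flag are hypothetical, chosen here for illustration; the two denominators follow the column description, and the input volumes are invented.

```python
import math


def unaccounted_for_gas_fraction(
    purchased, produced, customer_use, company_use, adjustments,
    legacy_denominator=False,
):
    """Compute the fraction as described for the column.

    Numerator: (purchased + produced) - (customer use + company use + adjustments).
    Denominator: total use by default, or total supply when
    legacy_denominator=True (the earlier method noted in the description).
    """
    supply = purchased + produced
    use = customer_use + company_use + adjustments
    denominator = supply if legacy_denominator else use
    if denominator == 0:
        return None  # fraction is undefined
    return (supply - use) / denominator


# Invented volumes, all in the same unit.
current = unaccounted_for_gas_fraction(900.0, 100.0, 940.0, 10.0, 0.0)
legacy = unaccounted_for_gas_fraction(
    900.0, 100.0, 940.0, 10.0, 0.0, legacy_denominator=True
)
print(current, legacy)
```

With these inputs the numerator is 50 in both cases, and only the denominator differs (950 for the current method, 1000 for the earlier one), so values from the two periods are not directly comparable.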
