Data Dictionary

Documentation of all data fields, enums, and methodology notes for PoweredByWho data exports.

studies.csv

Each row represents one published opinion research study or poll.

Field                  Type            Description
id                     uuid            Unique identifier for the study.
title                  text            Title of the study or poll.
source_url             url             Link to the original source document.
archive_url            url | null      Link to an archived copy (e.g. Wayback Machine).
pollster               text | null     Organization that conducted the fieldwork.
sponsor                text | null     Organization that commissioned or funded the study.
sponsor_category       enum            Category of the sponsor. See Sponsor Category enum.
field_start            date | null     Date fieldwork began (YYYY-MM-DD).
field_end              date | null     Date fieldwork ended (YYYY-MM-DD).
publish_date           date | null     Date the study was published.
geography_state        text | null     Two-letter U.S. state code (e.g. VA, TX).
geography_county       text | null     County name, if study covers a specific county.
geography_city         text | null     City name, if study covers a specific city.
geography_other        text | null     Free-text geography description for non-standard areas.
mode                   text | null     Survey mode (e.g. online, phone, mixed).
population             text | null     Target population (e.g. registered voters, adults, likely voters).
sample_n               integer | null  Total sample size.
moe                    float | null    Margin of error as a percentage (e.g. 3.5 means +/- 3.5%).
extraction_confidence  enum            Confidence in the accuracy of extracted data. See Extraction Confidence enum.
review_status          enum            Review workflow status. Only 'published' items appear in public exports.
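
The nullable and typed columns above can be converted on load. The following is a minimal sketch using only the Python standard library. It assumes that null values are exported as empty cells, which this dictionary does not state, and the sample row is invented for illustration. The helper names nullable and parse_study are hypothetical.

```python
import csv
import io
from datetime import date

# Invented sample row for illustration only; not real PoweredByWho data.
SAMPLE = """id,title,field_start,field_end,sample_n,moe
00000000-0000-0000-0000-000000000001,Example poll,2025-01-10,,800,3.5
"""


def nullable(value, convert):
    """Return None for an empty cell (assumed null encoding), else convert(value)."""
    return convert(value) if value != "" else None


def parse_study(row):
    """Convert the date and numeric columns of one studies.csv row."""
    return {
        **row,
        "field_start": nullable(row["field_start"], date.fromisoformat),
        "field_end": nullable(row["field_end"], date.fromisoformat),
        "sample_n": nullable(row["sample_n"], int),
        "moe": nullable(row["moe"], float),
    }


studies = [parse_study(r) for r in csv.DictReader(io.StringIO(SAMPLE))]
```

To read a real export, replace io.StringIO(SAMPLE) with an open file handle for studies.csv.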

questions.csv

Each row represents one question extracted from a study. Join to studies on study_id.

Field                   Type         Description
id                      uuid         Unique identifier for the question.
study_id                uuid         Foreign key to the parent study.
question_text_verbatim  text         Exact question wording as it appeared in the source.
question_type           enum         Type of question. See Question Type enum.
topic_tags              text[]       Array of topic tags. See Topic Tags list.
cluster_id              uuid | null  Foreign key to a question cluster for cross-study comparison.
sort_order              integer      Order of the question within the study.
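
The join from questions to studies on study_id can be sketched as follows. This is an illustrative example, not the project's own tooling. The rows are invented, and short placeholder ids stand in for real uuids. The function name join_questions_to_studies is hypothetical.

```python
import csv
import io

# Invented rows for illustration; real ids are uuids.
STUDIES_CSV = """id,title,pollster
s1,Example county poll,Example Research
"""
QUESTIONS_CSV = """id,study_id,question_text_verbatim,question_type,sort_order
q2,s1,Second question?,concern,2
q1,s1,First question?,support_oppose,1
"""


def join_questions_to_studies(questions, studies):
    """Attach the parent study's title and pollster to each question row."""
    studies_by_id = {s["id"]: s for s in studies}
    joined = []
    for q in questions:
        study = studies_by_id.get(q["study_id"])
        if study is None:
            continue  # skip questions whose parent study is not in the export
        joined.append({**q, "study_title": study["title"], "pollster": study["pollster"]})
    # Restore the within-study question order given by sort_order.
    joined.sort(key=lambda r: (r["study_id"], int(r["sort_order"])))
    return joined


studies = list(csv.DictReader(io.StringIO(STUDIES_CSV)))
questions = list(csv.DictReader(io.StringIO(QUESTIONS_CSV)))
joined = join_questions_to_studies(questions, studies)
```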

results.csv

Each row represents one response option for a question within one subgroup. The file includes both topline ("Overall") and crosstab results. Join to questions on question_id.

Field            Type            Description
id               uuid            Unique identifier for the result row.
question_id      uuid            Foreign key to the parent question.
subgroup_label   text            Subgroup label. 'Overall' for topline results, otherwise describes the crosstab subgroup.
response_option  text            The response choice (e.g. 'Support', 'Oppose', 'Strongly support').
pct_value        float | null    Percentage value (e.g. 52.3 means 52.3%).
base_n           integer | null  Number of respondents for this subgroup.
sort_order       integer         Order of the result within its question and subgroup.
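
Because topline and crosstab rows share one file, a common first step is to pull only the 'Overall' rows for a question. The sketch below shows one way to do that, assuming empty cells encode null. The rows are invented, and the function name topline is hypothetical.

```python
import csv
import io

# Invented rows for illustration only.
RESULTS_CSV = """id,question_id,subgroup_label,response_option,pct_value,base_n,sort_order
r3,q1,Age 18-34,Support,61.0,210,1
r2,q1,Overall,Oppose,40.5,800,2
r1,q1,Overall,Support,52.3,800,1
r4,q1,Overall,Not sure,,800,3
"""


def topline(results, question_id):
    """Return (response_option, pct_value) pairs for the 'Overall' subgroup, in display order."""
    rows = [
        r for r in results
        if r["question_id"] == question_id and r["subgroup_label"] == "Overall"
    ]
    rows.sort(key=lambda r: int(r["sort_order"]))
    # pct_value is nullable; an empty cell is treated as None.
    return [
        (r["response_option"], float(r["pct_value"]) if r["pct_value"] else None)
        for r in rows
    ]


results = list(csv.DictReader(io.StringIO(RESULTS_CSV)))
overall = topline(results, "q1")
```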

projects.csv

Each row represents one data center project tracked by PoweredByWho.

Field                          Type          Description
id                             uuid          Unique identifier for the project.
canonical_name                 text          Project name used for display.
developer                      text | null   Company developing the data center.
operator                       text | null   Company that will operate the data center, if different from developer.
status_current                 enum          Current project status. See Project Status enum.
status_current_date            date | null   Date when the current status was last confirmed.
state                          text          Two-letter U.S. state code.
county                         text | null   County name.
city                           text | null   City name.
lat                            float | null  Latitude coordinate.
lng                            float | null  Longitude coordinate.
geography_precision            enum          Precision of the coordinates. See Geography Precision enum.
capacity_mw                    float | null  Planned or actual power capacity in megawatts.
capacity_mw_source_confidence  enum | null   Confidence in the MW figure. See MW Confidence enum.
source_url                     url | null    Link to the primary source for this project.
archive_url                    url | null    Archived copy of the source.

project_events.csv

Timeline events associated with projects. Join to projects on project_id.

Field                  Type         Description
id                     uuid         Unique identifier for the event.
project_id             uuid         Foreign key to the parent project.
event_type             text         Type of event (e.g. 'permit_filed', 'public_hearing', 'construction_start').
event_date             date | null  Date of the event.
source_url             url | null   Link to the source for this event.
archive_url            url | null   Archived copy of the event source.
extraction_confidence  enum         Confidence in the extracted event data.
notes_verbatim         text | null  Verbatim notes extracted from the source.
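
Since event_date is nullable, building a per-project timeline requires a decision about undated events. The sketch below groups events by project_id and sorts dated events chronologically, with undated events last. It assumes dates use the same YYYY-MM-DD format as studies.csv and that empty cells encode null. The rows are invented, and the function name timelines is hypothetical.

```python
import csv
import io
from collections import defaultdict
from datetime import date

# Invented rows for illustration only.
EVENTS_CSV = """id,project_id,event_type,event_date
e1,p1,public_hearing,2025-03-02
e2,p1,permit_filed,2025-01-15
e3,p1,construction_start,
"""


def timelines(events):
    """Map each project_id to its (event_date, event_type) pairs in timeline order."""
    grouped = defaultdict(list)
    for e in events:
        when = date.fromisoformat(e["event_date"]) if e["event_date"] else None
        grouped[e["project_id"]].append((when, e["event_type"]))
    for items in grouped.values():
        # Dated events first in chronological order; undated events go last.
        items.sort(key=lambda item: (item[0] is None, item[0] or date.max))
    return dict(grouped)


events = list(csv.DictReader(io.StringIO(EVENTS_CSV)))
by_project = timelines(events)
```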

Enums & Allowed Values

Sponsor Category

Value       Label       Description
industry    Industry    Data center companies, tech firms, or industry trade groups.
advocacy    Advocacy    Issue advocacy organizations, community groups, or political organizations.
media       Media       News organizations that commission polls.
academic    Academic    Universities or research institutions.
government  Government  Government agencies or officials.
unknown     Unknown     Sponsor could not be determined from available information.

Question Type

Value           Label           Description
support_oppose  Support/Oppose  Questions asking whether respondents support or oppose a project or policy.
favorability    Favorability    Questions asking about favorable/unfavorable views.
concern         Concern         Questions measuring level of concern about impacts.
tradeoff        Tradeoff        Questions presenting tradeoffs (e.g. jobs vs. environmental impact).
awareness       Awareness       Questions measuring awareness of projects or issues.
other           Other           Questions that do not fit other categories.

Project Status

Value               Label                     Description
announced           Announced                 Project has been publicly announced but no permits filed.
proposed            Proposed / In Permitting  Project is in the permitting or planning process.
under_construction  Under Construction        Construction is underway.
operational         Operational               Data center is operating.
canceled            Canceled                  Project has been canceled or abandoned.
on_hold             On Hold                   Project is paused or delayed.

Extraction Confidence

Value   Label   Description
high    High    Data was clearly stated in the source and extraction is very likely correct.
medium  Medium  Data required some interpretation but is likely correct.
low     Low     Data was ambiguous or required significant interpretation.

Geography Precision

Value   Label            Description
site    Site Address     Coordinates are for a specific site address.
city    City Centroid    Coordinates are approximate, placed at the city center.
county  County Centroid  Coordinates are approximate, placed at the county center.

Topic Tags

Questions may have zero or more of the following topic tags:

Energy, Water, Noise, Jobs/Taxes, Zoning, Environment, Grid, Electricity Costs, Economic Impact, Incentives, Moratorium, Regulation, AI Infrastructure, Community Impact, Property Values, Traffic

Caveats & Methodology

Coverage

PoweredByWho does not claim to be a comprehensive database of all data center opinion research. Coverage is limited to studies and projects discovered through a curated allowlist of public sources. New sources are added over time.

Data Extraction

Study data is extracted from source documents using a combination of automated processing and human review. The extraction_confidence field indicates the reliability of the extraction. All items are reviewed before being published.
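
Analyses that need stricter reliability can filter on extraction_confidence. The sketch below ranks the three documented values and keeps rows at or above a chosen level. The rows are invented, and the names CONFIDENCE_RANK and at_least are hypothetical.

```python
# Ordering of the documented Extraction Confidence values, lowest to highest.
CONFIDENCE_RANK = {"low": 0, "medium": 1, "high": 2}


def at_least(rows, minimum):
    """Keep rows whose extraction_confidence is at or above `minimum`.

    Raises KeyError for a value outside the documented enum, so unexpected
    data is surfaced rather than silently dropped.
    """
    threshold = CONFIDENCE_RANK[minimum]
    return [r for r in rows if CONFIDENCE_RANK[r["extraction_confidence"]] >= threshold]


# Invented rows for illustration only.
rows = [
    {"id": "a", "extraction_confidence": "high"},
    {"id": "b", "extraction_confidence": "low"},
    {"id": "c", "extraction_confidence": "medium"},
]
reliable = at_least(rows, "medium")
```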

Question Verbatim Text

Question text is reproduced verbatim from the source document to the extent possible. Some formatting may be lost in extraction. Always refer to the original source for the authoritative wording.

Project Coordinates

Project latitude and longitude may be approximate. The geography_precision field indicates whether coordinates are for a specific site, a city centroid, or a county centroid. Do not rely on these coordinates where an exact location is required.
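
When plotting projects, one option is to keep only rows whose coordinates refer to a specific site and to handle centroid rows separately. The sketch below does this, assuming empty cells encode null. The rows and coordinates are invented, and the function name site_level_points is hypothetical.

```python
import csv
import io

# Invented rows and coordinates for illustration only.
PROJECTS_CSV = """id,canonical_name,lat,lng,geography_precision
p1,Example Campus A,38.9,-77.5,site
p2,Example Campus B,32.8,-96.8,city
p3,Example Campus C,,,county
p4,Example Campus D,,,site
"""


def site_level_points(projects):
    """Return (name, lat, lng) for projects with site-precision coordinates."""
    points = []
    for p in projects:
        if p["geography_precision"] != "site":
            continue  # city and county centroids are approximate
        if p["lat"] == "" or p["lng"] == "":
            continue  # lat and lng are nullable
        points.append((p["canonical_name"], float(p["lat"]), float(p["lng"])))
    return points


projects = list(csv.DictReader(io.StringIO(PROJECTS_CSV)))
points = site_level_points(projects)
```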

Capacity Figures

MW capacity figures are sourced from public reporting and may represent planned, permitted, or operational capacity depending on the project stage. The capacity_mw_source_confidence field indicates the reliability of the figure.
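
Because a capacity figure may describe planned, permitted, or operational capacity, totals are easier to interpret when grouped by project status rather than summed across the whole file. The sketch below totals capacity_mw per status_current and counts projects with no figure. It assumes empty cells encode null. The rows are invented, and the function name capacity_by_status is hypothetical.

```python
import csv
import io
from collections import defaultdict

# Invented rows for illustration only.
PROJECTS_CSV = """id,canonical_name,status_current,capacity_mw
p1,Example Campus A,operational,120.5
p2,Example Campus B,operational,80
p3,Example Campus C,announced,300
p4,Example Campus D,announced,
"""


def capacity_by_status(projects):
    """Return (total MW per status, count of projects without a MW figure per status)."""
    totals = defaultdict(float)
    missing = defaultdict(int)
    for p in projects:
        if p["capacity_mw"] == "":
            missing[p["status_current"]] += 1
        else:
            totals[p["status_current"]] += float(p["capacity_mw"])
    return dict(totals), dict(missing)


projects = list(csv.DictReader(io.StringIO(PROJECTS_CSV)))
totals, missing = capacity_by_status(projects)
```

Reporting the missing counts alongside the totals makes it clear how much of each status group the figures actually cover.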

Updates

Data is refreshed from sources approximately twice daily. New studies and projects are added as they are discovered and pass editorial review.

Citation

When citing data from PoweredByWho, please reference both PoweredByWho as the aggregator and the original source study or document. Each study detail page includes the source URL for direct citation.