Previous CBECS Survey Data 2018 | 2012 | 2003 | 1999 | 1995 | 1992 | Previous
1989
- Building Characteristics Tables
- Consumption and Expenditures Tables
- Microdata
- BLDGID4: building identifier, which is the link between files;
- ADJWT4: adjusted sampling weight;
- STRATUM4 and PAIR4: variance stratum and pair member which can be used for calculating variances;
- REGION4 and CENDIV4: Census region and division;
- SQFT4 and SQFTC4: square footage, both exact and category;
- PBA4: principal building activity;
- YRCONC4: year constructed category; and
- ELSUPL4, NGSUPL4, FKSUPL4, STSUPL4, HWSUPL4: a set of variables indicating whether electricity, natural gas, fuel oil, district steam or district hot water were used in the building.
Released: January 2009
The 1989 CBECS Public Use Files are comma separated value (.csv) files that each contain 5,876 records. They represent commercial buildings from the 50 States and the District of Columbia. Each record corresponds to a single responding, in-scope sampled building, and contains information for that building such as building size, year constructed, type of energy used, energy-using equipment, and conservation features.
The smallest level of geographic detail available is the Census division, of which there are nine in the U.S. No state level data are available.
Layout Files | Data Files | Revised Date | |
---|---|---|---|
File 1: Summary File | TXT | CSV | 12/08 |
File 2: Building Activity | TXT | CSV | 12/08 |
File 3: Operating Hours and Weather | TXT | CSV | 12/08 |
File 4: Building Shell, Equipment, and Multibuilding Facilities | TXT | CSV | 12/08 |
File 5: End Uses of Major Energy Sources | TXT | CSV | 12/08 |
File 6: End Uses of Minor Energy Sources | TXT | CSV | 12/08 |
File 7: Lighting and Conservation Features | TXT | CSV | 12/08 |
File 8: Electricity | TXT | CSV | 12/08 |
File 9: Natural Gas | TXT | CSV | 12/08 |
File 10: Fuel Oil | TXT | CSV | 12/08 |
File 11: District Steam and Hot Water | TXT | CSV | 12/08 |
File 12: District Chilled Water and Sum of Major Fuels | TXT | CSV | 12/08 |
File 13: Imputation Flags for Summary Data, Building Activity, Operating Hours, Shell and Equipment | TXT | CSV | 12/08 |
File 14: Imputation Flags for End Uses | TXT | CSV | 12/08 |
File 15: Imputation Flags for Lighting and Conservation Features | TXT | CSV | 12/08 |
All Format Codes | TXT | 12/08 | |
All Layout Files and Format Codes | 12/08 | ||
Questionnaire | 2/09 |
File Organization
Because of the size of the CBECS questionnaire, the variables were separated into groups by subject matter. These 15 smaller files make it easier to manipulate the data.
Several variables are frequently used in the analysis of commercial energy data. These core variables are included in each group of variables:
For each group of variables, there are two items: a layout file and a data file. The layout file is a text file which gives, for each variable on a file: the variable name, a description, the position on the file, and the corresponding format. The data file is a comma separated value file.
To determine what the different values for each variable represent, use the text file provided of all the format codes (alternatively, there is also a PDF document containing all the layout files and format codes). The formats are arranged in alphabetical order and are written so that they may be easily turned into a SAS format library.
Each of these 15 files can be used by itself or be merged with other files for more complex analyses. By merging files together, a new file can be created that contains, for each respondent, variables from two or more files. The variable BLDGID4 should be used to link the files.
The CBECS sample was designed so that survey responses could be used to estimate characteristics of the entire commercial buildings stock nationwide. All published CBECS tables report national estimates.
In order to arrive at national estimates from the CBECS sample, base sampling weights were calculated for each building (these are the reciprocal of the probability of that building being selected into the sample). Therefore, a building with a base weight of 1,000 represents itself and 999 similar, but unsampled buildings in the total building stock. The base weight is further adjusted to account for nonresponse bias. The variable ADJWT4 in the data file is the final weight. In order to obtain a national estimate, each sample building's value must be multiplied by the building's weight.
Imputation Flags or Z variables
Files 13 through 15 contain variables that begin with the letter Z. These "Z variables" are also referred to as "imputation flags." Imputation is a statistical procedure used to fill in values for missing items. Missing values for many, but not all, of the variables were imputed in 1989. The imputation flag indicates whether the corresponding non-Z variable was reported, imputed, or inapplicable. There are no corresponding "Z variables" for variables from the CBECS questionnaire which were not imputed, variables where there was no missing data, and variables which were derived based on other variables.
Confidentiality of Survey Respondents
Several variables have been modified to protect the confidentiality of respondents. This note escribes the procedures used.
Square Footage: The numeric square footage (Question B-1) has been modified in two ways, depending on the size of the building. For buildings over one million square feet, the numeric square footage has been replaced with the weighted average square footage of all responding buildings over one million square feet. Separate weighted means were calculated for each of the four Census regions (Northeast, Midwest, South, and West). For buildings one million square feet or less, the numeric square footage has been rounded to within ten percent of the upper limit of the buildings' square footage categories (Question B-2). However, if the rounded value fell below the lower limit of the category, the value was coded at this lower limit. For example, buildings in the range 5,001 to 10,000 square feet were rounded to the nearest 1,000 square feet (except that buildings rounding to 5,000 were coded as 5,001).
Number of Workers: For buildings where the numeric number of workers (Question E-11) was 5,000 or more, the reported numeric number of workers has been replaced with the weighted average number of workers of all responding buildings with 5,000 or more workers. Separate weighted means were calculated for each of the four Census regions (Northeast, Midwest, South, and West).
Number of Floors: The upper range of the number of floors (Question F-4) has been replaced with two categories: 15 to 25 floors (coded as 994 on the file) and over 25 floors (coded as 995 on the file).
Special Measures of Occupancy: Five special measures of occupancy are included in the 1989 CBECS. They are classroom seating capacity (B-7m) for education buildings, seating capacity (B-7n) for food service buildings, licensed bed capacity (B-7o) for in-patient health care buildings, licensed bed capacity (B-7p) for skilled nursing buildings, and number of guest rooms (B-7q) for lodging buildings. These measures were each rounded in the following fashion:
Fewer than 25 units
25 to 49 units
50 to 99 units
100 to 249 units
250 to 499 units
500 to 999 units
1,000 to 2,499 units
2,500 to 4,999 units
5,000 or more units
no rounding performed
rounded to nearest 5
rounded to nearest 10
rounded to nearest 25
rounded to nearest 50
rounded to nearest 100
rounded to nearest 250
rounded to nearest 500
rounded to nearest 1,000
Weather Variables: Heating and cooling degree-days which have a base 65 degrees Fahrenheit (F) are included on the data files. Also included on the data file are the annual mean and standard deviation of daily average temperatures. These can be used to compute approximate degree-days at any base temperature of interest, using a Gaussian (normal) approximation to the distribution of daily average temperatures.
The heating degree-day variable has been inflated or deflated by a random percentage, normally distributed with mean zero and standard deviation 2.0. The mean and standard deviation of temperature and the base 65 degrees Fahrenheit (F) cooling degree-days have been modified to be consistent with the modified heating degree-days.
1986
- Building Characteristics Tables
- Consumption and Expenditures Tables
- Microdata
- BLDGID3: building identifier, which is the link between files;
- ADJWT3: adjusted sampling weight;
- STRATUM3 and PAIR3: variance stratum and pair member which can be used for calculating variances;
- REGION3 and CENDIV3: Census region and division;
- SQFTC3: square footage category;
- PBA3: principal building activity;
- YRCONC3: year constructed category; and
- ELSUPL3, NGSUPL3, FKSUPL3, PRSUPL3, STSUPL3, HWSUPL3: a set of variables indicating whether electricity, natural gas, fuel oil, propane, district steam or district hot water were used in the building.
Released: January 2009
The 1986 CBECS Public Use Files are comma separated value (.csv) files that each contain 6,072 records. They represent commercial buildings from the 50 States and the District of Columbia. Each record corresponds to a single responding, in-scope sampled building, and contains information for that building such as building size, year constructed, type of energy used, energy-using equipment, and conservation features.
The smallest level of geographic detail available is the Census division, of which there are nine in the U.S. No state level data are available.
1986 CBECS Building Characteristics | layout files | data files | revised date |
---|---|---|---|
File 1: Summary File | TXT | CSV | 2/09 |
File 2: Building Activity | TXT | CSV | 2/09 |
File 3: Operating Hours | TXT | CSV | 2/09 |
File 4: Building Shell, Equipment, Energy Audits, and "Other" Conservation Features | TXT | CSV | 2/09 |
File 5: End Uses of Major Energy Sources | TXT | CSV | 2/09 |
File 6: End Uses of Minor Energy Sources | TXT | CSV | 2/09 |
File 7: HVAC, Lighting and Building Shell Conservation Features |
TXT | CSV | 2/09 |
File 8: Electricity | TXT | CSV | 2/09 |
File 9: Natural Gas and Fuel Oil | TXT | CSV | 2/09 |
File 10: District Steam and Hot Water | TXT | CSV | 2/09 |
File 11: Propane and District Chilled Water |
TXT | CSV | 2/09 |
File 12: Imputation Flags for Summary Data, Building Activity, Operating Hours, Shell and Equipment | TXT | CSV | 2/09 |
File 13: Imputation Flags for Energy Audits, "Other" Conservation Features and End Uses |
TXT | CSV | 2/09 |
File 14: Imputation Flags for HVAC, Lighting, and Shell Conservation Features | TXT | CSV | 2/09 |
All Format Codes | TXT | 2/09 | |
All Layout Files and Format Codes | 2/09 | ||
Questionnaire | 2/09 |
File Organization
Because of the size of the CBECS questionnaire, the variables were separated into groups by subject matter. These 14 smaller files make it easier to manipulate the data.
Several variables are frequently used in the analysis of commercial energy data. These core variables are included in each group of variables:
For each group of variables, there are two items: a layout file and a data file. The layout file is a text file which gives, for each variable on a file: the variable name, a description, the position on the file, and the corresponding format. The data file is a comma separated value file.
To determine what the different values for each variable represent, use the text file provided of all the format codes (alternatively, there is also a PDF document containing all the layout files and format codes). The formats are arranged in alphabetical order and are written so that they may be easily turned into a SAS format library.
Each of these 14 files can be used by itself or be merged with other files for more complex analyses. By merging files together, a new file can be created that contains, for each respondent, variables from two or more files. The variable BLDGID3 should be used to link the files.
The CBECS sample was designed so that survey responses could be used to estimate characteristics of the entire commercial buildings stock nationwide. All published CBECS tables report national estimates.
In order to arrive at national estimates from the CBECS sample, base sampling weights were calculated for each building (these are the reciprocal of the probability of that building being selected into the sample). Therefore, a building with a base weight of 1,000 represents itself and 999 similar, but unsampled buildings in the total building stock. The base weight is further adjusted to account for nonresponse bias. The variable ADJWT3 in the data file is the final weight. In order to obtain a national estimate, each sample building's value must be multiplied by the building's weight.
Imputation Flags or Z variables
Files 12 through 14 contain variables that begin with the letter Z. These "Z variables" are also referred to as "imputation flags." Imputation is a statistical procedure used to fill in values for missing items. Missing values for many, but not all, of the variables were imputed in 1986. The imputation flag indicates whether the corresponding non-Z variable was reported, imputed, or inapplicable. There are no corresponding "Z variables" for variables from the CBECS questionnaire which were not imputed, variables where there was no missing data, and variables which were derived based on other variables.
Confidentiality of Survey Respondents
Several variables have been modified to protect the confidentiality of respondents. This note describes the procedures used.
Square Footage: The numeric square footage (Question B-1) has been modified in two ways, depending on the size of the building. For buildings over one million square feet, the numeric square footage has been replaced with the weighted average square footage of all responding buildings over one million square feet. Separate weighted means were calculated for each of the four Census regions (Northeast, Midwest, South, and West). For buildings one million square feet or less, the numeric square footage has been rounded to within ten percent of the upper limit of the buildings' square footage categories (Question B-2). However, if the rounded value fell below the lower limit of the category, the value was coded at this lower limit. For example, buildings in the range 5,001 to 10,000 square feet were rounded to the nearest 1,000 square feet (except that buildings rounding to 5,000 were coded as 5,001).
Number of Workers: For buildings where the numeric number of workers (Question C-3) was 5,000 or more, the reported numeric number of workers has been replaced with the weighted average number of workers of all responding buildings with 5,000 or more workers. Separate weighted means were calculated for each of the four Census regions (Northeast, Midwest, South, and West).
Number of Floors: The upper range of the number of floors (Question D-3) has been replaced with two categories: 15 to 25 floors (coded as 15 on the file) and over 25 floors (coded as 26 on the file).
Special Measures of Occupancy: Five special measures of occupancy are included in the 1986 CBECS. They are classroom seating capacity (B-10m) for education buildings, seating capacity (B-10n) for food service buildings, licensed bed capacity (B-10o) for in-patient health care buildings, licensed bed capacity (B-10p) for skilled nursing buildings, and number of guest rooms (B10q) for lodging buildings. These measures were each rounded in the following fashion:
Fewer than 25 units
25 to 49 units
50 to 99 units
100 to 249 units
250 to 499 units
500 to 999 units
1,000 to 2,499 units
2,500 to 4,999 units
5,000 or more units
no rounding performed
rounded to nearest 5
rounded to nearest 10
rounded to nearest 25
rounded to nearest 50
rounded to nearest 100
rounded to nearest 250
rounded to nearest 500
rounded to nearest 1,000
Weather Variables: Heating and cooling degree-days which have a base 65 degrees Fahrenheit (F) are included on the data files. Also included on the data file are the annual mean and standard deviation of daily average temperatures. These can be used to compute approximate degree-days at any base temperature of interest, using a Gaussian (normal) approximation to the distribution of daily average temperatures.
The heating degree-day variable has been inflated or deflated by a random percentage, normally distributed with mean zero and standard deviation 2.0. The mean and standard deviation of temperature and the base 65 degrees Fahrenheit (F) cooling degree-days have been modified to be consistent with the modified heating degree-days.
1983
- Building Characteristics Tables
- Consumption and Expenditures Tables
- Microdata
- BLDGID2: building identifier, which is used to link files;
- XSECWT2: cross-sectionally adjusted sampling weight (use LONGWT2 to estimate changes between 1979 and 1983);
- STR402 and PAIR402: variance stratum and pair member which can be used for calculating variances;
- REGION2: Census region;
- SQFTC2: square footage category;
- BCWM2C: principal building activity;
- YRCONC2: year constructed category; and
- ELSUPL2N, NGSUPL2N, FKSUPL2N, PRSUPL2N and STSUPL2N: a set of variables indicating whether electricity, natural gas, fuel oil, propane or district steam were used in the building.
Released: July 2010
The 1983 Commercial Buildings Energy Consumption Survey (CBECS) Public Use Files are comma separated value (.csv) files where each record corresponds to a single responding, in-scope sampled building. Each of the 7,140 records contains information for that building such as building size, year constructed, type of energy used, energy-using equipment and conservation features.
The microdata files represent commercial buildings from the contiguous United States and the District of Columbia. The smallest level of geographic detail available is the Census region; no Census division or state level data are available.
Both the 1979 and 1983 CBECS included buildings that were 1,000 square feet or less and included predominantly residential or industrial buildings if commercial activity was present. Beginning in 1986, CBECS only includes buildings larger than 1,000 square feet that are predominantly commercial.
layout files | data files | revised date | |
---|---|---|---|
File 1: Building Characteristics | TXT | CSV | 6/10 |
File 2: Imputation Flags for Building Characteristics | TXT | CSV | 6/10 |
File 3: Consumption and Expenditures | TXT | CSV | 6/10 |
File 4: Longitudinal Variables (transferred from 1979 survey) | TXT | CSV | 6/10 |
All Format Codes | TXT | 6/10 | |
All Layout Files and Format Codes | TXT | 6/10 | |
Questionnaire (questionnaire begins on page 219) | 6/10 |
Because of the size of the CBECS questionnaire, the master file was divided into 4 files to make it easier to manipulate the data. The 1983 sample included all buildings that had been selected for the 1979 CBECS, as well as a sample of new buildings constructed since the 1979 CBECS. File 4 includes the 1979 values for buildings that participated in both the 1979 and 1983 CBECS. The 4 files are grouped by subject matter and each contains core variables that are particularly useful to analyze the data. The core variables are the following:
For each group of variables, two items are provided: a layout file and a data file. The layout file is a text file that gives, for each variable on a file: the variable name, a description, and the corresponding format. The data file is a comma separated value file.
To determine what the different values for each variable represent, use the text file "All Format Codes" (alternatively, use the text file "All Layout Files and Format Codes"). The formats are arranged in alphabetical order and are written so that they may be easily turned into a SAS format library.
Each of the 4 data files can be used by itself or merged with other files for more complex analyses. By merging files, a new file can be created that contains, for each respondent, variables from two or more files. The variable BLDGID2 is used to link the files.
The CBECS sample was designed so that survey responses could be used to estimate characteristics of the entire commercial buildings stock nationwide. All published CBECS tables report national estimates.
In order to arrive at national estimates from the CBECS sample, base sampling weights were calculated for each building (these are the reciprocal of the probability of that building being selected into the sample). Therefore, a building with a base weight of 1,000 represents itself and 999 similar, but unsampled buildings in the total building stock. The base weight is further adjusted to account for nonresponse bias. The variables XSECWT2 and LONGWT2 in the data file are the final weight for cross-sectional and longitudinal estimates, respectively. In order to obtain a national estimate, each sample building's value must be multiplied by the building's weight.
To find the national estimate for... | Do this... | And you should get... |
---|---|---|
Total number of buildings | Sum XSECWT2 | 4,243,680.04 (or 4,244 thousand) buildings |
Total number of office buildings | Sum XSECWT2 for cases where BCWM2C="11" | 575,078.78 (or 575 thousand) office buildings |
Total floorspace | Create a new variable (weighted square footage) by multiplying XSECWT2 by SQFT2 for each case, then sum this new variable | 60,747,713,493.92 (or 60,784 million) square feet |
Total electricity consumption in thousand Btu | Create a new variable (weighted electricity consumption) by multiplying XSECWT2 by ELBTU2 for each case, then sum this new variable | 3,372,422,042,097.23 (or 3,372 trillion) Btu |
Total number of buildings discontinuing use of fuel oil or kerosene between 1979 and 1983 | Create a new variable indicating fuel oil or kerosene use in 1979 (FKUSED1T=’1”) but not in 1983 (FKSUPL2N=’2’), multiplied by LONGWT2 for each case, then sum. | 320,326.63 (or 320 thousand) buildings |
Total number of commercial buildings (excluding agricultural, industrial, and residential buildings) | Sum XSECWT2 for cases where BCWM2C does not equal 1, 7, or 13 | 3,711,810.10 (or 3,712 thousand) commercial buildings |
Imputation Flags
File 2 contains "imputation flags," variables that begin with ”IMP.” Imputation is a statistical procedure used to supply values for missing items. Missing values for many, but not all, of the variables were imputed. The imputation flag indicates whether the corresponding data item was reported, imputed, or inapplicable. There are no corresponding imputation flags for variables from the CBECS questionnaire which were not imputed, variables where there was no missing data, and variables which were derived based on other variables.
Confidentiality of Survey Respondents
Several variables have been modified to protect the confidentiality of respondents. This section describes the procedures used.
Square Footage: The reported value of square footage has been modified in two ways, depending on the size of the building. For buildings over one million square feet, the numeric square footage has been replaced with the weighted average square footage of all responding buildings over one million square feet. Separate weighted means were calculated for each of the four Census regions (Northeast, Midwest, South and West). For buildings one million square feet or less, the numeric square footage has been rounded to within ten percent of the upper limit of the buildings' square footage categories. However, if the rounded value fell below the lower limit of the category, the value was coded at this lower limit. For example, buildings in the range 5,001 to 10,000 square feet were rounded to the nearest 1,000 square feet (except that buildings rounding to 5,000 were coded as 5,001).
Number of Workers: For buildings where the numeric number of workers was 5,000 or more, the reported numeric number of workers has been replaced with the weighted average number of workers of all responding buildings with 5,000 or more workers. Separate weighted means were calculated for each of the four Census regions (Northeast, Midwest, South and West).
Number of Floors: The upper range of the number of floors has been replaced with two categories: 15 to 25 floors (coded as 15 on the file) and over 25 floors (coded as 26 on the file).
Weather Variables: Heating and cooling degree-days which have a base 65 degrees Fahrenheit are included on the data files. The degree-day variables have been inflated or deflated by a random percentage, normally distributed with mean zero and standard deviation 2.0.
1979
- Building Characteristics Tables:
- Consumption and Expenditures Tables:
- Microdata
- BLDGID1: building identifier, which is used to link files;
- XSECWT1: adjusted sampling weight;
- STRATUM1 and PAIR1: variance stratum and pair member which can be used for calculating variances;
- REGION1: Census region;
- SQFTC1: square footage category;
- BCWM1: principal building activity;
- YRCONC1: year constructed category; and
- ELSUPL1, NGSUPL1, FKSUPL1, PRSUPL1 and STSUPL1: a set of variables that indicate whether electricity, natural gas, fuel oil, propane or district steam were used in the building.
Released: July 2010
The 1979 Commercial Buildings Energy Consumption Survey (CBECS) Public Use Files are comma separated value (.csv) files where each record corresponds to a single responding, in-scope sampled building. Each of the 6,221 records contains information for that building such as building size, year constructed, type of energy used, energy-using equipment and conservation features.
The microdata files represent commercial buildings from the contiguous United States and the District of Columbia. The smallest level of geographic detail available is the Census region; no Census division or state level data are available.
Both the 1979 and 1983 CBECS included buildings that were 1,000 square feet or less and included predominantly residential or industrial buildings if commercial activity was present. Beginning in 1986, CBECS only includes buildings larger than 1,000 square feet that are predominantly commercial.
layout files | data files | revised date | |
---|---|---|---|
File 1: Building Characteristics | TXT | CSV | 6/10 |
File 2: Imputation Flags for Building Characteristics | TXT | CSV | 6/10 |
File 3: Consumption and Expenditures | TXT | CSV | 6/10 |
All Format Codes (text file) | TXT | 6/10 | |
Questionnaire (questionnaire begins on page 289) | 6/10 |
Because of the size of the CBECS questionnaire, the master file was divided into 3 files to make it easier to manipulate the data. The 3 files are grouped by subject matter and each contains core variables that are particularly useful to analyze the data. The core variables are the following:
For each group of variables, two items are provided: a layout file and a data file. The layout file is a text file that lists, for each variable on the file: the variable name, a description, the position on the file and the file format. The data file is a comma separated value file that can be opened in Excel.
To determine what the different values for each variable represent, use the text file "All Layout Files and Format Codes." The formats are arranged in alphabetical order and are written so that they may be easily turned into a SAS format library.
Each of the 3 data files can be used by itself or merged with the other files for more complex analyses. The variable BLDGID1 is used to link the merged files.
The CBECS sample was designed so that survey responses could be used to estimate characteristics of the entire commercial buildings stock nationwide. All published CBECS tables report national estimates. To provide national estimates from the CBECS sample, base sampling weights were calculated for each building (the reciprocal of the probability of that building being selected into the sample). Therefore, a building with a base weight of 1,000 represents itself and 999 similar, but unsampled, buildings in the total building stock. The base weight is further adjusted to account for nonresponse bias. The variable XSECWT1 in the data file is the final weight. In order to obtain a national estimate, each sample building's value must be multiplied by the building's weight.
To find the national estimate for... | Do this... | And you should get... |
Total number of buildings | Sum XSECWT1 | 4,267,180.41 (or 4,267 thousand) buildings |
Total number of office buildings | Sum XSECWT1 for cases where BCWM1="11" | 544,954.01 (or 545 thousand) office buildings |
Total floorspace | Create a new variable (weighted square footage) by multiplying XSECWT1 by SQFT1 for each case, then sum this new variable | 54,636,363,945.58 (or 54,636 million) square feet |
Total floorspace in buildings with air conditioning | Sum the new weighted square footage variable (see above) for cases where COOL1="1" | 43,215,194,087.24 (or 43,215 million) square feet |
Total electricity consumption in thousand Btu | Create a new variable (weighted electricity consumption) by multiplying XSECWT1 by ELBTU1 for each case, then sum this new variable | 2,961,324,271,169.81 (or 2,961 trillion) Btu |
Imputation Flags
File 2 contains "imputation flags," variables that begin with "IMP." Imputation is a statistical procedure used to supply values for missing items. Missing values for many, but not all, of the variables were imputed. The imputation flag indicates whether the corresponding data item was reported, imputed, or inapplicable. There are no corresponding imputation flags for variables from the CBECS questionnaire that were not imputed, variables where there was no missing data, and variables that were derived based on other variables.
Confidentiality of Survey Respondents
Several variables have been modified to protect the confidentiality of respondents. This section describes the procedures used.
Square Footage: The reported value of square footage has been modified in two ways, depending on the size of the building. For buildings over one million square feet, the numeric square footage has been replaced with the weighted average square footage of all responding buildings over one million square feet. Separate weighted means were calculated for each of the four Census regions (Northeast, Midwest, South and West). For buildings one million square feet or less, the numeric square footage has been rounded to within ten percent of the upper limit of the buildings' square footage categories. However, if the rounded value fell below the lower limit of the category, the value was coded at this lower limit. For example, buildings in the range 5,001 to 10,000 square feet were rounded to the nearest 1,000 square feet (except that buildings rounding to 5,000 were coded as 5,001).
Number of Workers: For buildings where the numeric number of workers was 5,000 or more, the reported numeric number of workers has been replaced with the weighted average number of workers of all responding buildings with 5,000 or more workers. Separate weighted means were calculated for each of the four Census regions.
Number of Floors: The upper range of the number of floors has been replaced with two categories: 15 to 25 floors (coded as 15 on the file) and over 25 floors (coded as 26 on the file).
Weather Variables: Heating and cooling degree-days which have a base 65 degrees Fahrenheit are included on the data files. The degree-day variables have been inflated or deflated by a random percentage, normally distributed with mean zero and standard deviation 2.0.