ACS - MAISY Residential Socio-Economic, Building,Commuting, Energy Use and Hourly Loads ZIP-Detailed Databases |
 
ACS-MAISY Database Development DetailACS-MAISY Databases use actual individual household socio-economic data provided by US Census American Community Survey as a platform to extend each household's information to include a variety of other important data items. These merged data items include information for individual households in dozens of supporting databases including appliance holdings, energy use, metered hourly loads, travel, building, geodata, and emissions. The data merging process is an AI non-parametric k-nearest neighbor (KNN) machine-learning analysis with regression refinements providing superior accuracy and reliability for the combined household database. More than 6 million individual household records are provided in the ACS-MAISY Databases.... ACS-MAISY Database AI MethodologyDatabase records from supporting databases are identified as belonging to each ACS household's "neighborhood" with a weighted distance metric that reflects several dozens of the household’s ACS characteristics. Variables common to both the ACS household and each household in the supporting databases are used to calculate the "distance" between the ACS and each household in the supporting databases.The KNN algorthim is referred to as a machine-learning system because weights on each variable in the distance measure can be determined with repeated evaluations of sample data to identify the most accurate distance measure with respect to the particular merged variable. Data from the supporting database households with the smallest distance measure is merged with the ACS household record. Any difference between the ACS household and the closest neighbor variable values is adjusted with variable-specific statistical regression-based data refinements. This AI approach extends ACS individual household records to include dwelling unit physical characteristics, appliance holdings and energy use and costs, GHG emissions, ZIP code location, commuting miles, and end-use/whole building hourly electric kW loads. The non-parametric (read flexible) AI KNN estimation process is far superior to traditional engineering and other fixed parameter energy use estimating models like those used to develop DOE and NREL energy use and hourly load estimates. The AI KNN application applies actual data derived from similar "neighborhood" households whereas the DOE/NREL approach relies on models that reflect fixed relationships applied to all households, missing nuances that exist in small geographic areas and detailed household/dwelling unit segments - that is data items that are not explicitly reflected in the inflexible DOE/NREL models. The schematic illustrates the ACS-MAISY AI KNN process. Individual ACS household records are first geo-located to identify ZIP codes. ACS household record matching is accomplished with the KNN process described above to integrate energy use, appliances, hourly loads, dwelling unit, emissions, commuting data, climate, place, county, metro and utility names, and other information and then provided as a comprehensive household database. ACS-MAISY Increases Geographic GranularityACS-MAISY ZIP-level Databases provide 11 times more granular geographic detail than ACS public use databases provided by the Census Bureau. ACS public use databases identify each household location within a Public Use Micro Data areas (PUMAs) whereas ACS-MAISY household records identify more detailed ZIP code tabulation areas within each PUMA. An average PUMA includes about 11 ZIP code areas. Significant variation in household/dwelling unit characteristics typically occurs across ZIP code areas within each PUMA. Average ZIP code tabulation areas provide information on approximately 5,000 households whereas PUMA areas reflect aggregate characterizations of much wider geographic areas including approximately 56,000 households.NEW: ZIP-level Detail and CrosstabsIn addition to more detailed geographic information ACS-MAISY Databases support ZIP-level crosstabs whereas ACS public use databases can only provide more geographically aggregated PUMA crosstabs.Socio-economic, dwelling unit and other household characteristics typically vary widely within each puma. Both the increased geographic detail and ZIP level cross-tab ability make the ACS-MAISY Databases far superior at supporting detailed socio-economic, marketing, market sizing and other applications compared to Census PUMA-based analysis. Data Formats Like You WantDatabase information can be provided in spreadsheets with data for individual household records in separate rows, and/or processed by us to provide tables, cross tabs or any client-specified analysis results.ACS-MAISY Database ItemsThe following table shows data item detail available with the ACS-MAISY Database. Clients can select any data groups or data items for any geographic area (single ZIP, county, metro, etc.) and data format/analysis preference including full CVS databases with each household data items on a single row, individual relational databases that can be merged (joined), tables, cross-tabs, etc.Electric loads (kW) are available as 8760, 35040 15-minute, or monthly averages (weekday, weekend day and peak day). Load detail includes whole building loads and individual end use loads. Not sure exactly what data items you need, or which data items are best for the project at hand? - Let us help! – just e-mail us with your questions and/or suggest a time to discuss. We provide free consultations to help identify the most useful data/analysis for your application. We also provide free telephone support to assist in client data applications after data delivery.
Additional MAISY Residential Database Background DetailClick here to see a variety of related background information.Save Cost, Time and Hassle and Improve Analysis Accuracy With ACS-MAISY DatabasesWhether you want to extend socio-economic evaluations to the ZIP level, expand household information with new data items (e.g. energy bills, emissions, floor space), drill down on specific ZIP household segments or are looking to improve you market analyis and target marketing, ACS-MAISY Databases provide new and more accurate analysis results.Some applications:
Database CostsACS-MAISY Databases include proprietary information belonging to Jackson Associates and are licensed to individual users who agree to use the information only within their own organization. Information is provided in any of the following formats: (1) databases with individual household records in CSV files for client-specified database items and ZIP codes, (2) client-specified tables/cross-tabs for any combination of database items and ZIP codes. Database costs depend on data items and ZIP codes included in the data request. Data extraction costs start at $195. We provide free consultations to discuss data needs and will be happy to suggest appropriate data selections for you project.We understand the difficulties in identifying exactly the information you need for your project. Casting a wide net costs more; however, trying to minimize data collection costs can dramatically reduce information value. We have worked with start-ups and some of the largest energy-industry organizations to thread this needle based on our energy data analysis experience. We can help identify options and suggest the most effective data develoment strategies for your projects. – just e-mail us with your questions and/or suggest a time to discuss. We provide free consultations to help identify the most useful data/analysis for your application. We also provide free telephone support to assist in client data applications after data delivery. |