Constructing Cohort Data From Inter-census Data Based On Population Track Matching Method

Posted on:2015-10-01

Degree:Master

Type:Thesis

Country:China

Candidate:Y J Xu

Full Text:PDF

GTID:2297330431458970

Subject:Demography

Abstract/Summary:

PDF Full Text Request

Census data is typically a cross-sectional data. We try to find a method to track the records of same individuals in the two censuses, which can extract the information of the same cohort of people from two censuses. Compared by directly using cross-sectional data, analyzing and comparing the cohort data reflects more detailed variations of characteristics between two census years, not only the variation of characteristics,but the variations caused by which group of people.By exploring theâ€™Population Track Matching Methodâ€™, this article extracts cohort data from inter-census data. The method is based on the principle of "Family Structure Stability" and the principle of "Space Related". By putting individuals back into families they belonged to and using the mutual constraints between family members, in order to match "same person" from "one family" in two censuses, realizes the idea of constructing cohort data from inter-census data.The main contributions of this article are as follows, classifies area into different units, first tracking in the small units, then expands to big units; holds household size as logic line, relies on principles of "Family Structure Stability" and "Space Related" sets two match acquirements according to the change of "education degree"This article can be divided into four chapters. The first chapter is introduction, expounds the important points of constructing cohort data from cross-sectional data. The second chapter is about the specific technical route of population track matching method. The basic rules of matching records from two census data are the principle of "Family Structure Stability" and the principle of "Space Related". Household size change as the logical line and match acquirements which provided by census table fields:"gender","birth" and "nationality" must remain the same,"education degree" can change in a reasonable range. When there are one-to-many, and many-to-one and many-to-many match results, using address as auxiliary information to fix the "same person". The third chapter appliesâ€™Population Track Matching Methodâ€™to fifth population census data and sixth census data of Huangpu District, Shanghai to find cohort people. First uses each community as small unit, then expands to the whole district as big unit to track "same person" of "one family" from two censuses. The fourth chapter summarizes the pros and cons ofâ€™Population Track Matching Methodâ€™. The method finally reaches to27.85%household matching rate of fifth census, and44.95%household matching rate of sixth census.

Keywords/Search Tags:

inter-census data, data mining, construct cohort data from cross-sectionaldata

PDF Full Text Request

Related items

1	Analysis Of Age Data Of Population Census In Guizhou Province
2	Design And Implementation Of Analysis System Of Data-mining-based Campus Card Data
3	The Research Of Data Statistic Analysis And Data Mining Based On Higher Education Teaching Information
4	Research And Application Of Abnormality Early-Warning Of Student Compus Activities Based On Big Data Mining
5	Models And Application On University Subjects And Students Data Analysis Using Multiple Data Mining Strategies Method
6	Generalized Mass Teaching Behavior Based Analysis Of The Data
7	Data Mining For Applied Research, Statistical Work
8	Research On College Students’ Academic Achievement Based On Trajectory Data Mining
9	National Matriculation Grade Analysis Based On OLAP And Data Mining Technologies
10	Research On The Application Of Data Mining Technology In University Wisdom Aid Financially