Cohort study

For other senses of this word, see cohort (disambiguation).

A cohort study is a particular form of longitudinal study (panel study) that sample a cohort (a group of people who share a defining characteristic, typically who experienced a common event in a selected period, such as birth or graduation), performing a cross-section at intervals through time. A cohort study is a panel study, but a panel study is not always a cohort study as individuals in a panel study do not always share a common characteristic.[1]

A cohort study is a quasi-experiment used in medicine, nursing, psychology, social science, actuarial science, business analytics, and ecology. For instance in medicine, it is an analysis of risk factors and follows a group of people who do not have the disease, and uses correlations to determine the absolute risk of subject contraction. It is one type of clinical study design and should be compared with a cross-sectional study. Cohort studies are largely about the life histories of segments of populations, and the individual people who constitute these segments.[2][3]

A cohort is a group of people who share a common characteristic or experience within a defined period (e.g., are currently living, are exposed to a drug or vaccine or pollutant, or undergo a certain medical procedure). Thus a group of people who were born on a day or in a particular period, say 1948, form a birth cohort. The comparison group may be the general population from which the cohort is drawn, or it may be another cohort of persons thought to have had little or no exposure to the substance under investigation, but otherwise similar. Alternatively, subgroups within the cohort may be compared with each other.

Randomized controlled trials (RCTs) are a superior methodology in the hierarchy of evidence in therapy, because they limit the potential for any biases by randomly assigning one patient pool to an intervention and another patient pool to non-intervention (or placebo). This minimizes the chance that the incidence of confounding (particularly unknown confounding) variables will differ between the two groups. However, it is important to note that RCTs may not be suitable in all cases and other methodologies could be much more suitable to investigate the study's objective(s).

Cohort studies can be retrospective (looking back in time, thus using existing data such as medical records or claims database) or prospective (requiring the collection of new data).[4]


In medicine, a cohort study is often undertaken to obtain evidence to try to refute the existence of a suspected association between cause and effect; failure to refute a hypothesis often strengthens confidence in it. Crucially, the cohort is identified before the appearance of the disease under investigation. The study groups follow a group of people who do not have the disease for a period of time and see who develops the disease (new incidence). The cohort cannot therefore be defined as a group of people who already have the disease. Prospective (longitudinal) cohort studies between exposure and disease strongly aid in studying causal associations, though distinguishing true causality usually requires further corroboration from further experimental trials.

The advantage of prospective cohort study data is that it can help determine risk factors for contracting a new disease because it is a longitudinal observation of the individual through time, and the collection of data at regular intervals, so recall error is reduced. However, cohort studies are expensive to conduct, are sensitive to attrition and take a long follow-up time to generate useful data. Nevertheless, the results that are obtained from long-term cohort studies are of substantially superior quality to those obtained from retrospective/cross-sectional studies. Prospective cohort studies are considered to yield the most reliable results in observational epidemiology. They enable a wide range of exposure-disease associations to be studied.

Some cohort studies track groups of children from their birth, and record a wide range of information (exposures) about them. The value of a cohort study depends on the researchers' capacity to stay in touch with all members of the cohort. Some studies have continued for decades.

In a cohort study, the population under investigation consists of individuals who are at risk of developing a specific disease or health outcome.[5]


An example of an epidemiological question that can be answered using a cohort study is: does exposure to X (say, smoking) associate with outcome Y (say, lung cancer)? Such a study would recruit a group of smokers and a group of non-smokers (the unexposed group) and follow them for a set period of time and note differences in the incidence of lung cancer between the groups at the end of this time. The groups are matched in terms of many other variables such as economic status and other health status so that the variable being assessed, the independent variable (in this case, smoking) can be isolated as the cause of the dependent variable (in this case, lung cancer). In this example, a statistically significant increase in the incidence of lung cancer in the smoking group as compared to the non-smoking group is evidence in favor of the hypothesis. However, rare outcomes, such as lung cancer, are generally not studied with the use of a cohort study, but are rather studied with the use of a case-control study.

Shorter term studies are commonly used in medical research as a form of clinical trial, or means to test a particular hypothesis of clinical importance. Such studies typically follow two groups of patients for a period of time and compare an endpoint or outcome measure between the two groups.

Randomized controlled trials, or RCTs are a superior methodology in the hierarchy of evidence, because they limit the potential for bias by randomly assigning one patient pool to an intervention and another patient pool to non-intervention (or placebo). This minimizes the chance that the incidence of confounding variables will differ between the two groups.

Nevertheless, it is sometimes not practical or ethical to perform RCTs to answer a clinical question. To take our example, if we already had reasonable evidence that smoking causes lung cancer then persuading a pool of non-smokers to take up smoking in order to test this hypothesis would generally be considered quite unethical.

Two examples of cohort studies that have been going on for more than 50 years are the Framingham Heart Study and the National Child Development Study (NCDS), the most widely researched of the British birth cohort studies. Key findings of NCDS and a detailed profile of the study appear in the International Journal of Epidemiology.[6]

The Dunedin Longitudinal Study, started in 1975, has been studying the thousand people born in Dunedin, New Zealand in 1972-73. The subjects are interviewed regularly, with Phase 45 starting in 2017.

The largest cohort study in women is the Nurses' Health Study. Started in 1976, it is tracking over 120,000 nurses and has been analyzed for many different conditions and outcomes.

The largest cohort study in Africa is the Birth to Twenty Study, which began in 1990 and tracks a cohort of over 3,000 children born in the weeks following Nelson Mandela's release from prison.

Other famous examples are the Grant Study tracking a number of Harvard graduates from ca. 1950.77, the Whitehall Study tracking 10,308 British civil servants, and the Caerphilly Heart Disease Study, which since 1979 has studied a representative sample of 2,512 men, drawn from the Welsh town of Caerphilly.[7]


Comparison of timelines of case-control and cohort studies

The diagram indicates the starting point and direction of cohort and case-control studies. In Case-control studies the analysis proceeds from documented disease and investigations are made to arrive at the possible causes of the disease. In cohort studies the assessments starts with the putative cause of disease, and observations are made of the occurrence of disease relative to the hypothesized causal agent.[8]

Current and historical cohorts

A current cohort study represents a true prospective study where the data concerning exposure are assembled prior to the occurrence of the fact to be studied, for instance a disease. An example of a current cohort study is the Oxford Family Planning Association Study in the United Kingdom, which aimed to provide a balanced view of the beneficial and harmful effects of different methods of contraception. This study has provided a large amount of information on the efficacy and safety of contraceptive methods, and in particular oral contraceptives (OCs), diaphragms and intrauterine device (IUDs).

In a historical cohort study the data concerning exposure and occurrence of a disease, births, a political attitude or any other categorical variable are collected after the events have taken place, and the subjects (those exposed and unexposed to the agent under study) are assembled from existing records or health care registers.

A "prospective cohort" defines the groups before the study is done, while historical studies, which are sometimes referred to as "retrospective cohort", defines the grouping after the data is collected. Examples of a retrospective cohort are Long-Term Mortality after Gastric Bypass Surgery[9] and The Lothian Birth Cohort Studies.[10]

Though historical studies are sometimes referred to as retrospective study, it a misnomer as the methodological principles of historical cohort studies and prospective studies are the same.[8]

Nested case-control study

An example of a nested case-control study is Inflammatory markers and the risk of coronary heart disease in men and women, which was a case control analyses extracted from the Framingham Heart Study cohort.[11]

Household panel survey

Household panel surveys are an important sub-type of cohort study. These draw representative samples of households and survey them, following all individuals through time on a usually annual basis. Examples include the US Panel Study of Income Dynamics (since 1968), the German Socio-Economic Panel (since 1984), the British Household Panel Survey (since 1991), the Household, Income and Labour Dynamics in Australia Survey (since 2001) and the European Community Household Panel (1994–2001).

Cohort analysis in business

For an example in business analysis, see cohort analysis.

See also


  1. "What is the difference between a Panel Study and a Cohort Study?". Academia Stack Exchange. Retrieved 3 February 2016.
  2. "Cohort Studies". Web Center for Social Research Methods. Archived from the original on 2011-09-09.
  3. Porta, Miquel, ed. (2008). A Dictionary of Epidemiology (5th ed.). New York: Oxford University Press.
  4. "FAQ: What is a cohort study?". California Department of Public Health, Environmental Health Investigations Branch. June 2, 2001.
  5. Blumenthal, Ursula J.; Fleisher, Jay M.; Esrey, Steve A.; Peasey, Anne (2001). "Chapter 7: Epidemiology: a tool for the assessment of risk" (PDF). In Fewtrell, Lorna; Bartram, Jamie. Water Quality: Guidelines, Standards, and Health : Assessment of Risk and Risk Management for Water-related Infectious Disease. IWA Publishing. pp. 143–144.
  6. Power C, Elliott J (2006). "Cohort profile: 1958 British Cohort Study". International Journal of Epidemiology. 35 (1): 34–41. doi:10.1093/ije/dyi183. PMID 16155052.
  7. "Caerphilly and Speedwell collaborative heart disease studies. The Caerphilly and Speedwell Collaborative Group". Journal of Epidemiology and Community Health. 38 (3): 259–262. 1984. doi:10.1136/jech.38.3.259. PMC 1052363Freely accessible. PMID 6332166..
  8. 1 2 Meirik, O. "Cohort and Case-Control Studies". In Campana, Aldo. Reproductive Health. Geneva Foundation for Medical Education and Research.
  9. Adams TD, Gress RE, Smith SC, et al. (2007). "Long-term mortality after gastric bypass surgery". N. Engl. J. Med. 357 (8): 753–61. doi:10.1056/NEJMoa066603. PMID 17715409.
  10. "The Lothian Birth Cohort Studies". University of Edinburgh. Archived from the original on 2009-10-22. Retrieved 8 May 2011.
  11. Pai, JK; Pischon, T; Ma, J; et al. (2004). "Inflammatory markers and the risk of coronary heart disease in men and women". N. Engl. J. Med. 351 (25): 2599–2610. doi:10.1056/NEJMoa040967. PMID 15602020.

External links

This article is issued from Wikipedia - version of the 11/7/2016. The text is available under the Creative Commons Attribution/Share Alike but additional terms may apply for the media files.