De-Indentified Data
Definition: Health information is considered de-identified when it does not
identify an individual and there is no reasonable basis to believe that the
information can be used to identify an individual. Information is considered
de-identified if 18 identifiers are removed from the health information and
if the remaining health information could not be used alone, or in combination,
to identify a subject of the information. The identifiers include:
- names
- geographic subdivisions smaller than a state, including street
address, city, county, precinct, zip code and equivalent geocodes, except
for the
initial three digits of a zip code to 000
- all elements of dates (except year) for dates directly related
to an individual, including birth date, admission date, discharge date,
date of death, and
all ages over 89
- telephone numbers
- fax numbers
- electronic mail addresses
- Social Security numbers
- medical record numbers
- health plan beneficiary numbers
- account numbers
- certificate/license numbers
- vehicle identifiers and serial numbers, including license
plate numbers
- device identifiers and serial numbers
- Web Universal Resource Locator (URL)
- biometric identifiers, including finger or voice prints
- full face photographic images and any comparable images
- Internet Protocol address numbers
- any other unique identifying number characteristic or
code
How might this type of information be useful? De-identified data might be useful if you wish to obtain ballpark figures, e.g., the number of patients seen with diabetes, between the ages of 35-50. It is not possible to approach patients about participating in a study by using de-identified data.
Access requirements: To obtain de-identified data you need only to obtain a waiver of authorization. (Additional forms are unnecessary for this category of PHI.)