You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: content/en/docs/IV/_index.md
+1-1Lines changed: 1 addition & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -25,6 +25,6 @@ MIMIC-IV is separated into "modules" to reflect the provenance of the data. Ther
25
25
MIMIC-Note is currently not publicly available and the structure is subject to change.
26
26
{{% /pageinfo %}}
27
27
28
-
All patients across all datasets are in `mimic_core`. However, not all ICU patients have ED data, not all ICU patients have CXRs, not all ED patients have hospital data, and so on. Within an individual dataset, there are also incomplete tables as certain electronic systems did not exist in the past. For example, eMAR data is only available from 2015 onward.
28
+
All patients across all datasets are in the [hosp](/docs/iv/modules/hosp) module. However, not all ICU patients have ED data, not all ICU patients have CXRs, not all ED patients have hospital data, and so on. Within an individual dataset, there are also incomplete tables as certain electronic systems did not exist in the past, particularly the eMAR system.
29
29
30
30
Tables for each module are detailed in the respective sections.
Copy file name to clipboardExpand all lines: content/en/docs/IV/about/changelog.md
+52-2Lines changed: 52 additions & 2 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -7,9 +7,59 @@ description: >
7
7
Changes between releases of MIMIC-IV.
8
8
---
9
9
10
-
The latest version of MIMIC-IV is v1.0.
10
+
The latest version of MIMIC-IV is v2.1.
11
11
12
-
This page lists changes implemented in sequential updates to the MIMIC-IV database. Issues are tracked using a unique issue number, usually of the form #100, #101, etc (this issue number relates to a private 'building' repository).
12
+
This page lists changes implemented in sequential updates to the MIMIC-IV database. Issues are tracked using a unique issue number, usually of the form #100, #101, etc. Note that some of these issues are only accessible in a private 'building' repository.
13
+
14
+
### MIMIC-IV v2.1
15
+
16
+
MIMIC-IV v2.1 was released on November 14, 2022. It removed a subset of subject_id which will be retained internally as a test set. Future data releases will exclude these patients.
17
+
18
+
#### Major changes
19
+
20
+
* A subset of patients were removed from the dataset. 15,748 subject_id were removed from the patients table. 23,093 hadm_id were removed from the admissions table. 3,762 stay_id were removed from the icustays table.
21
+
22
+
### MIMIC-IV v2.0
23
+
24
+
MIMIC-IV v2.0 was released on June 12, 2022. It focused on expanding the data elements available for patients within MIMIC-IV v1.0. Additional data available includes out-of-hospital date of death, information from the online medical record system (which includes height and weight), and more detail for continuous infusions in the ICU.
25
+
26
+
#### Major changes
27
+
28
+
* The core module has been removed to simplify the schema. The _admissions_, _patients_, and _transfers_ tables are now in the hosp module.
29
+
* Neonates have been removed from the dataset. Neonatal data will be released in a separate project with data from the neonatal intensive care unit.
30
+
31
+
#### icu module
32
+
33
+
*_icustays_
34
+
* Around 700 stays (~1%) have changed due to the changes in the _patients_ table.
35
+
*_chartevents, d\_items_
36
+
* The problem list from MetaVision has been added. All problems are documented with the same `itemid` now present in _d\_items_: 220001. There are just over 1,000 unique problems. Most documented problems are related to the care plan for the patient and documented during nurse shift changes (either 7am or 7pm). Less frequently, the ongoing issues are documented here.
37
+
*_ingredientevents_
38
+
* This is a new table associated with _inputevents_. Each intravenous administration tracked in _inputevents_ is associated with a set of ingredients. These ingredients include water content, caloric information, and so on. The goal of the _inputevents_ table is to support nutrition research and to provide a mechanism for estimating fluid input via summing all instances of the water ingredient. These ingredients have been separated from the _inputevents_ table to simplify analysis and reduce the size of _inputevents_.
39
+
*_inputevents_
40
+
* Removed a single column which contained only null values: `cancelreason`.
41
+
*_procedureevents_
42
+
* Removed columns which contained only null values: `totalamount`, `totalamountuom`, `cancelreason`, `comments_editedby`, `comments_canceledby`, `comments_date`, `secondaryordercategoryname`.
43
+
44
+
#### hosp module
45
+
46
+
*_admissions_
47
+
* Fixed an issue where hospitalizations were missing _edregtime_ and _edouttime_ when the patient was admitted via the ED (reported in [#1247](https://github.com/MIT-LCP/mimic-code/issues/1247), thanks [@MEladawi](https://github.com/MEladawi)).
48
+
*_patients_
49
+
*`dod` is now populated with out-of-hospital mortality from state death records. For patients admitted to the ICU, this change has increased capture of date of death from 8,223 records to 23,844 (i.e. we now have out-of-hospital mortality for an additional 15,621 ICU patients).
50
+
* The mechanism for determining patients included in MIMIC was changed. For the most part this has resulted in an improvement, particularly regarding the logic for merging patients who had distinct medical record numbers. As a result of this change, most tables have had a change in the data content. Approximately 1% of stays were affected.
51
+
*_transfers_
52
+
* Fixed a bug where the `outtime` for ED stays with no associated `hadm_id` (i.e. an ED stay where the individual was not admitted to the hospital) was incorrect. This resulted in all _transfers_ rows with a NULL `hadm_id` having an apparent stay of minutes or less. The `outtime` column has now been corrected.
53
+
*_labevents, d\_labitems_
54
+
* The `itemid` for _d\_labitems_ has been changed for 43 items. These are extremely infrequently documented and each `itemid` has fewer than 100 observations in _labevents_. The exact `itemid` are provided in the changelog file CHANGELOG.txt.
55
+
* Errors were found in the current values of `loinc_code` (reported in [#938](https://github.com/MIT-LCP/mimic-code/issues/938), thanks [@Mauvila](https://github.com/Mauvila)). In order to enable collaborative improvement, the `loinc_code` column has been removed, and will now be collaboratively developed in the [MIMIC Code Repository](https://github.com/MIT-LCP/mimic-code/). Initial values will be sourced from the hospital system.
56
+
* A number of labs which previously had the value in the comments field now have the value in the value field (reported in [#941](https://github.com/MIT-LCP/mimic-code/issues/941), thanks [@Mauvila](https://github.com/Mauvila)). This change makes the _labevents_ table more consistent with MIMIC-III, which had these values in the value field.
57
+
*_microbiologyevents_
58
+
* New organisms, tests, specimens, and antibiotics have been added.
59
+
*_omr_
60
+
* A new table has been added: _omr._ The source of this data is the Online Medical Record, and it contains miscellaneous information useful for understanding an individual's health. As of v2.0, the _omr_ table has the following information: blood pressure, height, weight, body mass index, and Estimated Glomerular Filtration Rate (eGFR). These values are available from both inpatient and outpatient visits, and in many cases a "baseline" value from before a patient's hospitalization is available.
61
+
*_prescriptions_
62
+
* The `formulary_drug_cd` table has been added back (was previously in MIMIC-III). This column has the same set of values as the `product_code` column of emar\_detail.
Copy file name to clipboardExpand all lines: content/en/docs/IV/about/concepts.md
+2-1Lines changed: 2 additions & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -41,6 +41,7 @@ The *transfers* table contains information for each unique `transfer_id`. `trans
41
41
## `stay_id`
42
42
43
43
The *transfers* table also contains the `stay_id`. This is an artificially generated identifier which groups reasonably contiguous episodes of care.
44
+
The `stay_id` present in *icustays* is derived from the `stay_id` values in the *transfers* table.
44
45
45
46
# date and times
46
47
@@ -83,7 +84,7 @@ For events which occur over a period of time, `starttime` and `endtime` provide
83
84
84
85
### `dod`
85
86
86
-
`dod` is the patient's date of death sourced from the hospital database.
87
+
`dod` is the patient's date of death sourced from one of two sources: the hospital database or a state death database. See the [*patients*](/docs/iv/modules/hosp/patients) documentation for more detail.
Copy file name to clipboardExpand all lines: content/en/docs/IV/modules/ed/_index.md
+1-1Lines changed: 1 addition & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -4,5 +4,5 @@ linkTitle: "ED"
4
4
date: 2020-08-10
5
5
weight: 40
6
6
description: >
7
-
The ED module contains data for emergency department patients collected while they are in the ED. Information includes reason for admission, triage assessment, vital signs, and medicine reconciliaton. Patient identifiers allow MIMIC-ED to be linked to other MIMIC-IV modules.
7
+
The ED module contains data for emergency department patients collected while they are in the ED. Information includes reason for admission, triage assessment, vital signs, and medicine reconciliaton. The `subject_id` and `hadm_id` identifiers allow MIMIC-IV-ED to be linked to other MIMIC-IV modules.
Copy file name to clipboardExpand all lines: content/en/docs/IV/modules/ed/triage.md
+51-8Lines changed: 51 additions & 8 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -16,18 +16,61 @@ All fields in *triage* were originally free-text. For deidentification purposes,
16
16
17
17
**Table source:** Emergency department information system.
18
18
19
-
**Table purpose:**
19
+
**Table purpose:**Store information collected on triage to the emergency department.
20
20
21
-
**Number of rows:**
21
+
**Number of rows:**425,087
22
22
23
23
**Links to:**
24
24
25
25
**edstays* on `stay_id`
26
26
27
-
# Important considerations
28
-
29
-
* There is no time associated with triage observations. The closest approximation to triage time is the `intime` of the patient from the *edstays* table.
30
-
27
+
## Important considerations
28
+
29
+
There is no time associated with triage observations. The closest approximation to triage time is the `intime` of the patient from the *edstays* table.
30
+
31
+
The numeric entries in this table were originally stored as free-text. As a result, the columns required deidentification. Free-text entries which could not be converted trivially were removed. Normally, the application of deidentification in MIMIC-IV is indicated using three underscores (`___`) to make it clear to users that we have modified the data. However, due to the data type restriction, we were unable to do this in this case. As a result, **missing data in the numeric columns indicates either deidentified data or no data recorded**. However, this is usually rare. Below is a table demonstrating how often data were removed for deidentification purposes:
32
+
33
+
Column | Number of NULL values inserted for deidentification | Number of rows missing data in v2.1
34
+
--- | --- | ---
35
+
`temperature` | 680 | 23415
36
+
`heartrate` | 292 | 17090
37
+
`resprate` | 223 | 20353
38
+
`o2sat` | 414 | 20596
39
+
`sbp` | 238 | 18291
40
+
`dbp` | 214 | 19091
41
+
`acuity` | 0 | 6987
42
+
43
+
From the above, we can see that of the 23415 rows missing a `temperature` value, only 680 had a free-text value which was deleted during deidentification (~3%).
44
+
45
+
<!--
46
+
SQL queries to generate the above:
47
+
48
+
select
49
+
COUNT(tr_phi.temp) - COUNT(tr.temperature) AS temperature
50
+
, COUNT(tr_phi.hr) - COUNT(tr.heartrate) AS heartrate
51
+
, COUNT(tr_phi.rr) - COUNT(tr.resprate) AS resprate
52
+
, COUNT(tr_phi.sao2) - COUNT(tr.o2sat) AS o2sat
53
+
, COUNT(tr_phi.sbp) - COUNT(tr.sbp) AS sbp
54
+
, COUNT(tr_phi.dbp) - COUNT(tr.dbp) AS dbp
55
+
, COUNT(tr_phi.acuity) - COUNT(tr.acuity) AS acuity
0 commit comments