Difference between revisions of "2.3.2 Primary analysis and evaluation of raw data"
Line 3: | Line 3: | ||
Information about primary analysis of raw data is critical for establishing a connection between raw data and reported results and is therefore an essential part of data traceability (see item [[3.1.2.1 Traceability of data and any person having impact on data]]). | Information about primary analysis of raw data is critical for establishing a connection between raw data and reported results and is therefore an essential part of data traceability (see item [[3.1.2.1 Traceability of data and any person having impact on data]]). | ||
− | |||
Line 9: | Line 8: | ||
Primary analysis of raw data should: | Primary analysis of raw data should: | ||
* be performed blinded (e.g. by an experimenter unaware of pharmacological treatment) | * be performed blinded (e.g. by an experimenter unaware of pharmacological treatment) | ||
− | ** NOTE that, for knowledge-claiming research ([[Purpose of research]], this is a requirement | + | ** NOTE that, for knowledge-claiming research ([[2.1.4 Purpose of research]], this is a requirement |
* maintain the original randomization scheme (if applicable) | * maintain the original randomization scheme (if applicable) | ||
* follow a pre-specified analysis plan that may be a part of the study plan | * follow a pre-specified analysis plan that may be a part of the study plan | ||
− | ** NOTE that, for knowledge-claiming research ([[Purpose of research]], this is a requirement | + | ** NOTE that, for knowledge-claiming research ([[2.1.4 Purpose of research]], this is a requirement |
* include data verification (even in case of data produced by automatic systems there are generally additional data which are manually produced. Examples may be body weight, volume of drugs administered, unplanned observations performed during an experiment such as aberrant behavior) | * include data verification (even in case of data produced by automatic systems there are generally additional data which are manually produced. Examples may be body weight, volume of drugs administered, unplanned observations performed during an experiment such as aberrant behavior) | ||
* include a data validity check i.e. with respect to acceptance criteria pre-defined in the study plan | * include a data validity check i.e. with respect to acceptance criteria pre-defined in the study plan | ||
Line 23: | Line 22: | ||
'''PLEASE DO NOT FORGET''' | '''PLEASE DO NOT FORGET''' | ||
* To consider adding this subject to a training program for new employees or refresher training | * To consider adding this subject to a training program for new employees or refresher training | ||
− | * To label and store all primary analysis files in such a way that it ensures data traceability (for details see item [[3.1.2.1 Traceability of data and any person having impact on | + | * To label and store all primary analysis files in such a way that it ensures data traceability (for details see item [[3.1.2.1 Traceability of data and any person having impact on data]]) |
* Outside the pre-specified criteria, exclusion of data points and observations is only possible as long as primary analysis is conducted blind (i.e. before unblinding) | * Outside the pre-specified criteria, exclusion of data points and observations is only possible as long as primary analysis is conducted blind (i.e. before unblinding) | ||
* All decisions to exclude data MUST be transparent (e.g. if necessary, recorded and reported) | * All decisions to exclude data MUST be transparent (e.g. if necessary, recorded and reported) | ||
− | |||
− | |||
− | |||
− | |||
+ | == C. Resources == | ||
+ | to be added | ||
Revision as of 17:28, 5 September 2020
A. Background & Definitions
Primary analysis of raw data is the data processing required in order to derive (secondary) data that will be shared, presented and/or subjected to statistical analysis.
Information about primary analysis of raw data is critical for establishing a connection between raw data and reported results and is therefore an essential part of data traceability (see item 3.1.2.1 Traceability of data and any person having impact on data).
B. Guidance & Expectations
Primary analysis of raw data should:
- be performed blinded (e.g. by an experimenter unaware of pharmacological treatment)
- NOTE that, for knowledge-claiming research (2.1.4 Purpose of research, this is a requirement
- maintain the original randomization scheme (if applicable)
- follow a pre-specified analysis plan that may be a part of the study plan
- NOTE that, for knowledge-claiming research (2.1.4 Purpose of research, this is a requirement
- include data verification (even in case of data produced by automatic systems there are generally additional data which are manually produced. Examples may be body weight, volume of drugs administered, unplanned observations performed during an experiment such as aberrant behavior)
- include a data validity check i.e. with respect to acceptance criteria pre-defined in the study plan
Data generated via primary analysis of raw data should be securely stored (see item 3.1.1 Platform to record data). Alternatively, one may store tools, algorithms, scripts and related analysis-related information that would be sufficient to reconstitute the analysis. If the latter approach is taken, two requirements apply:
- Repetition of the analysis should be possible for any researcher with the necessary skills
- One should ensure technical feasibility of such re-analysis for the entire period during which raw data are stored (e.g. ability to re-analyze should not be affected by updates in software or readability of guiding information)
PLEASE DO NOT FORGET
- To consider adding this subject to a training program for new employees or refresher training
- To label and store all primary analysis files in such a way that it ensures data traceability (for details see item 3.1.2.1 Traceability of data and any person having impact on data)
- Outside the pre-specified criteria, exclusion of data points and observations is only possible as long as primary analysis is conducted blind (i.e. before unblinding)
- All decisions to exclude data MUST be transparent (e.g. if necessary, recorded and reported)
C. Resources
to be added
back to Toolbox
Next item: 2.3.3 Statistical analysis