Curates subject- and file-level attributes with values derived from file attributes.
Uses the nacc-attribute-deriver package.
When run on a project, this gear pulls all of the files with the file suffix from the config (defaults to JSON files) and organizes them by subject. The gear then runs a curation process on all of the files for a subject, in an order determined by file type (scope) and date. Each file is curated in that order by the attribute deriver, which operates on the custom information of the subject and of the data file.
The curation order needs to be kept consistent with the rules in the attribute deriver package. At the moment, this mainly means that UDS form files need to be visited after all other files to ensure that NACC derived variables can be set.
Most of this logic is handled in common/src/curator/scheduling.py and common/src/curator/form_curator.py.
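The ordering described above can be sketched roughly as follows. This is a minimal illustration, not the code in scheduling.py: the FileRecord class, the scope labels ("np", "uds") and the curation_order function are hypothetical names chosen for the example, and the real scheduler may order the non-UDS scopes more finely.

```python
from collections import defaultdict
from dataclasses import dataclass
from datetime import date


@dataclass(frozen=True)
class FileRecord:
    """Hypothetical stand-in for a project file; not a real gear type."""

    subject: str
    scope: str  # assumed scope labels, e.g. "np" or "uds"
    visit_date: date
    name: str


def curation_order(files):
    """Group files by subject, then order each group so UDS files come last.

    Within the non-UDS and UDS groups, files are ordered by date
    (the file name is only a tie-breaker).
    """
    by_subject = defaultdict(list)
    for record in files:
        by_subject[record.subject].append(record)

    return {
        subject: sorted(
            records,
            # False sorts before True, so non-UDS files are visited first.
            key=lambda r: (r.scope == "uds", r.visit_date, r.name),
        )
        for subject, records in by_subject.items()
    }


files = [
    FileRecord("S1", "uds", date(2020, 1, 5), "uds-1.json"),
    FileRecord("S1", "np", date(2022, 3, 1), "np-1.json"),
    FileRecord("S1", "uds", date(2019, 6, 2), "uds-0.json"),
    FileRecord("S2", "uds", date(2021, 7, 9), "uds-2.json"),
]
ordered = curation_order(files)
```

Here the NP file for subject S1 is visited before both UDS files even though it has the latest date, which is the property the UDS derived variables depend on.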
- Selects files using the filename_pattern config (defaults to *.json for all JSON files)
- Writes curated values to file.info and/or subject.info, depending on the specific curation rules applied for that scope
- Writes subject.info.derived.cross-sectional values back into each UDS file
- Tags subjects where subject.info.derived.affiliate is True with the affiliated tag. This is determined by the attribute-deriver
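Two of the steps above, file selection by pattern and affiliate tagging, are simple enough to sketch. The function names select_files and affiliate_tags are hypothetical, and the subject's custom information is assumed here to be a plain nested dictionary; the gear itself works through the Flywheel SDK rather than on bare dicts.

```python
from fnmatch import fnmatchcase


def select_files(filenames, filename_pattern="*.json"):
    """Keep only names matching the glob pattern (case-sensitive)."""
    return [name for name in filenames if fnmatchcase(name, filename_pattern)]


def affiliate_tags(subject_info, existing_tags=()):
    """Return the subject's tags, adding "affiliated" when the flag is True.

    subject_info is assumed to mirror subject.info, so the flag is read
    from subject_info["derived"]["affiliate"].
    """
    tags = list(existing_tags)
    derived = subject_info.get("derived", {})
    if derived.get("affiliate") is True and "affiliated" not in tags:
        tags.append("affiliated")
    return tags


selected = select_files(["visit1.json", "notes.csv", "visit2.json"])
tagged = affiliate_tags({"derived": {"affiliate": True}}, ["reviewed"])
untagged = affiliate_tags({"derived": {"affiliate": False}})
```

The tag is only added when the derived value is exactly True, so a missing or false flag leaves the subject's tags unchanged.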