The Non-Standard Variables (NSV) Registry serves as a reference for using or creating non-standard variables to drive consistency in implementing CDISC Standards. The NSV Registry consists of the following resources:
- NSVs: A singular resource curated across CDISC Standards to access NSVs to foster standardization when using non-standard variables.
- Fragments: A list of suggested variable-naming fragments as well as principles for creating fragments.
The NSV team is composed of representatives from SDS, CT, SEND, CDASH, and ADaM Standards Development teams who approve the variables and metadata to ensure consistency across the Standards. Additionally, the NSV Team approves fragments to help implementers build the variables consistently.
NSVs and SUPPQUALs
The SDTM does not allow the addition of new variables. Therefore, the Supplemental Qualifiers special-purpose dataset model is used to capture NSVs and their association to parent records in General Observation class datasets (Events, Findings, Interventions), Demographics (DM), and Subject Visits (SV). Supplemental Qualifiers are represented as separate SUPP-- datasets for each dataset containing sponsor-defined variables (see SDTMIG v3.4, Section 8.4.2, Submitting Supplemental Qualifiers in Separate Datasets).
SUPP-- represents the metadata and data for each NSV/value combination. As the name suggests, this dataset is intended to capture additional qualifiers for an observation. Data that represent separate observations should be treated as separate observations.
The Supplemental Qualifiers dataset is structured similarly to the RELREC dataset, in that it uses the same set of keys to identify parent records. Each SUPP-- record also includes the name of the qualifier variable being added (QNAM), the label for the variable (QLABEL), the actual value for each instance or record (QVAL), the origin (QORIG) of the value (see SDTMIG v. 3.4, Section 4.1.8, Origin Metadata), and the evaluator (QEVAL) to specify the role of the individual who assigned the value (e.g., "ADJUDICATION COMMITTEE","SPONSOR"). Controlled Terminology for certain expected values for QNAM and QLABEL is included in Appendix C1, Supplemental Qualifiers Name Codes of the SDTMIG v3.4.
Non-standard Variables (NSVs)
NSVs are found in the Therapeutic Area User Guides (TAUGs), which follow the practices outlined in the Alternative Representation of Non-Standard Variables. Accordingly, SDTM-based examples that contain sample data requiring the use of a variable outside the standard set of variables included in the SDTM are represented not with Supplemental Qualifier records, but with NSVs appended to the end of the parent domain, followed by sample metadata for the NSVs.
CDASH-based examples that contain fields, which collect data that require the use of NSVs annotate their SDTMIG target as variables in the domain’s corresponding NS-- dataset, rather than as QNAM/QVAL pairs in the domain’s corresponding SUPP-- dataset (e.g., for an NSV "XXCCCC", as "NSXX.XXCCCC" rather than as "SUPPXX.QVAL where SUPPXX.QNAM = 'XXCCCC'"). To avoid confusion between standard variables and NSVs in the sample datasets, NSVs have been rendered visually distinct, as shown in Figure 1, with white text on black in the header row and separated from the standard variables by a small space.
Metadata for NSVs, from the Define-XML document that would accompany the submission, are tabulated below the example; only those attributes or elements that assist the example are included. (For more information on variable-level metadata in general, see Define-XML v2.1 Sections 4.3 and 5.3.12).
Figure 1
NSV Curation Principles for entering data into the NSV Registry
Metadata Variable | Rules |
---|---|
Variable Name |
|
Label |
|
Description and Notes |
|
Simple Datatype |
|
XML Datatype |
|
Limited to Domain(s) |
|
Limited to Class(es) |
|
Codelist |
|
External Dictionary |
|
Role |
|
Qualifying Variable(s) |
|
Source(s) |
|
Used in Domain(s) |
|
Recommendations to help in developing Non-Standard Variables.
- Follow ISO and CDISC principles!
- A definition should not contain implementation instructions.
- Definitions should not refer to other variables unless absolutely necessary.
- Acronyms should be spelled out.
- Suggestion: Define Root Variable first.
- Domain specific variable definitions will follow exact language except for addition of domain name.
- Inconsistencies in usage will be easily identified.
- EVS has coded and created definitions for most CDISC metadata (tagged as NCI definitions in NCIt).
- These can be used or edited as the team sees fit.
Metadata rules for entering data into the NSV Registry
Metadata Variable | Rules |
---|---|
Variable Name |
|
Label |
|
Description and Notes |
|
Simple Datatype |
|
XML Datatype |
|
Limited to Domain(s) |
|
Limited to Class(es) |
|
Codelist |
|
External Dictionary |
|
Role |
|
Qualifying Variable(s) |
|
Source(s) |
|
Used in Domain(s) |
|
If a desired NSV can’t be identified in the NSV Registry, then the Fragment Database may be used to create a new non-standard variable. The fragment database was created to facilitate the development of the non-standard variables. The Variable Fragment Naming Rules will provide guidance for the creation of non-standard variables.
When using the fragment spreadsheet to create NSVs, it is important to remember that there are conventions used in CDASH and ADaM that have meaning dependent on where the fragment sits in the variable (whether prefix, suffix or in the middle). It is best to avoid using fragments in these locations for NSVs to avoid conflicts in traceability across the Foundational models.
Variable Label/Variable Fragment Naming Rules
Summary
- The Variable Label is designed to represent the essence of the variable through discrete concepts, breaking down the variable meaning/concept into the following topics sub-concepts:
- What is the primary topic of the given variable (Object or Main Concept)?
- What is being asked about the primary topic (Property and Property Qualifier)?
- What is/are the anticipated response(s) or answer(s) to Representation term, in turn, defines the data type of the answers provided. Having it defined properly helps significantly with identifying what kind of data is collected against the given variable.
- The Variable Label should be created as a combination of:
Object + Property Qualifier(s) + Property + Representation Qualifier (s) + Representation Term
- Use fragments that make the Variable Name as readable as possible.
- Six letters
- Use Main Concept first, following the design strategy in point #1.
- Remove vowels, skinny ones such as "I" first, then consonants (again, skinny letters first).
- Remove duplicate letters.
- Use fragment spreadsheet and more letters for the most important concepts.
- Have 1 - 6 characters available depending on the variable concept/idea
- May be helpful to see the different options to assess the best readability
- Look at the letters and consider if another word comes to mind, if yes, consider a different fragment variation
Considerations when Constructing Non-standard Variables Relative to ADaM
ADaM Non-standard Variables
- Generally, for an ADaM Non-standard Variable, the naming convention is associated with an expected variable type, or an expected value considered overall. For example, if any variable ends with *FL, its value can only be null, Y or N; otherwise, it will trigger ADaM compliance.
- Another example, any variable ending with *DT or DTM, the expected variable type is integer and can’t be character.
- Another example, variables whose names end in GRy, Gy, or CATy are grouping character version of variables, where y is an integer [1-99, not zero-padded] representing a grouping scheme. Other required suffix fragments for use in ADaM variables with details see ADaMIG v1.3 table 3.1.5.1.
SDTM Non-standard Variables
- When SDTM NSVs carry to ADaM datasets, how their name is chosen could impact the ADaM compliance if the ADaM naming convention associated with expected type and/or value is not followed. The SDTM NSV naming convention focuses on the name of the concept (the variable label).
- Name fragments from the ADaMIG: FL, TM, DY, BL, CHG, SC (screening).
- Name fragments from ADaM Oncology Examples: *STT (status), MS (metastasis), P (prior), TS (time since).
* is for suffix. Others with position of this fragment within the variable name are dependent on the purpose of the variable.
Recommendations to help in developing Non-Standard Variables.
- Follow ISO and CDISC principles!
- A definition should not contain implementation instructions.
- Definitions should not refer to other variables unless absolutely necessary.
- Acronyms should be spelled out.
- Suggestion: Define Root Variable first.
- Domain specific variable definitions will follow exact language except for addition of domain name.
- Inconsistencies in usage will be easily identified.
- EVS has coded and created definitions for most CDISC metadata (tagged as NCI definitions in NCIt).
- These can be used or edited as the team sees fit.