A great deal of time and resource is currently spent by research teams in the development of operational definitions as part of the overall process of study design and protocol creation in Real World Evidence research. This article aims to outline the challenge to study teams, the costs and time impact, and make recommendations that RWE organisations can make to address them.
For each study protocol, and for each study design element within them, be they inclusion/exclusion criteria, endpoint, outcome measure, exposure, sub-groups or covariates, there needs to be a clearly aligned operational definition.
Additionally when exploring the application of real world data sources to a particular study protocol, creation of the associated code lists and value set must be performed before evaluation of a given real world data source can be performed.
Both the definition of the research question through operational definitions and the associated value sets need to be available before assessment can be made as to whether a Real World Data source is fit-for-purpose.
The process of creation and selection of operational definitions, and their associated code lists and value sets create significant burden to RWE Research organisations. This burden falls specifically in the following areas:
Managing the process of operational definition and value set creation activities across multiple stakeholders (Medical, HEOR, Biostatistics, Data Science, Data Partners) repeatedly fall on EACH study team lead and are repeated for EACH study design.
In order to cushion this impact, teams look for knowledge resources and pre-existing study designs in an effort to re-use definitions and code-lists/value sets. Whilst this seems a logical solution, this information is usually dispersed across the sponsor organisation, causing inefficiency and in some cases study teams are having to reinvent the wheel, or apply knowledge and information with a high risk of being "out of date" simply to be expedient and move the project forward.
A survey of our customers at Navidence estimated a timeline of 4-6 weeks to create operational definitions, at a resource cost of around 200 hours or $50,000...Every Study.
Compounding the challenge of creating a knowledge base of operational definitions, is the fact that the FDA have provided clear guidance of the need to show conceptual and operational definitions, as well as associated mapped data elements and computable phenotypes in all study protocols.
With regulators expecting this information as part of protocol submission, there is a need to be able to create outputs to enable review of each protocol, not only at the operational definition level, but how associated code-list and value sets align with each study element and align with sources of real world data being considered for the study.
This is never more relevant than in the case of responses to safety requirements. The ability to be timely in the responses to regulators on how safety endpoints will be defined and collected is critical; and when time constrained, teams need to be able to turn to a library of definitions that is trusted, and where time has already been spent to align operational definitions and data definitions.
Given the significant costs and time impact occurring within study teams, and the pressures arising from the demand from external stakeholders, there is a strong financial and strategic argument to invest in building a robust library of operational definitions and associated computable phenotypes.
Pre-curated libraries of CODefs (Computable Operational Definitions) can drive a range of significant benefits for an RWE Organisation not only in the ability to perform RWE Design, but to apply these definitions in RWD Assessment and in use cases from External Control Arms to Trial Tokenisation where specificity of definitions, precision of data selection, and the need to provide robust justifications to internal and external are paramount.