Industry Reports

Clinical Data Industry Research

Two-Thirds of Data Managers and CRAs Warn of Data Quality Risks if Trial Inefficiencies Persist

1. Executive Summary

Pharma and biotech teams face pressure to deliver faster trial timelines under increasingly
complex protocols and constrained budgets. Meeting these demands requires not just innovation,
but a willingness to depart from ‘good enough’ processes and technology for those carrying out
clinical data tasks.

Inefficiency is the norm for many clinical professionals. Data managers and clinical research
associates (CRAs) – the people responsible for collecting, monitoring, and ensuring the data is fit for
purpose – are bogged down by manual processes and fragmented systems. Too much time is lost
to repetitive steps, re-entry of information, and workarounds outside core clinical platforms.

This research identifies the scale and root causes of that lost productivity in typical Phase III trials,
and offers a path forward.
We surveyed 88 data managers and CRAs and conducted in-depth interviews
with select participants to gather first-hand accounts of the impact
of workflow inefficiencies.

Main takeaways

  • Inefficiencies are embedded in workflows
    Most data manager tasks take more time than they should, especially manual data reconciliation
    and data review & cleaning. CRAs say updating multiple systems, writing reports, and following up
    with sites all take longer than they should.
  • Manual work and disconnected systems are the root cause of inefficiencies
    CRAs are impacted the most by system fragmentation, with data managers citing too many manual
    steps and a lack of modern tool usage as the main reason for inefficiency. Most manual data
    reconciliation work is happening outside of clinical systems, or using multiple different systems.
  • The cost of inaction is burnout and data quality
    Burnout or staff turnover and data quality issues were some of the greatest consequences for
    CRAs and data managers if inefficiencies are left unaddressed. CRA burnout and turnover is a widely
    reported issue, but could be an increasing trend for data managers.
  • Organizational resistance to change hinders progress
    Despite the challenges caused by inefficient processes and systems, almost half of data managers
    and CRAs say that resistance to change is a key barrier to a more productive future state. A sense
    of current workarounds being ‘good enough’ hides the damage caused by inefficiency in clinical trials.
    Nearly half of respondents think that SOPs fail to reflect real-world workflows or underutilize available
    tools, demonstrating the organizational barriers to greater efficiency.
  • Future roles focus on pragmatic change
    AI support and risk-based approaches are anticipated to evolve data managers’ work over the next two
    years, but data managers cited automation as #1 for their role evolution. The vast majority of CRAs
    expect risk-based monitoring to become the norm in their roles, but feel the stretch from greater study
    complexity and budget/resource constraints.
  • System modernization is a clear opportunity
    Three quarters of data managers and over half of CRAs say their teams are actively looking for ways to
    modernize trial execution. Nearly all respondents feel that better integration between clinical data and
    operations systems would significantly improve execution.

“Identifying inconsistencies between two systems for the same kind of data should be easy in the
advanced tech world we live in.”

Data manager’s response to the question “Why is [manual data reconciliation] so inefficient?”

2. Key Findings

2.1
Inefficiencies faced by data managers

Data managers spend the greatest proportion of their
time on data review and cleaning (18%) [Figure 1].
This task is also a top inefficiency, cited as the second
most inefficient task (35%) after manual data reconciliation
(52%). Manual reconciliation occupies about 11% of data
manager’s time yet is their highest priority to improve,
indicating that inefficiency is a bigger driver of frustration
than time spent on a task.

“We have home-grown tools for
data cleaning but I don’t trust
them because they cannot
handle the variability in data
structure between studies.
Data sources have differences
in date formatting, extra spaces
in an output, column naming…
They can differ by source and by
study. There is too much variation
so we stick with Excel.”

Senior Clinical Data Specialist,
Global CRO

Tasks flagged as the most inefficient (manual data reconciliation
and data review and cleaning) are also among the tasks more
often performed outside clinical systems.
For those citing data
reconciliation as a top inefficiency, 97% perform reconciliations
either outside of clinical systems (50%) or with a mix of systems
(47%). For data review and cleaning, 39% of respondents perform
the task outside clinical systems, 52% with a mix of systems,
and only 9% within a single clinical system.

“It feels like for every study, we’re
reinventing what data review is.
There’s a lack of good, reusable
data cleaning programming,
which would help with both
[data cleaning and review and
manual data reconciliation].”

Data Analyst, Small Biopharma

One senior data manager from a global CRO estimates that of the three hours each data manager
spends on reviewing data, one hour is wasted reviewing unchanged data, and another hour is spent
applying Excel formulas. According to this account, data managers are only spending one third of
their data review time reviewing new data.

The senior data manager explains the challenges around a lack of insight into data changes during data
review: “I complete a round of reconciliation and the next month when I download the data there is no
way to tell what data has changed. We apply a lot of Excel formulas and effort to figure out what’s been
updated and what queries got opened. Every month I pull the same reports and apply the same effort,
even though 70% of the data is the same.

2.2
Inefficiencies faced by CRAs

While CRAs spend most of their time on monitoring visits (22%), perceived inefficiencies center on
documentation and manual tracking – tasks that collectively consume around 18% of CRAs’ time yet
generate the highest frustration [Figure 2]. Documentation and manual tracking are identified as top
priorities for improvement by 44% of respondents.

Managing documents in multiple systems and carrying out
repetitive steps are major sources of inefficiency. One CRA from
a top 20 biopharma expresses surprise that there aren’t more
automated solutions available: “We still have to do so much manual
tracking, there’s no connection between the systems.”

Veeva benchmark analysis from over 30 biopharma companies
spanning 10 years shows that a total of 1,286 days per 1,000
monitoring visit reports (MVPs) was wasted. This equates to
$1,028,888, based on an 8 hour work day at a $100/hour rate.
Working in trackers such as Excel, OneNote, and Word accounted
for the most time wasted, costing $326,112 per 1,000 MVPs. If your
CRAs complete 20,000 MVPs, for example, the cost would reach
$6,522,240.

“I use OneNote for documenting
monitoring visits before
transcribing into the CTMS.
It’s not that I like it — I don’t have
any other tool to keep everything
in one place. I have to capture
[data] in a non-validated system.”

CRA Subject Matter Expert,
Top 20 Biopharma Company

2.3
Cost of queries

Both data managers and CRAs reported that non-EDC queries take
far longer (71%) than EDC queries to resolve. Non-EDC queries
are those generated from third-party data, such as imaging, eCOA,
and central labs. On average, an EDC query takes 9.6 days to resolve,
from when it’s created to closure, whereas non-EDC queries take
16.4 days. In terms of personal effort on query management,
respondents use 34% more active time to resolve a query on thirdparty
data than a query on EDC data. This difference in resolution
time is due in part to the delay in non-EDC data reaching the data
manager or CRA, and the additional effort required to investigate
with the vendor.

It is estimated that a query costs between $28-$225 to resolve
and that, on average, a Phase III study generates 96,980 queries.
Non-EDC queries require 34% more active time, suggesting that their
cost ranges from $32.50-$300 per query. This suggests that the
cost per query is growing to $32.50-$300, and that inefficiencies in
query management – particularly non-EDC queries – reach millions
of dollars in Phase III trials.

“Non-EDC queries take longer
to resolve in part because data
managers aren’t receiving the
data frequently. When you are
querying the data it is months old.
The site has to check different
systems to find the actual reason
or value. We may also keep the
query open until the next data
transfer, to confirm whether the
query was answered correctly
or if there is a change on the
vendor side.”

Senior Clinical Data Manager,
Global CRO

3. Root Causes of Unproductive Effort

3.1
Top drivers of inefficiency for data managers and CRAs

Dealing with disconnected systems is the primary cause of inefficiency for CRAs (71%). Too many
manual steps or re-entry (74%) is the main reason why tasks require extra effort and time in data
managers’ daily work [Figure 3].

One data manager’s description of why manual reconciliations are so inefficient highlights the impact
of too many manual steps on workflows: “The worst part of data reconciliation is tracking issues from
the prior reconciliation to the new data load. Most of us use Excel formulas and manually copy/paste,
use Vlookup, carry forward the comment, and then add the latest status. It is painful.

3.2
Tools and processes as sources of inefficiency

When asked whether technology issues or process issues mostly contribute to workflow inefficiencies,
or both equally, 48% of respondents said both technology and process issues equally. CRAs were slightly
more likely than data managers to point towards tech-specific challenges (30% vs. 25%), as were those
working for larger companies (>10,000 employees).

Sponsors are twice as likely to cite technology issues as the primary source of inefficiency than CROs
(37% vs. 16%), while CROs more often point to process challenges (29%) or a combination of the two
(49%) [Figure 4].

4. Implications of Lost Productivity

4.1
Risks and consequences of inaction for data managers and CRAs

Inefficiencies in Phase III clinical trials have wide reaching impacts, on the trial itself and on those
working on the studies. Only 6% of respondents believe that there will be little impact if inefficiencies
remain as they are [Figure 5]. For CRAs, the greatest concern is burnout (91%), highlighting an urgent
need to act. Clinical data managers also cite burnout as a key risk, but are most concerned with data
quality issues (71%).

As risk-based data management approaches become more mainstream, this potentially exacerbates the
threat to data quality as teams move away from reviewing 100% of the data. The significant concern over
data quality could impede any move towards risk-based reviews, as data managers may be reticent to
review anything less than 100% of the data. However, with data volumes growing and burnout risk rising,
it is untenable to continue reviewing 100% of the data. Inefficiencies must be addressed for better data
quality and data manager upskilling.

As expected, higher trial costs and longer timelines are commonly predicted consequences of inaction.
Respondents working at midsize companies are more concerned about higher trial costs (74%) than
those at larger companies (52%). These respondents are also more likely to cite regulatory risk as a key
concern (42%) compared to larger companies (26%).

4.2
Barriers to a more productive future

The biggest challenges to realizing a more efficient future state are
protocol complexity (58%), budget/resource constraints (57%), and
resistance to change (48%). Data managers were especially likely
to cite training gaps (57%), highlighting the need for support as
processes and technologies evolve. CRAs were concerned about
study complexity and budget/resource constraints equally (65%),
with only 22% feeling that training or skills were a barrier.

Most respondents (81%) agree that better system connectivity
would improve trial execution. Only 57% of CRAs say that their team
is actively seeking ways to modernize, compared with 75% of data
managers. However, 41% feel that SOPs fail to reflect real-world
workflows or underutilize available tools, highlighting a cultural and
procedural gap that may slow progress.

“Some trials are too complex. If I
don’t understand something as
a user acceptance tester, I have
to ask someone else which takes
more time than training would
in the long run. Lack of training
on therapeutic area databases
can also result in inadequate
testing and incomplete scenario
coverage. This can lead to issues
that require post-production
changes, potentially affecting
subject data.”

Senior Clinical Data Manager, Global CRO

5. The Future Role of Clinical Data Professionals

5.1
Evolution of the clinical data manager role

Data managers have a pragmatic view of how their role will evolve over the next two years. Most expect
to use more automation for data cleaning (71%), whereas fewer think AI assistance will support them in
decision-making, completing actions, or surfacing data insights (59%). This shows that data managers
expect – and prioritize – more automation in their operations before pursuing AI assistance.

Interestingly, fewer expect their roles to shift in a more strategic direction (37%), despite the industrywide
focus on clinical data science, and only 22% think they will be involved earlier in trial design. This
suggests that data manager role evolution will be constrained if inefficiencies are not addressed first.

5.2
Evolution of the CRA role

The vast majority (83%) of CRAs expect risk-based monitoring to become the norm over the next two
years. One CRA explains their vision for a future-state:

“I would love to have a risk-based tool where the monitor can assess whether a site needs more SDV,
based on factors like number of PDs, timeliness of data activities, and number of queries. We need good
connectivity between systems, because we capture PDs in one system and EDC data in another. A good
RBQM tool needs to read across all systems to understand the state of a site.”

Compared with data managers (9%), 22% of CRAs think there will be minimal change to their role over
the next two years. AI and automation are both seen as likely to impact future roles, as is a greater use of
remote monitoring and centralized review (57% each). These shifts point to a future CRA role that’s more
tech-enabled and focused on critical risk signals, not just site visits.

6. Conclusion and Recommendations

This research highlights a clear message from clinical data managers and CRAs: an unacceptable
amount of time is spent on manual tasks and disjointed workflows. These inefficiencies are
risking data quality, burnout, and avoidable costs. It marks an executive call-to-action for change,
not complacency with past ways of working.

A notable new trend is the burnout risk among data managers – a longstanding challenge for CRAs but
a historically less reported concern for data managers. As leaders look towards increasingly innovative
data management approaches, this research reminds us that the people doing the work are not enabled
by modern technology or efficient processes. We recommend a practical approach:

  1. Prioritize system integration
    Invest in solutions that connect clinical data and operations systems.
    This will reduce manual data entry, and provide a more unified view of
    trial progress.
  2. Embrace automation
    Centralize data review and reconciliation efforts in an automated tool that
    directly addresses the most time-consuming and frustrating manual tasks,
    within a system connected with clinical operations. See how others are optimizing these processes in this video.
  3. Streamline workflows
    Review and update SOPs to ensure they reflect real-world workflows and
    fully leverage available technologies. This may involve revisiting processes
    that are overly complex or rely on outdated manual steps.
  4. Invest in ongoing training and change management
    Provide adequate training and ongoing support to data managers and CRAs for
    long-term adoption of new tools and processes. This will help overcome resistance
    to change and ensure that professionals can effectively use new systems.
  5. Foster a culture of efficiency
    Encourage a mindset that values efficiency and continuous improvement.
    This means actively seeking feedback from clinical professionals and empowering
    them with the tools and processes they need to succeed.

Get the full report to see the study methodology and more data