An assessment of the informative value of data sharing statements in clinical trial registries

Bibliographic Details
Title: An assessment of the informative value of data sharing statements in clinical trial registries
Authors: Christian Ohmann, Maria Panagiotopoulou, Steve Canham, Gerd Felder, Pablo Emilio Verde
Source: BMC Medical Research Methodology, Vol 24, Iss 1, Pp 1-11 (2024)
Publisher Information: BMC, 2024.
Publication Year: 2024
Collection: LCC:Medicine (General)
Subject Terms: Data sharing, Clinical trial registry, Data sharing statement, Individual participant data, Expert, Observer variation, Medicine (General), R5-920
More Details: Abstract Background The provision of data sharing statements (DSS) for clinical trials has been made mandatory by different stakeholders. DSS are a device to clarify whether there is intention to share individual participant data (IPD). What is missing is a detailed assessment of whether DSS are providing clear and understandable information about the conditions for data sharing of IPD for secondary use. Methods A random sample of 200 COVID-19 clinical trials with explicit DSS was drawn from the ECRIN clinical research metadata repository. The DSS were assessed and classified, by two experienced experts and one assessor with less experience in data sharing (DS), into different categories (unclear, no sharing, no plans, yes but vague, yes on request, yes with specified storage location, yes but with complex conditions). Results Between the two experts the agreement was moderate to substantial (kappa=0.62, 95% CI [0.55, 0.70]). Agreement considerably decreased when these experts were compared with a third person who was less experienced and trained in data sharing (“assessor”) (kappa=0.33, 95% CI [0.25, 0.41]; 0.35, 95% CI [0.27, 0.43]). Between the two experts and under supervision of an independent moderator, a consensus was achieved for those cases, where both experts had disagreed, and the result was used as “gold standard” for further analysis. At least some degree of willingness of DS (data sharing) was expressed in 63.5% (127/200) cases. Of these cases, around one quarter (31/127) were vague statements of support for data sharing but without useful detail. In around half of the cases (60/127) it was stated that IPD could be obtained by request. Only in in slightly more than 10% of the cases (15/127) it was stated that the IPD would be transferred to a specific data repository. In the remaining cases (21/127), a more complex regime was described or referenced, which could not be allocated to one of the three previous groups. As a result of the consensus meetings, the classification system was updated. Conclusion The study showed that the current DSS that imply possible data sharing are often not easy to interpret, even by relatively experienced staff. Machine based interpretation, which would be necessary for any practical application, is currently not possible. Machine learning and / or natural language processing techniques might improve machine actionability, but would represent a very substantial investment of research effort. The cheaper and easier option would be for data providers, data requestors, funders and platforms to adopt a clearer, more structured and more standardised approach to specifying, providing and collecting DSS. Trial registration The protocol for the study was pre-registered on ZENODO ( https://zenodo.org/record/7064624#.Y4DIAHbMJD8 ).
Document Type: article
File Description: electronic resource
Language: English
ISSN: 1471-2288
Relation: https://doaj.org/toc/1471-2288
DOI: 10.1186/s12874-024-02168-8
Access URL: https://doaj.org/article/f2f7618f0d804dbbbdb01165c6415b85
Accession Number: edsdoj.f2f7618f0d804dbbbdb01165c6415b85
Database: Directory of Open Access Journals
Full text is not displayed to guests.
FullText Links:
  – Type: pdflink
    Url: https://content.ebscohost.com/cds/retrieve?content=AQICAHjPtM4BHU3ZchRwgzYmadcigk49r9CVlbU7V5F6lgH7WwEUk2bvAfAZbJ9DOcdf-KvzAAAA4jCB3wYJKoZIhvcNAQcGoIHRMIHOAgEAMIHIBgkqhkiG9w0BBwEwHgYJYIZIAWUDBAEuMBEEDNKVCNNWR4hXX9fmkgIBEICBmjNFiSy6D7mb88B0kUS_-3JvCgY5BxyONCLIOVFHQdm0h1-oTs6JPSbF1rjdawnlrQeDIY659z8ert00Nz40juFKxu1nIvdqyS2oz13lodF616279YwVozx5GL9H3__2g2OPDcZS4_Nnhv60YyOiK_Qz0Vci0Eue7XtXz6mgNqUlZ1qUJ5oRlMPg1W92kATGY6SVLspWzU90t3A=
Text:
  Availability: 1
  Value: <anid>AN0175932243;[1ci1]09mar.24;2024Mar12.05:00;v2.2.500</anid> <title id="AN0175932243-1">An assessment of the informative value of data sharing statements in clinical trial registries </title> <p>Background: The provision of data sharing statements (DSS) for clinical trials has been made mandatory by different stakeholders. DSS are a device to clarify whether there is intention to share individual participant data (IPD). What is missing is a detailed assessment of whether DSS are providing clear and understandable information about the conditions for data sharing of IPD for secondary use. Methods: A random sample of 200 COVID-19 clinical trials with explicit DSS was drawn from the ECRIN clinical research metadata repository. The DSS were assessed and classified, by two experienced experts and one assessor with less experience in data sharing (DS), into different categories (unclear, no sharing, no plans, yes but vague, yes on request, yes with specified storage location, yes but with complex conditions). Results: Between the two experts the agreement was moderate to substantial (kappa=0.62, 95% CI [0.55, 0.70]). Agreement considerably decreased when these experts were compared with a third person who was less experienced and trained in data sharing ("assessor") (kappa=0.33, 95% CI [0.25, 0.41]; 0.35, 95% CI [0.27, 0.43]). Between the two experts and under supervision of an independent moderator, a consensus was achieved for those cases, where both experts had disagreed, and the result was used as "gold standard" for further analysis. At least some degree of willingness of DS (data sharing) was expressed in 63.5% (127/200) cases. Of these cases, around one quarter (31/127) were vague statements of support for data sharing but without useful detail. In around half of the cases (60/127) it was stated that IPD could be obtained by request. Only in in slightly more than 10% of the cases (15/127) it was stated that the IPD would be transferred to a specific data repository. In the remaining cases (21/127), a more complex regime was described or referenced, which could not be allocated to one of the three previous groups. As a result of the consensus meetings, the classification system was updated. Conclusion: The study showed that the current DSS that imply possible data sharing are often not easy to interpret, even by relatively experienced staff. Machine based interpretation, which would be necessary for any practical application, is currently not possible. Machine learning and / or natural language processing techniques might improve machine actionability, but would represent a very substantial investment of research effort. The cheaper and easier option would be for data providers, data requestors, funders and platforms to adopt a clearer, more structured and more standardised approach to specifying, providing and collecting DSS. Trial registration: The protocol for the study was pre-registered on ZENODO (https://zenodo.org/record/7064624#.Y4DIAHbMJD8).</p> <p>Keywords: Data sharing; Clinical trial registry; Data sharing statement; Individual participant data; Expert; Observer variation</p> <p>Supplementary Information The online version contains supplementary material available at https://doi.org/10.1186/s12874-024-02168-8.</p> <hd id="AN0175932243-2">Background</hd> <p>Data sharing (DS) of individual participant data (IPD) in clinical trials has been promoted extensively by different stakeholders and initiatives. Data sharing makes it possible to compare or combine the data from different studies, and to more easily aggregate it for meta-analysis. It allows conclusions to be re-examined and verified or, occasionally, corrected, and it can allow new hypotheses to be tested. Sharing can therefore increase data validity, but it also creates more value from the original research investment, as well as helps to avoid unnecessary repetition of studies [[<reflink idref="bib1" id="ref1">1</reflink>]].</p> <p>In December 2015, ClinicalTrials.gov added two optional registration fields to the protocol registration and results system (PRS) to address IPD sharing: "Plan to Share IPD" and "Available IPD/Information Type". In an exploratory study, investigating the use of these fields in ClinicalTrials.gov, potential confusion in the meaning of the terms "IPD" and "sharing", and inconsistent responses were found in a qualitative analysis [[<reflink idref="bib2" id="ref2">2</reflink>]]. ClinicalTrials.gov added additional subfields to the IPD Sharing Statement section in June 2017 [[<reflink idref="bib3" id="ref3">3</reflink>]]. Currently it includes a primary statement concerning the plan to share IPD (yes, no, undecided), followed by more specific information in case of "yes" ("Plan Description, "IPD Sharing", "Supporting Information", "Time Frame", "Access Criteria" and "URL"). It was expected that by adding additional subfields with greater structure more clear and complete IPD-sharing plans would be provided [[<reflink idref="bib2" id="ref4">2</reflink>]].</p> <p>In 2017, the International Committee of Medical Journal Editors (ICMJE) required as of 1 July 2018 for manuscripts submitted to ICMJE journals on clinical trials to contain a data sharing statement (DSS) and for clinical trials enrolling participants after 1 January 2019 to include a DSS upon their registration. Individual examples of DSS that fulfil the ICMJE requirements are given [[<reflink idref="bib4" id="ref5">4</reflink>]]. The implementation of ICMJE data-sharing requirements in online journal policies was suboptimal for ICMJE-member journals and poor for ICMJE-affiliated journals [[<reflink idref="bib5" id="ref6">5</reflink>]]. Most trials published in the Journal of the American Medical Association (JAMA), the Lancet, and the New England Journal of Medicine (NEJM) after the implementation of the ICMJE policy declared their intent to make clinical trial data available. However, a wide gap between declared and actual data sharing still exists [[<reflink idref="bib6" id="ref7">6</reflink>]].</p> <p>The DSS in clinical trial registries have been analysed with respect to willingness to share IPD. Several studies have been performed calculating the percentage of studies with intention to share IPD, sometimes dependent on factors such as geographical location, study type and sponsor type (see Table four in the discussion section). Analysing the supporting information in the DSS of clinical trial registrations allows for the detailed operation and procedure of DS to be assessed. Earlier studies have sometimes found inconsistencies when comparing a generic "yes" for DS with the supplementary text and had to re-classify the assessment after exploring that text. The corrections made were, however, minor and in the range of 1-2% (Table four). What is still missing is an analysis that investigates whether the DSS are providing clear and understandable information for DS that better supports researchers in identifying clinical trials with availability of IPD for secondary use. Such an analysis could answer the following questions:</p> <p></p> <ulist> <item> Does the text clearly state that IPD sharing is supported?</item> <p></p> <item> Are the conditions/requirements for IPD sharing clearly and understandably formulated?</item> <p></p> <item> Are repositories where the data is to be stored named?</item> <p></p> <item> Are the DSS statements capable of being interpreted, at least in part, by a machine?</item> </ulist> <p>We therefore conducted a study with the primary objective of developing and evaluating a classification system for DSS of registered trials, characterising the degree and type of willingness for DS, and the extent and clarity of supporting information. We focused on COVID-19 studies, partly to keep the study within reasonable bounds, partly to ensure the focus was on recent studies, where expectation of a DSS being present are higher, and partly because we were, at the time, involved with other work supporting data sharing in COVID related clinical research. Our secondary objective was to therefore to develop a mechanism to identify COVID-19 trials with a high degree of willingness, and transparent conditions and requirements, for sharing IPD for secondary use.</p> <hd id="AN0175932243-3">Methods</hd> <p></p> <hd id="AN0175932243-4">Eligibility criteria</hd> <p>The following eligibility criteria for selecting clinical trials / clinical studies were used:</p> <p></p> <ulist> <item> Clinical trials/clinical studies about COVID-19 or SARS-Cov-2.</item> <p></p> <item> Explicit DSS available in the trial registration data.</item> </ulist> <p>Both completed and ongoing studies were considered.</p> <hd id="AN0175932243-5">Information sources</hd> <p>The identification of COVID-19 trials was performed with a metadata repository (MDR) developed by the European Research Infrastructure Network (ECRIN). The principal aim of the MDR is to make the data objects generated from clinical research easier to locate, and to describe how each of those data objects can be accessed, providing direct links to them where that is possible.</p> <p>The central idea is to develop systems that can collect the metadata about the studies and their linked data objects, including object provenance, location, and access details, from a variety of source systems (e.g., trial registries, data repositories, bibliographic systems) and aggregate it into a single repository, the MDR. The MDR standardises the metadata and provides access to it through a single system, accessed via a web portal [[<reflink idref="bib8" id="ref8">8</reflink>]] (https://newmdr.ecrin.org/). The metadata schemas, the data structures, and the data extraction are described in the project's 'About' page [[<reflink idref="bib9" id="ref9">9</reflink>]].</p> <p>The MDR currently covers the following trial registries:</p> <p></p> <ulist> <item> ClinicalTrials.gov</item> <p></p> </ulist> <p>• EUCTR</p> <p></p> <p>• ISRCTN</p> <p></p> <ulist> <item> WHO ICTRP (providing access to data from a further 15 national repositories)</item> </ulist> <p>In addition, the following repositories and other data sources are covered:</p> <p></p> <p>• Pubmed</p> <p></p> <ulist> <item> BioLINCC</item> <p></p> </ulist> <p>• YODA</p> <p>Additional file 1 provides some brief further information on the MDR's information sources. The MDR is updated weekly. As of November 2023, it included metadata from over 800,000 clinical trials / clinical studies and from over 1,300,000 related digital objects. In recent years the number of studies, globally, has grown by about 60,000 studies per year.</p> <hd id="AN0175932243-6">Sample selection</hd> <p>The study selection process is summarised in Fig. 1. The initial search was done directly on the underlying data of the MDR, using Structured Query Language (SQL) statements against the database rather than the portal. The search strategy was based on the four terms:</p> <p></p> <hd1 id="AN0175932243-7"> • 'covid', </hd1> <p></p> <hd1 id="AN0175932243-8"> • 'coronavirus', </hd1> <p></p> <hd1 id="AN0175932243-9"> • 'sars-2' </hd1> <p></p> <hd1 id="AN0175932243-10"> • 'sars2' </hd1> <p>Graph: Fig. 1Selection of clinical studies for DSS assessment (according to PRISMA 2020 flow diagram, [ [<reflink idref="bib10" id="ref10">10</reflink>] ])</p> <p>The initial selection was of all studies that included one or more of the four terms listed above, either in their titles (all titles were considered) or their related 'topics' (associated keywords, including listed target conditions). Then those with an <emph>explicit</emph> DS statement, defined:</p> <p></p> <ulist> <item> for ClinicalTrials.gov studies, as those that provided more details than the simple initial response of "Yes", 'No' or 'Undecided';</item> <p></p> <item> for data from other registries, as those that had any response at all, as a free text statement</item> </ulist> <p>were then identified as suitable for DSS categorisation. The search code is presented in the Additional file 2.</p> <p>All trials identified as having the COVID-19 linkage criteria, and an explicit DSS (<emph>n</emph>=2,953) were taken into consideration for the study. A subset of the fields extracted from these studies were then extracted into a spreadsheet for easier viewing and analysis. The fields used were:</p> <p></p> <ulist> <item> Study Id (of the record in the MDR)</item> <p></p> <item> Source Id (a registry Id)</item> <p></p> <item> Id in source (i.e. the trial registry Id)</item> <p></p> <item> Study display (default) title</item> <p></p> <item> Brief description</item> <p></p> </ulist> <p>• DSS</p> <p>At this stage no attempt was made to distinguish negative from positive DSS. From the full list, a random sample of 200 studies was taken for the study, using the random number generator in Excel. The sample was not stratified according to the length of the DSS – it is a simple random sample from the whole source population.</p> <hd id="AN0175932243-11">Process of DSS assessment and data collection</hd> <p>Two DS experts and one further person ("assessor") from ECRIN were involved in the study. Information about the experts and the assessor is included in "Author information".</p> <p>The three persons independently explored the DSS of the 200 selected studies and provided an assessment according to the following classification (Table 1). Examples were provided to help guide the assessors apply the categories and are also included below, though often with less relevant portions removed to save space. Note that the "(As of <date>)" prefix on many of the statements is added by MDR processing and was not part of the original statement.</p> <p>Table 1 Classification system for DSS (version 1)</p> <p> <ephtml> <table frame="hsides" rules="groups"><thead><tr><th align="left"><p><bold>#</bold></p></th><th align="left"><p><bold>Unclear</bold></p></th><th align="left"><p>DSS not understood, and in particular the plans for sharing IPD are not clear or understood.</p></th></tr></thead><tbody><tr><td align="left" colspan="3"><p><bold>Examples</bold>:</p></td></tr><tr><td align="left" colspan="3"><p> 1) "Centers will receive your center-specific evaluation via the publication and via the study groups."</p><p> 2) "(As of June 2020): All researchers in this study"</p><p> 3) "Personal reasons"</p><p> 4) "The study is under reviewed by medical journal reviewer."</p></td></tr><tr><td align="left"><p><bold>N</bold></p></td><td align="left"><p><bold>No sharing</bold></p></td><td align="left"><p>DSS states or implies that there will be no sharing of IPD. Different specific reasons may be provided.</p></td></tr><tr><td align="left" colspan="3"><p><bold>Examples</bold>:</p></td></tr><tr><td align="left" colspan="3"><p> 1) "(As of April 2022): We will not plan to share IPD",</p><p> 2) "The quantitative dataset will be provided by the service organisation in Leeds and they do not want this data to be made publicly available as it has some commercial sensitivity for them. ... ... IPD: Not expected to be available",</p><p> 3) "Transfer of data to third parties excluded",</p><p> 4) "Protect Patient's right EC's policy."</p></td></tr><tr><td align="left"><p><bold>0</bold></p></td><td align="left"><p><bold>No plans</bold></p></td><td align="left"><p>DSS states that there are currently no plans regarding IPD sharing, or a decision has not yet been made</p></td></tr><tr><td align="left" colspan="3"><p><bold>Examples</bold>:</p></td></tr><tr><td align="left" colspan="3"><p> 1) "(As of September 2021): To be determined",</p><p> 2) "(As of December 2021): It is not yet clear, if the data will be shared or not.",</p><p> 3) "Need a consensus from colleagues",</p><p> 4) "(As of March 2022): There is no default plan to share individual participant data. May be considered upon reaching the VCU IRB"</p></td></tr><tr><td align="left"><p><bold>V</bold></p></td><td align="left"><p><bold>Vaguely positive</bold></p></td><td align="left"><p>DSS states or more normally implies that there will be some degree of IPD sharing, but no details are given or sharing appears limited.</p></td></tr><tr><td align="left" colspan="3"><p><bold>Examples</bold>:</p></td></tr><tr><td align="left" colspan="3"><p> 1) "IPD sharing statement: The current data-sharing plans for this study are unknown and will be available at a later date. IPD: To be made available at a later date",</p><p> 2) "(As of July 2020): demographic data of the participants and diagnostic test results will be shared",</p><p> 3) "(As of February 2021): Anonymized and pseudonymized data will be published in peer reviewed journals and may be presented at congresses and conferences",</p><p> 4) "(As of January 2022): The investigators will be sharing the data, but the management plan is being designed. Information available: Study Protocol"</p></td></tr><tr><td align="left"><p><bold>R</bold></p></td><td align="left"><p><bold>By request</bold></p></td><td align="left"><p>DSS implies that IPD can be obtained by request, though who to may not be explicitly stated. If stated it is usually the investigator, sometimes the sponsor.</p></td></tr><tr><td align="left" colspan="3"><p><bold>Examples</bold>:</p></td></tr><tr><td align="left" colspan="3"><p> 1) "(As of December 2021): Upon request, deidentified participant data collected during the study may be shared. ...... Data will be made available upon request beginning 3 months following publication of the final article from this study, with no end date. ... ... Data requestors will need to sign a data access agreement.",</p><p> 2) "What will be shared: All data can be shared after patients are made unidentifiable. When: Data can be accessible 6 months after results are published... .... Where to obtain: Data can be accessible through an email to the corresponding author. How to obtain: After sending a request email to the corresponding author, data will be sent in 1 month... ....",</p><p> 3) "(As of August 2021): Individual data collected during the trial will be available after deidentification upon request to the PI. ... ...Access Criteria: Deidentified data will be shared with any researcher requesting access. ... ..."</p><p> 4) "(As of October 2021): Upon reasonable request, IPD may be shared if fully compliant with HREC requirements and European GRDP regulations"</p></td></tr><tr><td align="left"><p><bold>C</bold></p></td><td align="left"><p><bold>Complex</bold></p></td><td align="left"><p>DSS states or implies that IPD will be available but under a more complex regime than any of those described above.</p></td></tr><tr><td align="left" colspan="3"><p><bold>Examples</bold>:</p></td></tr><tr><td align="left" colspan="3"><p> 1) "De-identified IPD that underlies the results will be shared after publication of primary results and made available indefinitely to investigators whose proposed research have received ethical approval following execution of a data use agreement",</p><p> 2) "What will be shared: All data is shareable. When: Access period starts 6 months after the results are published. To whom: It will be available only to researchers working in academic and scientific institutions. Conditions: It is permissible to use this data to find patients who respond better to this treatment. Where to obtain: Contact ... .... How to obtain: The request will be answered in a week in consultation with other partners in the project. ... ...",</p><p> 3) "What data will be shared? Outcome data that underlie the results reported in the article (text, tables, figures and appendices) ... When will data be available (start and end dates)? Beginning 12 months following analysis and article publication No end date Available to whom? Researchers from a recognised research institution whose proposed use of the data has been ethically reviewed and approved by an independent committee and who accept The University of Melbourne's conditions for access Available for what types of analyses? Unspecified By what mechanism will data be made available? Written request to principal investigator at ... ...",</p><p> 4) "What will be shared: All related data will be provided upon completion of the study after disidenti?ed (sic) IPD When: The related data will be inde?nitely (sic) provided upon completion of the study. To whom: Relevant information will be provided to all interested parties for humanitarian study and research purposes upon completion of the study. Conditions: On the condition that if any of the study methods (including study design and implementation method, measurement methods, etc.) are used the copyright law be observed and our study be cited. Where to obtain: This will be provided upon completion of the study. Contact: ... How to obtain: Contact via email ... "</p></td></tr><tr><td align="left"><p><bold>S</bold></p></td><td align="left"><p><bold>Via Storage</bold></p></td><td align="left"><p>DSS states that IPD will be transferred to a repository – may be general, specific or institutional. More often not named specifically.</p></td></tr><tr><td align="left" colspan="3"><p><bold>Examples</bold>:</p></td></tr><tr><td align="left" colspan="3"><p> 1) "(As of September 2020): The results of this study will be published as open access article and all IPD that underlie results in a publication will be shared, preferentially as supplementary data set or available in a public repository. Any sensitive personal data/information will be not included.",</p><p> 2) "General dissemination plan: Planned publication in a high-impact peer-reviewed journal. IPD: Stored in non-publicly available repository",</p><p> 3) "(As of February 2022): The fully anonymised datasets analysed during the study will be stored on a publicly available repository. The COG-UK HOCI study to be shared on the UCL Data Repository data-sharing platform so that the data may be reused by other researchers. ... ...Time frame: This will be done with 6 months of public reporting of results, with data available for 5 years. Access Criteria: Fully open access",</p><p> 4) "(As of June 2021): The anonymized database will be uploaded into a public repository. Time frame: The database will be available upon publication of the results. Access Criteria: The data will be available in a public repository ..."</p></td></tr></tbody></table> </ephtml> </p> <p>To develop the classification system, a preliminary search had been performed previously for completed COVID-19 trials, again with the support of the MDR. 589 studies were identified with explicit DSS. From these studies 30 were analysed manually by two experts with respect to intention to share IPD from COVID-19 trials. From the analysis, version 1 of the classification of the DSS was derived and used as input for this study.</p> <p>The process of DSS assessment and data collection consisted of three steps:</p> <p></p> <ulist> <item> A training session, where the DSS category system, assessment procedure and the data collection process was introduced to the two DS experts and the assessor (virtual meeting).</item> <p></p> <item> Assessment of the DSS, carried out independently by each expert/assessor from home.</item> <p></p> <item> A consensus meeting between the two experts to derive a consensus in case of disagreement (virtual meeting).</item> </ulist> <p>A sample size of 200 was selected to give a 95%-confidence interval of around 10% for a given proportion. For a predicted agreement rate between two experts of 80%, the 95%-confidence interval will be between 74% and 86%, using the asymptomatic Wald method.</p> <hd id="AN0175932243-12">Study risk of bias assessment</hd> <p>A risk of biased assessment was not applied in this study.</p> <hd id="AN0175932243-13">Effect measures</hd> <p>The main outcome of the pilot study is the inter-observer variability between two experts in the assessment of the DSS. In addition, the assessment of each individual expert and the assessor is compared.</p> <p>A further outcome criterion is the agreement between the expert(s) and the consensus ("source of truth"/ "gold standard").</p> <hd id="AN0175932243-14">Synthesis methods</hd> <p>The bilateral assessments of the two experts and the assessor were tabulated in a cross table and a summary statistic of the agreement/disagreement rate between two persons was generated.</p> <p>For measuring interrater reliability between the assessment of any two persons the kappa coefficient developed by Fleiss was applied [[<reflink idref="bib11" id="ref11">11</reflink>]], the estimate and the 95%-confidence intervals (CIs) were reported. The statistical analysis was performed using the statistical software R (version 4.2.0).</p> <p>In cases of disagreement in the categorisation, the two experts from ECRIN met with a third independent person to find a consensus (virtual meeting), preferably by agreement and if not possible, by majority. The result of this consensus process was documented, and the assessment of each expert was compared with the consensus.</p> <hd id="AN0175932243-15">Registration and protocol</hd> <p>The protocol for the study was pre-registered on ZENODO [[<reflink idref="bib10" id="ref12">10</reflink>]] (https://zenodo.org/record/7064624#.Y4DIAHbMJD8). The protocol follows the PRISMA 2020 checklist [[<reflink idref="bib12" id="ref13">12</reflink>]].</p> <hd id="AN0175932243-16">Results</hd> <p></p> <hd id="AN0175932243-17">Comparison between experts</hd> <p>As per the protocol a (virtual) training session was performed, where the assessment of DSS was introduced and demonstrated with examples (9 September 2022). Thereafter, the assessment procedure for the 200 sources began. The assessment was finalised by expert A and B on 19 September and by assessor C on 20 September 2022. Expert A needed 2 hours for assessment, expert B 12 hours and assessor C 6 hours.</p> <p>Detailed comparisons of the categorisations provided by expert A, expert B and assessor C are provided in Additional file 3. For experts A and B, (Table A1a), the overall agreement is 70% (139/200). The estimated kappa is 0.62 with a 95% CI [0.55, 0.70], clearly significant. The results show that the agreement is in the range of moderate to substantial.</p> <p>The results are different when the assessment of experts A and B is compared to assessor C,). Overall agreement of assessor C against expert A is 42% (83/200), with an estimated kappa of 0.33 and a 95% CI [0.25, 0.41]. Against expert B the corresponding figures are 44% (88/200), with an estimated kappa of 0.35 and a 95% CI [0.27, 0.43]. In both cases the results clearly indicate the agreement is low. In Additional file 4 an explanatory analysis is presented, where the cross tables have been condensed in different ways.</p> <hd id="AN0175932243-18">Consensus agreement</hd> <p>Two virtual meetings were performed (15 and 16 November 2022) to achieve consensus for the cases where expert A and B disagreed (<emph>n</emph>=61). In contrast to the study protocol, assessor C was not included in these meetings and did not participate in the consensus process. Agreement of assessor C with the experts A and B was low and analysis revealed some systematic deviations, indicating that more training and experience would have been necessary for assessor C to produce comparable results.</p> <p>The consensus meetings were chaired by an independent clinical research expert (C.O.). In all cases agreement could be achieved between the experts A and B and the chair after discussion of the individual DSS and the assessments by the experts.</p> <p>The results from the consensus are presented in Table 2:</p> <p>Table 2 Classification of the 200 clinical studies after experts' consensus</p> <p> <ephtml> <table frame="hsides" rules="groups"><thead><tr><th align="left"><p><bold>Classification</bold></p></th><th align="left"><p><bold>Specification</bold></p></th><th align="left"><p><bold>Number of cases</bold></p></th></tr></thead><tbody><tr><td align="left"><p>Unclear</p></td><td align="left"><p>-</p></td><td align="left"><p>13% (25/200)</p></td></tr><tr><td align="left"><p>No data sharing</p></td><td align="left"><p>-</p></td><td align="left"><p>14% (27/200)</p></td></tr><tr><td align="left"><p>No plans</p></td><td align="left"><p>-</p></td><td align="left"><p>11% (21/200)</p></td></tr><tr><td align="left"><p>Yes</p></td><td align="left"><p>Vague</p></td><td align="left"><p>16% (31/200)</p></td></tr><tr><td align="left" /><td align="left"><p>Defined request conditions</p></td><td align="left"><p>30% (60/200)</p></td></tr><tr><td align="left" /><td align="left"><p>Defined storage location</p></td><td align="left"><p>8% (15/200)</p></td></tr><tr><td align="left" /><td align="left"><p>Complex conditions</p></td><td align="left"><p>11% (21/200)</p></td></tr><tr><td align="left"><p>Total</p></td><td align="left"><p>-</p></td><td align="left"><p>200</p></td></tr></tbody></table> </ephtml> </p> <p>The assessment of experts A and B were compared with the consensus result. The results are shown in Additional file 3.Overall agreement for expert A was 87% (173/200), and for expert B it was 82% (163/200). The estimated kappa is 0.83 with a 95% CI [0.78, 0.89] for expert A, and 0.77 with a 95% CI [0.71, 0.84] for expert B. In both cases, these results can be classified as an <emph>almost perfect</emph> level of agreement. The individual assessments of the experts and the consensus are listed in Additional file 5.</p> <p>After the consensus meetings and the intensive discussion, the classification system, whilst retaining the same categories, was extended with clearer definitions (see Table 3):</p> <p>Table 3 Classification system for DSS (version 2)</p> <p> <ephtml> <table frame="hsides" rules="groups"><tbody><tr><td align="left"><p><bold>#</bold></p></td><td align="left"><p><bold>Unclear</bold></p></td><td align="left"><p>DSS not understood. Either the statement makes no sense (e.g., is not coherent English), or IPD is not mentioned at all, or IPD is mentioned but the intention to share or not share is not made clear, even in the vaguely positive sense described by 'vaguely positive' below.</p><p>N.B. Responses that indicate that 'demographic data' or 'clinical data' or similar will be shared are ambiguous – the data could be summary tables rather than IPD. Unless clarified elsewhere in the DSS, therefore, such responses fall into this category. In all cases the intention, or not, to share IPD for secondary use is simply unknown.</p></td></tr><tr><td align="left"><p><bold>N</bold></p></td><td align="left"><p><bold>No sharing</bold></p></td><td align="left"><p>DSS states or implies that there will be no sharing of IPD. Different specific reasons may be provided. Usually an explicit statement is made (e.g. "transfer of data to third parties excluded") but sometimes a reference is made to an implied reason for not sharing ("patients' confidentiality"). DSS that refer <italic>only</italic> to sharing of aggregate or results data, or <italic>only</italic> to publication of a journal paper are also assumed to fall into the 'no sharing' category.</p></td></tr><tr><td align="left"><p><bold>0</bold></p></td><td align="left"><p><bold>No plans</bold></p></td><td align="left"><p>DSS states that there are currently no plans regarding IPD sharing, or a decision has not yet been made. Usually, an explicit statement but may be implicit in statements such "need to consult with collaborators / participants". N.B. "Not yet planned..." would fall into this category, while "not planned..." would be classified as 'no sharing'.</p></td></tr><tr><td align="left"><p><bold>V</bold></p></td><td align="left"><p><bold>Vaguely positive</bold></p></td><td align="left"><p>Vaguely positive. DSS states or implies that there will be some degree of IPD sharing, but no details are given or sharing appears limited, e.g. "To be made available at a later date". This is the weakest of the positive statements. Sometimes the intention is made clear but the details are not yet available – e.g. "The investigators will be sharing the data, but the management plan is being designed."</p></td></tr><tr><td align="left"><p><bold>R</bold></p></td><td align="left"><p><bold>By request</bold></p></td><td align="left"><p>DSS states that IPD can be obtained by request, usually as an explicit statement, occasionally just as an email address. Few further details about the access process are provided and there are no additional prerequisites identified (other than, sometimes, the fact that a Data Use Agreement must be signed). The contact person is usually the investigator, sometimes the sponsor, but the role may not be stated.</p></td></tr><tr><td align="left"><p><bold>C</bold></p></td><td align="left"><p><bold>Complex</bold></p></td><td align="left"><p>DSS states or implies that IPD will be available and describes or references a more complex regime than a simple request. Common additional requirements are for requests to be reviewed by a data access or trial committee, for the applicants to have an approved protocol, or to follow a named procedure set out in a document or web page. These DSS are therefore positive but provide more detail than simply stating the data can be requested.</p></td></tr><tr><td align="left"><p><bold>S</bold></p></td><td align="left"><p><bold>Via Storage</bold></p></td><td align="left"><p>DSS explicitly states that IPD will be transferred to a specific data repository – may be general, specific to clinical research or institutional - which would normally be named. The data repository (e.g., CSDR, Vivli, or Yoda) should have its own access policies and procedures, and not be just a repository for publicly available documents (e.g., Zenodo). This is a version of the 'complex+' category above, but with the additional requirements partially provided by repository systems.</p></td></tr></tbody></table> </ephtml> </p> <hd id="AN0175932243-19">Discussion</hd> <p>The willingness to share IPD according to DSS in trial registrations varies between 4.8% and 15.7% [[<reflink idref="bib1" id="ref14">1</reflink>], [<reflink idref="bib6" id="ref15">6</reflink>], [<reflink idref="bib13" id="ref16">13</reflink>]–[<reflink idref="bib18" id="ref17">18</reflink>]] (Table 4). In a retrospective cohort study performed with the Australian New Zealand Clinical Trials Registry (ANZTR), the commitment to share data was 22% [[<reflink idref="bib7" id="ref18">7</reflink>]]. A study with African study registers revealed nearly complete willingness to share data, not in line with studies from elsewhere in the world [[<reflink idref="bib17" id="ref19">17</reflink>]].</p> <p>Table 4 Studies investigating the willingness to share IPD in clinical trial registries</p> <p> <ephtml> <table frame="hsides" rules="groups"><thead><tr><th align="left"><p><bold>Author, year</bold></p></th><th align="left"><p><bold>Time</bold></p></th><th align="left"><p><bold>Resource</bold></p></th><th align="left"><p><bold>Trial type</bold></p></th><th align="left"><p><bold>"yes" in DSS field of trial registration</bold></p></th></tr></thead><tbody><tr><td align="left"><p>Bergeris et al., 2018 [<xref ref-type="bibr" rid="bibr2">2</xref>]</p></td><td align="left"><p>January 2016 to August 2017</p></td><td align="left"><p>CT.gov</p></td><td align="left"><p>Interventional trial</p></td><td align="left"><p>10.9% (2782/25551)</p></td></tr><tr><td align="left"><p>Tan et al., 2021 [<xref ref-type="bibr" rid="bibr7">7</xref>]</p></td><td align="left"><p>December 2018 to November 2019</p></td><td align="left"><p>ANZTR</p></td><td align="left"><p>Interventional trial</p></td><td align="left"><p>22% (329/1517) commitment to share data</p></td></tr><tr><td align="left"><p>Statham et al, 2020 [<xref ref-type="bibr" rid="bibr13">13</xref>]</p></td><td align="left"><p>January 2018 to June 2018</p></td><td align="left"><p>CT.gov</p></td><td align="left"><p>Interventional trial</p></td><td align="left"><p>5.5% (112/23040)</p></td></tr><tr><td align="left"><p>Li et al., 2021 [<xref ref-type="bibr" rid="bibr14">14</xref>]</p></td><td align="left"><p>Before 30 June 2020</p></td><td align="left"><p>CT.gov</p></td><td align="left"><p>COVID-19, interventional trial</p></td><td align="left"><p>15.7% (145/924) 17.3% (159/924) after re-classification</p></td></tr><tr><td align="left"><p>Larsson et al, 2022 [<xref ref-type="bibr" rid="bibr15">15</xref>]</p></td><td align="left"><p>Up to September 2021</p></td><td align="left"><p>CT.gov</p></td><td align="left"><p>COVID-19, interventional trial</p></td><td align="left"><p>15% (417/2759)</p><p>15.7% (432/2759) after re-classification</p></td></tr><tr><td align="left"><p>Merson et al., 2022 [<xref ref-type="bibr" rid="bibr16">16</xref>]</p></td><td align="left"><p>January 2019 to December 2020</p></td><td align="left"><p>WHO ICTRP</p></td><td align="left"><p>Registered trial</p></td><td align="left"><p>4.8% (28,684/593,595)</p></td></tr><tr><td align="left"><p>Malinga, et al., [<xref ref-type="bibr" rid="bibr17">17</xref>]</p></td><td align="left"><p>From 2019 to September 2022</p></td><td align="left"><p>PACTR, SANCTR</p></td><td align="left"><p>Registered trial</p></td><td align="left"><p>97% (1763/1818 in PACTR</p><p>100% (477/477) in SANCTR</p></td></tr><tr><td align="left"><p>Xu et al., 2020 [<xref ref-type="bibr" rid="bibr18">18</xref>]</p></td><td align="left"><p>Up to 31 December 2018</p></td><td align="left"><p>WHO ICTRP</p></td><td align="left"><p>Global registration of studies sponsored by China</p></td><td align="left"><p>Around 5%</p></td></tr></tbody></table> </ephtml> </p> <p>("yes" in the DDS field "Plan to share IPD (yes, no, undecided) <emph>DSS</emph> Data sharing statement</p> <p>We applied a slightly different search algorithm to the MDR on 8 January 2023, this time including all COVID-19 studies with an explicit DSS, but also <emph>including</emph> those who simply said "Yes", 'No' or 'Undecided' without providing any further details. This was to obtain a sample more similar to that used by other studies. This provided 7618 trials, of which 6203 were registered in ClinicalTrials.gov. From these trials 16.55% indicated a "yes" in the DSS. This is similar to the studies from Li and Larson [[<reflink idref="bib14" id="ref20">14</reflink>]].</p> <p>Table 4, summarising existing studies, shows that willingness to share IPD is between around 5 and 20%, so still very low. Our study has shown that even when willingness to share data is formulated, the DSS is vague or not specific in most cases, if the DSS is examined in more detail. So, there is doubt in a significant number of cases, whether declared willingness to share data really lead to active sharing.</p> <p>In our study we looked at the structure and content of explicit DSS and tried to derive clearer and more specific categories related to DS, helping the data requestor to get a clearer picture of the DS procedure. It turned out that some DSS are not understandable and remain "unclear", even after close inspection. Some DSS indicate clearly that there is no willingness of data sharing ("no data sharing") and in some "no plans" could be elucidated (corresponding to "undecided"). With respect to a positive indication of DS, we found a graduation of statements, starting from "yes but vague" via "yes with defined request conditions" to "yes, already with a defined storage location". Apart from that, there were some DSS, which were formulated in a way that the concrete conditions of DS could not be uniquely identified by experts. These cases were put into a category "complex".</p> <p>It had been hoped that with the help of the classification system the proportions of DSS expressing willingness for DS, supported by clear and understandable criteria for potential users, could be clarified. Before a classification system can be applied to create new knowledge, however<bold>,</bold> it should be validated. In our case we decided to involve DS experts, let them apply the classification system to a random sample of DSS and then have a look at the degree of observer variation. It turned out that there was moderate to substantial agreement between the experts (kappa=0.62, overall agreement=70%), indicating that the system could be of value when applied by DS experts. Agreement considerably decreased when these experts were compared with a third person, less experienced and trained in data sharing (the "assessor"). Here the agreement was low (kappa=0.33/0.35, overall agreement= 42%/44%). This indicates that without adequate training and experience specific in data sharing terminologies, the classification system will be of limited value. It seems to be that for non-expert users, DSS may be interpreted with substantial variability and uncertainty.</p> <p>To progress and to apply the classification system, consensus between the two DS experts was sought and taken as the gold standard for further analysis. The overall agreement of the experts with the consensus was higher than 80% with almost perfect agreement in the kappa analysis. This is promising but not surprising because the same sample was analysed again, and the results certainly need confirmation in an independent prospective study. Examining the consensus results, we found that in 63.5% (127/200) of the cases at least some degree of willingness of DS was indicated. From these statements around one quarter (31/127) were vague, stating or implying that there will be some degree of IPD sharing, but no details are given or sharing appears limited. In around half of the cases (60/127) it was stated that IPD could be obtained by request, usually as an explicit statement, and only in slightly more than 10% of the cases (15/127) it was stated that the IPD would be transferred to a specific data repository. In the remaining cases (21/127), a more complex regime was described or referenced, which could not be allocated to one of the other categories. This analysis shows that even if willingness to DS is declared, the DSS is often of limited help with respect to the concrete conditions and requirements for DS.</p> <p>Our study shows that trained and experienced experts can classify DSS and filter out the more promising ones for data reuse. It also provides some insight to the research community on the importance of making clear statements with respect to IPD sharing, and the current level of such statements.</p> <p>The difficulty is, of course, that the volume of clinical research work and publications is such that no collection of experts would ever be able to provide or maintain a comprehensive categorisation of DSS on a manual basis. In the future it may be possible to apply new techniques, similar to the text-mining algorithms that screens biomedical publications and detect cases of open data (ODDPub - Open Data detection in Publications, [[<reflink idref="bib19" id="ref21">19</reflink>]]), and that could be used to assess data sharing rates on the level of subject areas, journals, or institutions.</p> <p>It would be interesting to see whether the classification system developed could be supported and applied semi-automatically through involvement of natural language processing (NLP) techniques, utilising machine learning (ML) or Artificial Intelligence (AI). Here the consensus developed by the experts might serve as the core of a "gold standard" for comparison with the results from a machine categorisation.</p> <hd id="AN0175932243-20">Conclusions</hd> <p>To achieve an optimal clinical research data sharing environment, both for IPD meta-analysis and other secondary research purposes, it is important that the sources of potential data sharing are identifiable and that the processes required to apply for IPD are clear. To improve transparency and data reuse, journals should promote the use of unique pointers to data set location and standardized choices for embargo periods and access requirements [[<reflink idref="bib6" id="ref22">6</reflink>]]. But the Data Sharing Statements provided by researchers, and found in trial registries as well as journal papers, also need to be clear and easily categorizable, so that researchers can quickly identify potential resources.</p> <p>The study has shown that, even when DSS are semi-structured, it is currently not easy for even relative experts to clearly and consistently identify what many DSS mean. A substantial portion of DSS are still unclear, vague, or rather complex. In addition, any practical system of classification would need DSS to be machine processable, and that is far from the case at the moment. It would be much more efficient, and some degree of machine processing much more possible, if data providers, data requestors, funders and platforms need to adopt a more 'joined-up' and standardised approach [[<reflink idref="bib20" id="ref23">20</reflink>]].</p> <p>An important initial step towards a clearer and more interpretable DSS is first to require a DSS in all cases. Further steps include being clearer about exactly what information is being requested, improving the structure of the IPD sharing statement and using a standardised and controlled vocabulary for specification of all relevant aspects of the DSS – free text should be minimised. For example, if IPD can be shared by request, the procedure and the institutions involved should be described by selecting adequate options from a pre-defined list.</p> <hd id="AN0175932243-21">Weakness of the study</hd> <p>A weakness of the present study is the limited sample size investigated (<emph>n</emph>=200), which leaves some uncertainty in the conclusions. Nevertheless, the results demonstrate the deficiencies in the current handling of DSS in clinical trial registrations, which needs to be improved.</p> <p>Focusing on COVID 19 related studies may have given an overly positive impression of current data sharing intentions and statements, because of the intense interest in, and substantial encouragement of, data sharing during and immediately after the pandemic. Research in other areas of clinical research is likely to show less intention to share IPD, but further work would be needed to confirm this.</p> <p>In addition, the experts agreeing on the consensus and applying the classification system were the same and no different data sets were used for both exercises, possibly leading to bias. Here, a prospective independent study is needed to confirm the results.</p> <hd id="AN0175932243-22">Acknowledgements</hd> <p>None.</p> <hd id="AN0175932243-23">Authors' contributions</hd> <p>CO coordinated the study, organised and chaired the consensus meeting and provided the final manuscript. SC, MP, GF did the assessment of the DSS and provided input into the manuscript. MP and SC were involved in the consensus meeting. SC provided major input into the development of the classification system. PEV was involved in the statistical planning of the study and did the analysis.</p> <hd id="AN0175932243-24">Authors' information</hd> <p>One expert has 20 years' experience working with clinical research systems and data, after previous roles in healthcare and teaching. He now works for ECRIN on several data related projects and has a particular interest in managing metadata to promote FAIRness of data. The other expert is experienced in the fields of cell biology, nanotechnology, toxicology, genetics, genomics, biomedical imaging, and clinical research. Currently, she is coordinating ECRIN's data projects portfolio focusing on developing tools, good practices and guidelines that serve the European clinical research community.</p> <p>The third person involved, called "assessor", has a background as leading data manager in a clinical trial unit and has recently joined ECRIN to contribute to its data projects portfolio.</p> <hd id="AN0175932243-25">Funding</hd> <p>The study was performed in the context of the BY-COVID project. BY-COVID is funded by the European Union's Horizon Europe research and innovation programme under grant agreement number 101046203. In BY-COVID, secondary use of individual participant COVID-19 vaccine trial data and biosamples is planned (e.g., to test existing vaccines against emerging variants). The current study was performed, to support discovery of completed COVID-19 trials from clinical trial registries, willing to share IPD for secondary use in BY-COVID. The funding bodies played no role in the design of the study and collection, analysis, and interpretation of data and in writing the manuscript.</p> <hd id="AN0175932243-26">Availability of data and materials</hd> <p>The distribution of the assessments of the DSS by the experts (anonymised) is included in the tables. To avoid any problems with data privacy and identification of the experts (who are at the same time named co-authors), the individual raw data related to the classification of DSS are not included in the manuscript.</p> <hd id="AN0175932243-27">Declarations</hd> <p></p> <hd id="AN0175932243-28">Ethics approval and consent to participate</hd> <p>Not applicable to this study.</p> <hd id="AN0175932243-29">Consent for publication</hd> <p>Not applicable. Only public information from clinical trial registries was used as input into the assessment exercise.</p> <hd id="AN0175932243-30">Competing interests</hd> <p>The authors declare no competing interests.</p> <hd id="AN0175932243-31">Supplementary Information</hd> <p>Graph: Additional file 1. MDR Information Sources.</p> <p>Graph: Additional file 2. Data extraction and SQL statements.</p> <p>Graph: Additional file 3. Detailed comparisons.</p> <p>Graph: Additional file 4. Exploratory analysis.</p> <p>Graph: Additional file 5. Individual assessment and consensus.</p> <hd id="AN0175932243-32">Abbreviations</hd> <p></p> <p>• AI</p> <p></p> <ulist> <item> Artificial intelligence</item> <p></p> </ulist> <p>• ANZTR</p> <p></p> <ulist> <item> Australian New Zealand Clinical Trials Registry</item> <p></p> <item> BioLINCC</item> <p></p> <item> Biologic Specimen and data Repository Information Coordinating Center</item> <p></p> <item> COVID-19</item> <p></p> <item> Coronavirus disease</item> <p></p> </ulist> <p>• DS</p> <p></p> <ulist> <item> Data sharing</item> <p></p> </ulist> <p>• DSS</p> <p></p> <ulist> <item> Data Sharing Statement</item> <p></p> </ulist> <p>• ECRIN</p> <p></p> <ulist> <item> European Clinical Research Infrastructure Network</item> <p></p> </ulist> <p>• EUCTR</p> <p></p> <ulist> <item> EU Clinical trials Register</item> <p></p> </ulist> <p>• ICJME</p> <p></p> <ulist> <item> International Committee of Medical Journal Editors</item> <p></p> </ulist> <p>• ID</p> <p></p> <ulist> <item> Identifier</item> <p></p> </ulist> <p>• IPD</p> <p></p> <ulist> <item> Individual participant data</item> <p></p> </ulist> <p>• ISRCTN</p> <p></p> <ulist> <item> International Standard Randomised Controlled trial Number</item> <p></p> </ulist> <p>• ICMJE</p> <p></p> <ulist> <item> International Committee of Medical Journal Editors</item> <p></p> </ulist> <p>• JAMA</p> <p></p> <ulist> <item> Journal of the American Medical Association</item> <p></p> </ulist> <p>• MDR</p> <p></p> <ulist> <item> Metadata Repository</item> <p></p> </ulist> <p>• NEJM</p> <p></p> <ulist> <item> New England Journal of Medicine</item> <p></p> </ulist> <p>• NLP</p> <p></p> <ulist> <item> Natural Laguage Processing</item> <p></p> </ulist> <p>• PRS</p> <p></p> <ulist> <item> Protocol registration and results system</item> <p></p> <item> SARS-COV-2</item> <p></p> <item> Severe Acute Respiratory Syndrome Coronavirus-2</item> <p></p> </ulist> <p>• SQL</p> <p></p> <ulist> <item> Structured Query Language</item> <p></p> <item> WHO ICTRP</item> <p></p> <item> World Health Organization. International Clinical Trials Registry Platform</item> <p></p> </ulist> <p>• YODA</p> <p></p> <ulist> <item> Yale University Open Data Access</item> </ulist> <hd id="AN0175932243-33">Publisher's Note</hd> <p>Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.</p> <ref id="AN0175932243-34"> <title> References </title> <blist> <bibl id="bib1" idref="ref1" type="bt">1</bibl> <bibtext> Ohmann C, Banzi R, Canham S. Sharing and reuse of individual participant data from clinical trials: principles and recommendations. BMJ Open. 2017; 7: e018647. 10.1136/bmjopen-2017-018647. 29247106. 5736032</bibtext> </blist> <blist> <bibl id="bib2" idref="ref2" type="bt">2</bibl> <bibtext> Bergeris A, Tse T, Zarin DA. Trialists' intent to share individual participant data as disclosed at ClinicalTrials.gov. JAMA. 2018; 23: 406. 10.1001/jama.2017.20581</bibtext> </blist> <blist> <bibl id="bib3" idref="ref3" type="bt">3</bibl> <bibtext> ClinicalTrials.gov Protocol Registration Data Element Definitions for Interventional and Observational Studies. https://prsinfo.clinicaltrials.gov/definitions.html. Assessed 25 Nov 2022.</bibtext> </blist> <blist> <bibl id="bib4" idref="ref5" type="bt">4</bibl> <bibtext> Taichman DB, Sahni P, Pinborg A. Data sharing statements for clinical trials — a requirement of the international committee of medical journal editors. N Engl J Med. 2018; 376: 2277. 10.1056/NEJMe1705439</bibtext> </blist> <blist> <bibl id="bib5" idref="ref6" type="bt">5</bibl> <bibtext> McDonald L, Schultze A, Simpson A. A review of data sharing statements in observational studies published in the BMJ: a cross-sectional study. F1000Res. 2017; 19: 1708. 10.12688/f1000research.12673.1</bibtext> </blist> <blist> <bibl id="bib6" idref="ref7" type="bt">6</bibl> <bibtext> Danchev V, Yan Min Y, Borghi J. Evaluation of data sharing after implementation of the international committee of medical journal editors data sharing statement requirement. JAMA Network Open. 2021; 4: e2033972. 10.1001/jamanetworkopen.2020.33972. 33507256. 7844597</bibtext> </blist> <blist> <bibl id="bib7" idref="ref18" type="bt">7</bibl> <bibtext> Tan AC, Askie LM, Hunter KE. Data sharing—trialists' plans at registration, attitudes, barriers and facilitators: A cohort study and cross-sectional survey. Research Synthesis Methods. 2021; 12: 5641. 10.1002/jrsm.1500</bibtext> </blist> <blist> <bibl id="bib8" idref="ref8" type="bt">8</bibl> <bibtext> ECRIN Clinical Research Metadata repository. European Clinical Research Network (ECRIN). 2023. https://newmdr.ecrin.org/. Accessed 7th December 2023.</bibtext> </blist> <blist> <bibl id="bib9" idref="ref9" type="bt">9</bibl> <bibtext> ECRIN MDR About page. European Clinical Research Network (ECRIN). 2023. https://newmdr.ecrin.org/About. Accessed 7th December 2023.</bibtext> </blist> <blist> <bibtext> Canham S, Felder G, Ohmann, C. et al. Identification of COVID-19 clinical studies intending to share individual participant data for secondary use: Protocol for a pilot study. Zenodo. 2022. https://doi.org/10.5281/zenodo.7064624. Accessed 18 Jan 2023.</bibtext> </blist> <blist> <bibtext> Fleiss JL. Measuring nominal scale agreement among many raters. Psychological Bulletin. 1971; 76: 378. 10.1037/h0031619</bibtext> </blist> <blist> <bibtext> Page MJ, McKenzie JE, Bossuyt PM. The PRISMA 2020 statement: an updated guideline for reporting systematic reviews. PLoS Med. 2021; 18. 10.1371/journal.pmed.1003583. 33780438. 8007028</bibtext> </blist> <blist> <bibtext> Statham, EE, White SA, Sonwane B.et al. Primed to comply: Individual participant data sharing statements on ClinicalTrials.gov. PLoS One: 18;:e0226143. (2020). https://doi.org/10.1371/journal.pone.0226143.</bibtext> </blist> <blist> <bibtext> Li R, von Isenburg M, Levenstein M. COVID-19 trials: declarations of data sharing intentions at trial registration and at publication. Trials. 2021; 18: 153. 10.1186/s13063-021-05104-z</bibtext> </blist> <blist> <bibtext> Larson K, Sim I, von Isenburg M. COVID-19 interventional trials: analysis of data sharing intentions during a time of pandemic. Contemporary Clinical Trials. 2022; 115: 106709. 10.1016/j.cct.2022.106709. 35182738. 8847110</bibtext> </blist> <blist> <bibtext> Merson L, Ndwandwe D, Malinga T. Promotion of data sharing needs more than an emergency: an analysis of trends across clinical trials registered on the International Clinical Trials Registry Platform [version 1; peer review: 2 approved]. Welcome Open Res. 2022; 7: 101. 10.12688/wellcomeopenres.17700.1</bibtext> </blist> <blist> <bibtext> Malinga T, Mathebula L, Duduzile N. IPD sharing plan statement: a clinical trial registry perspective. Cochrane South Africa, South African Medical Research Council, Tygerberg, South Africa. 2023. https://southafrica.cochrane.org/sites/southafrica.cochrane.org/files/uploads/inline-files/Malinga-22.pdf. Assessed 18 Jan 2023.</bibtext> </blist> <blist> <bibtext> Xu Y, Dong M, Liu X. Characteristics and trends of clinical studies primarily sponsored by China in WHO primary registries between 2009 and 2018: a cross-sectional survey. BMJ Open. 2020; 10: e037262. 10.1136/bmjopen-2020-037262. 33444176. 7682458</bibtext> </blist> <blist> <bibtext> Riedel N, Kip M, Bobrov E. ODDPub – a Text-Mining Algorithm to Detect Data Sharing in Biomedical Publications. Data Sci J. 19:42.</bibtext> </blist> <blist> <bibtext> Rydzewska LHM, Stewart LA, Tierney JF. Sharing individual participant data: through a systematic reviewer lens. Trials. 2022; 23: 167. 10.1186/s13063-021-05787-4. 35189931. 8862249</bibtext> </blist> </ref> <aug> <p>By Christian Ohmann; Maria Panagiotopoulou; Steve Canham; Gerd Felder and Pablo Emilio Verde</p> <p>Reported by Author; Author; Author; Author; Author</p> </aug> <nolink nlid="nl1" bibid="bib10" firstref="ref10"></nolink> <nolink nlid="nl2" bibid="bib11" firstref="ref11"></nolink> <nolink nlid="nl3" bibid="bib12" firstref="ref13"></nolink> <nolink nlid="nl4" bibid="bib13" firstref="ref16"></nolink> <nolink nlid="nl5" bibid="bib18" firstref="ref17"></nolink> <nolink nlid="nl6" bibid="bib17" firstref="ref19"></nolink> <nolink nlid="nl7" bibid="bib14" firstref="ref20"></nolink> <nolink nlid="nl8" bibid="bib19" firstref="ref21"></nolink> <nolink nlid="nl9" bibid="bib20" firstref="ref23"></nolink>
CustomLinks:
  – Url: https://resolver.ebsco.com/c/xy5jbn/result?sid=EBSCO:edsdoj&genre=article&issn=14712288&ISBN=&volume=24&issue=1&date=20240301&spage=1&pages=1-11&title=BMC Medical Research Methodology&atitle=An%20assessment%20of%20the%20informative%20value%20of%20data%20sharing%20statements%20in%20clinical%20trial%20registries&aulast=Christian%20Ohmann&id=DOI:10.1186/s12874-024-02168-8
    Name: Full Text Finder (for New FTF UI) (s8985755)
    Category: fullText
    Text: Find It @ SCU Libraries
    MouseOverText: Find It @ SCU Libraries
  – Url: https://doaj.org/article/f2f7618f0d804dbbbdb01165c6415b85
    Name: EDS - DOAJ (s8985755)
    Category: fullText
    Text: View record from DOAJ
    MouseOverText: View record from DOAJ
Header DbId: edsdoj
DbLabel: Directory of Open Access Journals
An: edsdoj.f2f7618f0d804dbbbdb01165c6415b85
RelevancyScore: 1020
AccessLevel: 3
PubType: Academic Journal
PubTypeId: academicJournal
PreciseRelevancyScore: 1020.40380859375
IllustrationInfo
Items – Name: Title
  Label: Title
  Group: Ti
  Data: An assessment of the informative value of data sharing statements in clinical trial registries
– Name: Author
  Label: Authors
  Group: Au
  Data: <searchLink fieldCode="AR" term="%22Christian+Ohmann%22">Christian Ohmann</searchLink><br /><searchLink fieldCode="AR" term="%22Maria+Panagiotopoulou%22">Maria Panagiotopoulou</searchLink><br /><searchLink fieldCode="AR" term="%22Steve+Canham%22">Steve Canham</searchLink><br /><searchLink fieldCode="AR" term="%22Gerd+Felder%22">Gerd Felder</searchLink><br /><searchLink fieldCode="AR" term="%22Pablo+Emilio+Verde%22">Pablo Emilio Verde</searchLink>
– Name: TitleSource
  Label: Source
  Group: Src
  Data: BMC Medical Research Methodology, Vol 24, Iss 1, Pp 1-11 (2024)
– Name: Publisher
  Label: Publisher Information
  Group: PubInfo
  Data: BMC, 2024.
– Name: DatePubCY
  Label: Publication Year
  Group: Date
  Data: 2024
– Name: Subset
  Label: Collection
  Group: HoldingsInfo
  Data: LCC:Medicine (General)
– Name: Subject
  Label: Subject Terms
  Group: Su
  Data: <searchLink fieldCode="DE" term="%22Data+sharing%22">Data sharing</searchLink><br /><searchLink fieldCode="DE" term="%22Clinical+trial+registry%22">Clinical trial registry</searchLink><br /><searchLink fieldCode="DE" term="%22Data+sharing+statement%22">Data sharing statement</searchLink><br /><searchLink fieldCode="DE" term="%22Individual+participant+data%22">Individual participant data</searchLink><br /><searchLink fieldCode="DE" term="%22Expert%22">Expert</searchLink><br /><searchLink fieldCode="DE" term="%22Observer+variation%22">Observer variation</searchLink><br /><searchLink fieldCode="DE" term="%22Medicine+%28General%29%22">Medicine (General)</searchLink><br /><searchLink fieldCode="DE" term="%22R5-920%22">R5-920</searchLink>
– Name: Abstract
  Label: Description
  Group: Ab
  Data: Abstract Background The provision of data sharing statements (DSS) for clinical trials has been made mandatory by different stakeholders. DSS are a device to clarify whether there is intention to share individual participant data (IPD). What is missing is a detailed assessment of whether DSS are providing clear and understandable information about the conditions for data sharing of IPD for secondary use. Methods A random sample of 200 COVID-19 clinical trials with explicit DSS was drawn from the ECRIN clinical research metadata repository. The DSS were assessed and classified, by two experienced experts and one assessor with less experience in data sharing (DS), into different categories (unclear, no sharing, no plans, yes but vague, yes on request, yes with specified storage location, yes but with complex conditions). Results Between the two experts the agreement was moderate to substantial (kappa=0.62, 95% CI [0.55, 0.70]). Agreement considerably decreased when these experts were compared with a third person who was less experienced and trained in data sharing (“assessor”) (kappa=0.33, 95% CI [0.25, 0.41]; 0.35, 95% CI [0.27, 0.43]). Between the two experts and under supervision of an independent moderator, a consensus was achieved for those cases, where both experts had disagreed, and the result was used as “gold standard” for further analysis. At least some degree of willingness of DS (data sharing) was expressed in 63.5% (127/200) cases. Of these cases, around one quarter (31/127) were vague statements of support for data sharing but without useful detail. In around half of the cases (60/127) it was stated that IPD could be obtained by request. Only in in slightly more than 10% of the cases (15/127) it was stated that the IPD would be transferred to a specific data repository. In the remaining cases (21/127), a more complex regime was described or referenced, which could not be allocated to one of the three previous groups. As a result of the consensus meetings, the classification system was updated. Conclusion The study showed that the current DSS that imply possible data sharing are often not easy to interpret, even by relatively experienced staff. Machine based interpretation, which would be necessary for any practical application, is currently not possible. Machine learning and / or natural language processing techniques might improve machine actionability, but would represent a very substantial investment of research effort. The cheaper and easier option would be for data providers, data requestors, funders and platforms to adopt a clearer, more structured and more standardised approach to specifying, providing and collecting DSS. Trial registration The protocol for the study was pre-registered on ZENODO ( https://zenodo.org/record/7064624#.Y4DIAHbMJD8 ).
– Name: TypeDocument
  Label: Document Type
  Group: TypDoc
  Data: article
– Name: Format
  Label: File Description
  Group: SrcInfo
  Data: electronic resource
– Name: Language
  Label: Language
  Group: Lang
  Data: English
– Name: ISSN
  Label: ISSN
  Group: ISSN
  Data: 1471-2288
– Name: NoteTitleSource
  Label: Relation
  Group: SrcInfo
  Data: https://doaj.org/toc/1471-2288
– Name: DOI
  Label: DOI
  Group: ID
  Data: 10.1186/s12874-024-02168-8
– Name: URL
  Label: Access URL
  Group: URL
  Data: <link linkTarget="URL" linkTerm="https://doaj.org/article/f2f7618f0d804dbbbdb01165c6415b85" linkWindow="_blank">https://doaj.org/article/f2f7618f0d804dbbbdb01165c6415b85</link>
– Name: AN
  Label: Accession Number
  Group: ID
  Data: edsdoj.f2f7618f0d804dbbbdb01165c6415b85
PLink https://login.libproxy.scu.edu/login?url=https://search.ebscohost.com/login.aspx?direct=true&site=eds-live&scope=site&db=edsdoj&AN=edsdoj.f2f7618f0d804dbbbdb01165c6415b85
RecordInfo BibRecord:
  BibEntity:
    Identifiers:
      – Type: doi
        Value: 10.1186/s12874-024-02168-8
    Languages:
      – Text: English
    PhysicalDescription:
      Pagination:
        PageCount: 11
        StartPage: 1
    Subjects:
      – SubjectFull: Data sharing
        Type: general
      – SubjectFull: Clinical trial registry
        Type: general
      – SubjectFull: Data sharing statement
        Type: general
      – SubjectFull: Individual participant data
        Type: general
      – SubjectFull: Expert
        Type: general
      – SubjectFull: Observer variation
        Type: general
      – SubjectFull: Medicine (General)
        Type: general
      – SubjectFull: R5-920
        Type: general
    Titles:
      – TitleFull: An assessment of the informative value of data sharing statements in clinical trial registries
        Type: main
  BibRelationships:
    HasContributorRelationships:
      – PersonEntity:
          Name:
            NameFull: Christian Ohmann
      – PersonEntity:
          Name:
            NameFull: Maria Panagiotopoulou
      – PersonEntity:
          Name:
            NameFull: Steve Canham
      – PersonEntity:
          Name:
            NameFull: Gerd Felder
      – PersonEntity:
          Name:
            NameFull: Pablo Emilio Verde
    IsPartOfRelationships:
      – BibEntity:
          Dates:
            – D: 01
              M: 03
              Type: published
              Y: 2024
          Identifiers:
            – Type: issn-print
              Value: 14712288
          Numbering:
            – Type: volume
              Value: 24
            – Type: issue
              Value: 1
          Titles:
            – TitleFull: BMC Medical Research Methodology
              Type: main
ResultId 1