Managing Your Data
- Home
- Care for your dataToggle Dropdown
- Archive and preserve your dataToggle Dropdown
- Share your data
- Write your data management planToggle Dropdown
- The How and Why of Open DataToggle Dropdown
- Love Data Week 2024
- Love Data Week 2025
Contact the Data Working Group
Have another idea? Have a question we haven't answered yet?
Contact us!
Plan for future use
Envisioning future use helps broaden the reach and impact of your data, and takes planning at the beginning of your project.
This will help you identify potential areas of friction when trying to share your data (e.g., proprietary formats, unexpected costs related to sharing), and help you think critically of the future of your data.
In the tabs below, learn more about:
- Broadening your audience, by determining the who, what, when, where, and how of sharing your data.
- Considering future applications of your data, to help you critically assess what data is of future use.
- Licensing your data and research products, to ensure that others understand how you want your data used, and under what circumstances.
Plan for future use
Share your data as broadly as possible.
“Share your data” -- these three little words belie a host of questions and processes. When you break down your own sharing into who, what, when, where, why, and how, you can answer many of the questions your intended repository or future collaborators will ask of you.
Questions to think about:
- Who:
- Who generated your data? Include graduate students, post docs, undergraduate students, technical staff, etc.
- Who will be able to use your data? The public, only other scholars, only certain PIs, only certain individuals with particular funding streams?
- If you need to apply restrictions to access, explain why.
- What:
- What are you sharing? Describe the data you are sharing.
- What formats will your data take? Describe the file formats. Try to use open source, stable file formats.
- What supplemental material is needed to help others understand your data?
- Include a readme file, data dictionary, and a description of your file naming convention and file hierarchy.
- You may also need to include code or other supplemental processes.
- What restrictions do you need to consider ahead of sharing your data?
- If you are unable to openly share your data, state why.
- Consider sharing de-identified data, or group-level data instead of individual- or participant-level data.
- When:
- When was your data generated?
- This could be the time period in which data was collected, or the time period you used as part of your parameters.
- When will you make your data available to others for re-use? This is often at time of first article publication, but may depend on your current status (e.g., writing a dissertation) or your discipline (release data immediately after it is cleaned).
- When was your data generated?
- Where:
- Where will you share your data? A corporate repository? A discipline-specific repository? A repository at your institution? Your website? Something else?
- Why:
- Why was this data generated - refer to your larger projects, grants, or papers and other research products that link to your dataset and help provide the bigger picture of your work.
- How:
- How was this data produced? Document information on:
- Instruments used
- Software used
- Methods used - including for data capture, pre-processing, post-processing, cleaning data, or whatever is relevant in your circumstance.
- How is this data interpreted?
- Include software necessary to interpret your data.
- How will you help others understand this data?
- Including a readme file, a data dictionary, and a description of your file naming convention and file hierarchy can help others easily or quickly understand the data you have generated. (Not to mention, it can help you interpret your data at a future point in time!).
- How was this data produced? Document information on:
- Miscellaneous:
- Include any other concepts, restrictions, or considerations you have made ahead of sharing your data. This might include referencing other policies that impact how your data is shared, other policies that affect data management or data security, or expectations in your field.
Activity:
Map out the relationship of the research lifecycle to the data lifecycle. How does your data map to your research?
Stuck? Take a peek at this diagram, or dive into Twitter to look examples of the #datalifecycle and the #researchlifecycle.
Further reading:
- Ten Simple Rules for the Care and Feeding of Scientific DataGoodman A, Pepe A, Blocker AW, Borgman CL, Cranmer K, Crosas M, et al. (2014) Ten Simple Rules for the Care and Feeding of Scientific Data. PLoS Comput Biol 10(4): e1003542. https://doi.org/10.1371/journal.pcbi.1003542
Assess future uses of your data.
How might your data be used in the future? Funders are sensitive to funding studies that duplicate effort -- and articulating that your data is both novel and reusable will strengthen your proposal. Alternatively, noting that data exists elsewhere and you will be building off of existing data can demonstrate how deeply you understand the field, or how connected you are to current research.
Describing potential future uses of your work can be a useful exercise to do ahead of writing a grant. You can start to see long-term and future implications of your work.
Furthermore, you can start to determine what data you must share and curate for long-term protection and access, and what data is supplemental or ephemeral.
Questions to think about:
- Who might reuse the data?
- Is your data extremely specialized, so only a handful of individuals will be able to understand it and use it?
- How might you improve reuse outside of your field?
- How might the data be reused?
- Can your data be combined with other data?
- How have you made your data interoperable?
- Are there opportunities for meta-analysis?
- What data has long-term use and impact?
- Determine what data could be of future use ant interest. See the activity or the selected works for more detail.
- Data underlying your publications are important to protect for the long term, especially as retractions can occur due to inability to locate source data, even for articles published 20 years ago.
Activity:
Identify the future uses of your data. Use our checklist to help you think about future uses of your data.
- Identify the future uses of your dataWorksheet that goes over different categories and ideas related to future uses of your data, including verification, teaching and learning, future analysis, and others.
Further reading:
- How to Appraise and Select Research Data for CurationWhyte, A. & Wilson, A. (2010). "How to Appraise and Select Research Data for Curation." DCC How-to Guides. Edinburgh: Digital Curation Centre. Available online: http://www.dcc.ac.uk/resources/how-guides
- Five steps to decide what data to keepDCC (2014). 'Five steps to decide what data to keep: a checklist for appraising research data v.1'. Edinburgh: Digital Curation Centre. http://www.dcc.ac.uk/resources/how-guides
Determine what license best fits your needs.
Licensing data and other products of your research helps others understand what permissions you give in re-distributing and re-using your data. Licenses reduce uncertainty and ambiguity, and tell users up front if their intended use is ok.
Note that facts are not copyrightable, so copyright laws do not apply to facts.
Resources:
- Considerations for licensors and licenseesFrom Creative Commons, a list of basic things to think about before applying a Creative Commons license to your material. Highly recommended.
- How to License Research DataGuide to licensing research data from the Digital Curation Centre.
- Choosing an open-source licenceOpen source licensing information from the Software Sustainability Institute
- Open Source software license list from the Open Source InitiativeOpen source licenses are licenses that comply with the Open Source Definition – in brief, they allow software to be freely used, modified, and shared. To be approved by the Open Source Initiative (also known as the OSI) a license must go through the Open Source Initiative’s license review process.
Questions to think about:
- What do you want to license - code, scripts, computer programs, drawings, images, audio files, video files, spreadsheets, etc.?
- Do you want to receive attribution for your data?
- Do you want others to be able to reuse your data?
- Do you want others to be able to build upon your data?
Further reading:
- Sharing data: Legal and policy considerations.Margaret O'Brien, Rebecca Lubas, Ruth Duerr, Todd Grappone, Trisha Cruse, DataONE (September 01, 2011) "Best Practice: Sharing data: legal and policy considerations". Accessed through the Data Management Skillbuilding Hub at https://dataoneorg.github.io/Education/bestpractices/sharing-data-legal on Mar 06, 2024.
- Last Updated: Oct 21, 2024 1:56 PM
- URL: https://guides.library.umass.edu/data
- Print Page