As we debate the topic, it might be useful to have one or more examples of idea-to-publication workflows. I don't mean just documented code on GitHub or Bitbucket, but an annotated process that includes tools and methods for each step:
- Initial idea development, reference management, document storage/collection
This might mean notes on using editing tools (emacs, vim, OpenOffice), formats (Markdown, LaTeX, Org Mode), and management tools (BibDesk, Mendeley, etc.)
- Data discovery/collection
This might include lists, and lists-of-lists of open data repositories (ICPSR for social scientists, Data.gov for the US, World Bank Databank, etc.) as well as tools/methods for collection (open data formats, web scraping, etc.).
- Code development and storage
I would include under this "code" for statistical analysis even if it is not done in what some would call a programming language (no need to debate this here), such as steps taken for spreadsheets, but also more technical work in R, Python, etc.
There are some good examples of how to use Git (this and this), but nothing that works this into the entire publishing workflow.
- Paper development/construction and revision tracking
I don't have a good sense for how prevalent the use is for tools that combine disparate tools into one document (knitr, Sweave, etc.), but I feel like these would be important things for someone to see as part of combining everything in a single, easily-shared and open framework.
There are plenty of great tutorials and whatnot for each step. But I'm asking if there are any good cases where a researcher has provided a guide through their whole process for a single work.
This post has been migrated from the Open Science private beta at StackExchange (A51.SE)