This is a list of data sources I've identified as actually being used in projects, starting from here. The goal is to understand the different types of data source to know how we can access them and what input/output formats should be targeted:
- MySQL database: mysql_31129_celegans at my01.winhost.com
- Ion Channel Spreadsheet includes citations from Pubmed showing relationships of ion channels to genes.
- Wormbase, generally. There seems to be some query functionality, but it uses its own 'special' query language -- seems not very complicated; pattern based.
- Time series data?
- Movement validation data pointing to the ftp from Laura Grundy
- Comments on some of the spreadsheet cells -- not exported from google drive.
I've also found a lot of scripts for extracting data from different formats:
- Extracting from the Laura Grundy ftp
- From spreadsheets(CElegansNeuroML project)
I'm still looking around. I'll make a new post with any updates.
No comments:
Post a Comment