quick memo for genome assembly workshop
First talk of the Genome Assembly Workshop (#gaw) meeting will be Richard Durbin from the Wellcome Trust Sanger Institute
2011-03-16 00:06:24About a 100 people here for the #gaw meeting and also for the @genome10K meeting that follows
2011-03-16 00:07:52I'm sort of live tweeting in my role of @assemblathon co-organizer at the Genome Assembly Workshop (#gaw) in Santa Cruz.
2011-03-16 00:10:34Start of the #gaw (@Assemblathon) part of the @Genome10K meeting, Richard Durbin starting proceedings talking about SeQuence Graph format
2011-03-16 00:17:15Including us... RT @assemblathon About a 100 people here for the #gaw meeting and also for the @genome10K meeting that follows
2011-03-16 00:18:08Richard Durbin is talking about an idea for standardizing an assembly graph format #gaw
2011-03-16 00:18:08RD giving some assembly issue background: 10-15% of genome cannot be reliably mapped to. Most false positives come from mapping errors. #gaw
2011-03-16 00:25:44Richard Durbin talking about formats (BAM, etc.), and standardizing an assembly graph format in his talk on SeQuence Graph (SQG) #gaw
2011-03-16 00:31:52RD: lots of the desired tools outlined developed to some extent by Amos http://sourceforge.net/apps/mediawiki/amos/index.php?title=AMOS #gaw
2011-03-16 00:48:18Good question about how more advanced genome file formats may not always be as intuitive for a typical user of data #gaw
2011-03-16 00:49:51Now its time to assess the state of the art in genome assembly. Ian Korf up now to give the results of the @Assemblathon #gaw
2011-03-16 00:54:18Ian Korf: @Assemblathon stats first: 7 countries. 17 teams. 62 assemblies. >100 metrics. So much data, had to be sent on hard disks. #gaw
2011-03-16 01:04:44Contamination part of @Assemblathon challenge “all libraries will contain some bacterial contamination” – 5% of seq was from E. Coli #gaw
2011-03-16 01:10:23You can see the results from the Assemblathan here: http://tinyurl.com/4cpr9xv #gaw #DataOverload
2011-03-16 01:12:10A PDF of my talk (with notes) is now up on the Assemblathon website http://t.co/LxLWwOP #gaw
2011-03-16 01:21:34Assemblers have invented sequences not exist in either haplotypes used in the contest. #assemblathon #gaw
2011-03-16 01:25:31“@iGenomics: Assemblers have invented sequences not exist in either haplotypes used in the contest. #assemblathon #gaw”
2011-03-16 01:26:36From @kbradnam RT @assemblathon A PDF of my talk (with notes) is now up on the Assemblathon website assemblathon.org/pages/results #gaw
2011-03-16 01:26:38Hap path gives no penalty to switch haplotypes but does count for inconsistent haplotype blocks in the assemblies #assemblathon #gaw
2011-03-16 01:31:56Nearly all assemblies had excellent coverage (reads are all mostly mappable) # assemblathon #gaw
2011-03-16 01:39:21Ian has been using his iPhone as his text prompter. #assemblathon #gaw
2011-03-16 01:44:59Next: David Jaffe and Sante Gnerre ALLPATHS-LG "good cheep assemblies" #gaw #assemblathon
2011-03-16 02:22:44David Jaffe is talking about ALLPATH-LG. 29genomes sequenced by capillary technology cost about ~$20M per genome. #gaw
2011-03-16 02:23:22Up now is David Jaffe/Sante Gnerre from @broadinstitute with "ALLPATHS-LG: good cheap genome assemblies" #assemblathon #gaw
2011-03-16 02:24:32SG: Needed new computational methods and new laboratory methods. #gaw #assemblathon // assembly is not just an algorithmic problem!!
2011-03-16 02:25:06