#general

2016-11-21

Marcel Ramos Pérez (15:19:11): > @Marcel Ramos Pérez has joined the channel

Tim Triche (15:53:48): > @Tim Triche has joined the channel

Marcel Ramos Pérez (16:06:41): > Hi @Tim Triche, we just started this Slack group and we expect others to join too

Benjamin Haibe-Kains (16:19:29): > @Benjamin Haibe-Kains has joined the channel

Benjamin Haibe-Kains (16:22:54): > Hi @Marcel Ramos Pérez, so tell me, which are the issues you faced with PharmacoGx?

Aedin Culhane (16:24:54): > @Aedin Culhane has joined the channel

Marcel Ramos Pérez (16:26:45): > Hi @Benjamin Haibe-Kains, in general, I had some issues with navigating the complexity of the pSet object. It isn’t so straightforward to write a conversion function considering that sometimes these objects may have sensitivity and/or perturbation data. I have some code that I worked on here but I would need to look into it more: https://github.com/LiNk-NY/PharmacoGx/blob/master/R/PharmacoSetClass.R#L1320 - Attachment (GitHub): LiNk-NY/PharmacoGx > Contribute to PharmacoGx development by creating an account on GitHub.

Marcel Ramos Pérez (16:27:29): > Another issue that I had was relating all the identifiers in the pSet object. They weren’t consistent throughout.

Vince Carey (16:30:14): > @Vince Carey has joined the channel

Sean Davis (17:05:51): > @Sean Davis has joined the channel

Sean Davis (17:13:06): > Thanks, @Marcel Ramos Pérez, for the invite. We use Slack extensively and I really like it.

Sean Davis (17:13:17): > Looking forward to fluid conversations here.

Marcel Ramos Pérez (17:16:26): > Hi Sean! Thanks all for joining. We hope to enable discussions about the projects at Bioconductor and any issues/challenges that the Bioconductor community is facing. Feel free to create channels for focused discussion.

Benjamin Haibe-Kains (18:55:20): > @Marcel Ramos Pérez OK, I will look at the code

Michael Lawrence (19:04:46): > @Michael Lawrence has joined the channel

Levi Waldron (19:13:06): > @Levi Waldron has joined the channel

Jack Zhu (20:14:56): > @Jack Zhu has joined the channel

Peter Hickey (22:01:35): > @Peter Hickey has joined the channel

Marcel Ramos Pérez (23:24:33): > @Marcel Ramos Pérez pinned a message to this channel.

2016-11-22

Lucas Schiffer (13:20:18): > @Lucas Schiffer has joined the channel

Lucas Schiffer (13:39:34): > Hi all, nice to see a few familiar names here.

Kasper D. Hansen (17:27:48): > @Kasper D. Hansen has joined the channel

2016-11-23

Phil Chapman (05:12:43): > @Phil Chapman has joined the channel

Sean Davis (08:16:38): > I set up an #introductions room for folks to jot quick one-liners (or more) as introductions. It can help keep straight who is here and what their roles are. Use as you see fit.

Marcel Ramos Pérez (10:48:33): > Thanks Sean!

2016-11-28

Martin Morgan (13:12:10): > @Martin Morgan has joined the channel

Nitesh Turaga (13:44:51): > @Nitesh Turaga has joined the channel

2016-12-13

Kasper D. Hansen (22:04:21): > I have created a package_submissions channel. Someone who has access to the https://github.com/Bioconductor/Contributions repository (access = admin rights) can set up a GitHub integration. This means specifically that new packages (i.e. new issues) in the repo will be posted as messages to the channel

Kasper D. Hansen (22:04:27): > That would be nice to me.

Kasper D. Hansen (22:04:55): > At least I think you need admin rights to the GitHub repo; not 100% sure

2016-12-14

Hervé Pagès (00:59:17): > @Hervé Pagès has joined the channel

Marcel Ramos Pérez (11:57:30): > Hi @Kasper D. Hansen, I can set up an integration for this.

Marcel Ramos Pérez (12:03:02): > For those interested, join the #package_submissions channel

2016-12-19

Levi Waldron (15:26:26): > Anyone notice that the links in #new_packages all just go to the bioconductor.org base URL?

Marcel Ramos Pérez (15:28:22): > @Levi Waldron Strange. The first link in the message goes to bioconductor.org and the other 2 go to the package link.

Levi Waldron (15:30:22): > Ah you’re right! Not important anyways, especially being able to click on any of the later links.

2016-12-22

Marcel Ramos Pérez (11:25:02): > set up a reminder “BiocMultiAssay meeting” in this channel at 11:55am today, Eastern Standard Time.

Marcel Ramos Pérez (11:25:54): > set up a reminder “BiocMultiAssay meeting @noon” in this channel at 11:50am today, Eastern Standard Time.

Marcel Ramos Pérez (11:34:45): > set up a reminder “BiocMultiAssay meeting” in this channel at 11:44am today, Eastern Standard Time.

Marcel Ramos Pérez (11:36:15): > set up a reminder “BiocMultiAssay Meeting_in 10 min” in this channel at 11:50am today, Eastern Standard Time.

USLACKBOT (11:50:00): > Reminder: BiocMultiAssay Meeting_in 10 min.

Lucas Schiffer (11:55:41): > http://huntercollege.adobeconnect.com/biocmultiassay

2017-01-07

Ted Habermann (11:39:53): > @Ted Habermann has joined the channel

2017-01-13

Sean Davis (11:51:23): > Just FYI, @Vince Carey pointed out this project to me. I had heard @Ted Habermann talk about it. Looks great - Attachment (support.hdfgroup.org): HDF Server > The HDF Group is a not-for-profit corporation with the mission of sustaining the HDF technologies and supporting HDF user communities worldwide with production-quality software and services.

Sean Davis (11:52:20): > @Ted Habermann, just out of curiosity, how are you thinking about security and data sharing/access controls on the server?

2017-01-18

Azfar Basunia (10:56:28): > @Azfar Basunia has joined the channel

kristen_humphrey (12:16:39): > @kristen_humphrey has joined the channel

2017-01-31

Sean Davis (11:06:34): > @channel: Just a note that Rahul Satija will be speaking at 12 noon EST today: https://videocast.nih.gov/summary.asp?live=21733&bhcp=1

Tim Triche (12:47:24): > sean this webcast is phenomenal. thanks for the heads up.

Sean Davis (12:48:59): > Glad you are enjoying it, @Tim Triche. We are starting a single cell analysis lecture series here at NIH; Rahul is the first. Speakers with useful software and interesting science are great candidates, so any suggestions are welcome. Let me know.

Tim Triche (12:49:18): > Will Greenleaf

Tim Triche (12:49:23): > Cole Trapnell

Tim Triche (12:49:32): > Aviv Regev

Tim Triche (12:52:48): > Ido Amit (or anyone from his lab)

Tim Triche (12:57:45): > Satija just presented a fundamental problem about ex vivo vs. in vivo expression analysis – what happens when both the cells present and the transcripts present are changing. Ex vivo, it’s tough to have a control. In vivo (e.g. with CyTOF or IMC) it may be more possible to control one or the other by literally fixing the system at multiple time points. Garry Nolan and Dana Pe’er have worked extensively on tools to usefully process the data from those types of experiments and made them publicly available (e.g. via Cytobank, spade in Bioconductor, etc). A lot of people don’t know what flow/mass cytometry is. Most everyone should.

Tim Triche (12:59:42): > IMC (imaging mass cytometry, “flow on fixed tissues”) is closest to “in vivo” for tissues (stain with lanthanide-conjugated Abs, target individual cells, vaporize into ion stream, get data) but I am not aware of robust or widely available software to handle its results yet.

Tim Triche (13:00:37): > So, Garry Nolan and Dana Pe’er. Greg Finak and Raphael Gottardo as well, of course!

Tim Triche (13:01:38): > Stephanie Hicks (re: “how do you know when an experiment is over”) :grinning:

Tim Triche (13:02:38): > Satija is going to be a tough act to follow, though. Man he’s doing a terrific job.

Tim Triche (13:02:57): > I hope he will make his slides available.

2017-02-01

Sean Davis (08:50:16): > Video has been archived and will be available in a couple of days. https://videocast.nih.gov/summary.asp?live=21733&bhcp=1 - Attachment (videocast.nih.gov): NIH VideoCast - Learning the ‘metadata’ of the cell with single cell genomics and Seurat > Learning the ‘metadata’ of the cell with single cell genomics and Seurat

2017-02-09

Lori Shepherd (14:41:59): > @Lori Shepherd has joined the channel

2017-03-17

Sean Davis (13:44:31): > Hi, @Marcel Ramos Pérez, @Levi Waldron. I have been missing out a little on development around rangedraggedXYZ. If I have a per-sample set of data-frame-like objects like:

Sean Davis (13:44:34):
>     Sample Chromosome     Start       End Num_Probes Segment_Mean
> 1 sample_1          1   3301765 247650984     129902       0.0019
> 2 sample_1          2    480597  48844331      29296       0.0070
> 3 sample_1          2  48845813  48846435          4      -1.1119
> 4 sample_1          2  48848788 151775389      51079       0.0056
> 5 sample_1          2 151776167 151778171          3      -1.4951
> 6 sample_1          2 151781507 241537572      51895       0.0079

Sean Davis (13:45:15): > What is the current state-of-the-art for containing them?

Marcel Ramos Pérez (13:54:39): > The state-of-the-art class to use is RaggedExperiment. It isn’t in Bioconductor yet but you can find it at https://github.com/Bioconductor/RaggedExperiment. The process is pretty straightforward although a bit lengthy. You would have to read in the data.frames, convert each one with GenomicRanges::makeGRangesFromDataFrame, and then use the RaggedExperiment constructor on the group of GRanges or on the GRangesList. - Attachment (GitHub): Bioconductor/RaggedExperiment > Contribute to RaggedExperiment development by creating an account on GitHub.

Sean Davis (13:55:51): > Thanks, @Marcel Ramos Pérez. Are you planning on having this in for release?

Marcel Ramos Pérez (13:56:32): > I believe so. I can double check with @Martin Morgan

Martin Morgan (13:57:17): > yep

2017-04-21

Peter Hickey (09:25:15): > A group of us have submitted a ‘birds of a feather’ proposal for bioc2017 on single-cell BioC infrastructure. We’ve been discussing it via email (and github) but want to move to slack cause the email chain is running rather long already. Can I (or an admin?) invite them to join the bioc slack team and we can set up a channel to continue our discussion?

Marcel Ramos Pérez (11:16:34): > Yes, of course. I can give you invite privileges

Peter Hickey (11:26:36): > thanks marcel!

Marcel Ramos Pérez (11:30:08): > No problem!

Davide Risso (11:57:31): > @Davide Risso has joined the channel

Stephanie Hicks (12:57:33): > @Stephanie Hicks has joined the channel

Davis McCarthy (13:03:31): > @Davis McCarthy has joined the channel

Aaron Lun (13:26:42): > @Aaron Lun has joined the channel

Andrew McDavid (15:05:34): > @Andrew McDavid has joined the channel

2017-04-28

Greg Finak (21:48:08): > @Greg Finak has joined the channel

hcorrada (22:18:52): > @hcorrada has joined the channel

2017-04-30

Raphael Gottardo (02:19:09): > @Raphael Gottardo has joined the channel

2017-05-03

Artem Sokolov (10:31:16): > @Artem Sokolov has joined the channel

2017-05-10

Valerie Obenchain (15:12:13): > @Valerie Obenchain has joined the channel

Vladimir Kiselev (16:34:50): > @Vladimir Kiselev has joined the channel

Mike Smith (16:52:56): > @Mike Smith has joined the channel

2017-05-11

Mike Jiang (14:48:35): > @Mike Jiang has joined the channel

2017-05-12

Wolfgang Huber (05:25:11): > @Wolfgang Huber has joined the channel

Shweta Gopal (11:57:25): > @Shweta Gopal has joined the channel

Daniel Van Twisk (13:22:14): > @Daniel Van Twisk has joined the channel

Marcel Ramos Pérez (16:45:07): > Hi all, there is no way to make the Slack team “public.” New members would have to be added by email invitation (unless we agree to include specific domains of membership, e.g., roswellpark.org). In a way, this email invite system provides some protection against would-be spammers joining the Slack team. > P.S. I can also change a setting and automatically add new members to the #bigdata-rep channel.

2017-05-13

Sean Davis (09:56:19): > @Marcel Ramos Pérez, I use https://github.com/rauchg/slackin to effectively make a Slack team public.
>
> 1. Create a free account at https://heroku.com
> 2. Navigate to https://github.com/rauchg/slackin
> 3. Click on the Heroku link
> 4. Follow the instructions to fire up the slackin app.
> 5. Share the resulting app link with folks; with just an email, an account is created on Slack, regardless of domain, etc.

Marcel Ramos Pérez (19:38:24): > Thanks @Sean Davis. I’ll try that out.

2017-05-15

Sean Davis (14:12:34): > Introduced a new channel, #microbiome_metagenome. Some folks here might be interested in sharing and discussing there.

Durga Addepalli (15:10:30): > @Durga Addepalli has joined the channel

2017-05-16

Sean Davis (10:07:21): > Might be of interest to pharmacogenomics folks with a cancer bent: https://discover.nci.nih.gov/cellminercdb/

Sean Davis (10:07:48): > Yes, it is a shiny app.

Artem Sokolov (10:40:59): > This is great @Sean Davis! I just had somebody ask me about tools to browse CCLE gene expression data earlier this week.

Sean Davis (10:44:16): > Not mine, @Artem Sokolov, but I am just down the hall from the devs.

Steve Tsang (14:16:42): > @Steve Tsang has joined the channel

Marcel Ramos Pérez (15:40:32): > @here Our sign-up page is now live: https://bioc-community.herokuapp.com/

Sean Davis (18:03:28): > Thanks, @Marcel Ramos Pérez.

Marcel Ramos Pérez (18:04:04): > Thank you @Sean Davis :+1:

2017-05-18

John Readey (18:26:45): > @John Readey has joined the channel

John Readey (18:38:34): > Hey guys - I’m the HDF Server author - let me know if you have any questions

Marcel Ramos Pérez (21:28:56): > Welcome John! Perhaps @Vince Carey or @Shweta Gopal have questions :slightly_smiling_face:

2017-05-19

Ludwig Geistlinger (11:49:00): > @Ludwig Geistlinger has joined the channel

Aedin Culhane (14:09:42): > John, I think @Martin Morgan might have questions. He put together a package for reading the 10x genomics dataset.

Aedin Culhane (14:16:24): > John. Join the bigdata-rep channel for discussion on HDF5

Tiago C. Silva (20:15:49): > @Tiago C. Silva has joined the channel

2017-05-20

John Readey (14:04:18): > Thx @Aedin Culhane - I’ve joined the channel

2017-05-30

Levi Waldron (14:40:31): > A MultiAssayExperiment video for the Bioconductor YouTube channel: https://youtu.be/w6HWAHaDpyk - Attachment (YouTube): MultiAssayExperiment demo

Martin Morgan (15:01:31): > :+1:

2017-05-31

Sean Davis (06:23:43): > @Levi Waldron, added to Bioconductor playlist. Thanks

Sean Davis (08:10:30): > For the single cell folks: https://www.eventbrite.com/e/annual-single-cell-analysis-investigators-meeting-2017-registration-29448592533?internal_ref=login - Attachment (Eventbrite): Annual Single Cell Analysis Investigators Meeting 2017 > The Single Cell Analysis Program (SCAP), supported by the National Institutes of Health (NIH) Common Fund, will host its 5th and final Annual Investigators Meeting on June 29-30, 2017, at the Clinical Center on the NIH campus in Bethesda, Maryland. Abstracts are due by June 2, 2017; questions go to single_cell@mail.nih.gov.

Stephanie Hicks (10:17:05) (in thread): > @Sean Davis Thanks for the link! Sadly I’m committed to something else those days, but do you know if it will be live streamed?

Sean Davis (10:54:35) (in thread): > I don’t know, but an email to single_cell@mail.nih.gov might get you an answer as to videocast. If you get stonewalled, let me know and I’ll check into it.

Stephanie Hicks (10:54:56) (in thread): > thanks!

2017-06-01

Sean Davis (07:22:19): > For those with iRODS installations locally or who have to interact with iRODS, a new iRODS R client is available: https://github.com/irods/irods_client_library_r_cpp - Attachment (GitHub): irods/irods_client_library_r_cpp > irods_client_library_r_cpp - rirods R-Package

Aedin Culhane (12:53:29): > @Michael Lawrence the HDF5 conversation is in the channel #bigdata-rep.

Rafael Irizarry (14:34:10): > @Rafael Irizarry has joined the channel

Keegan Korthauer (14:38:21): > @Keegan Korthauer has joined the channel

Will Townes (17:08:26): > @Will Townes has joined the channel

Caleb Lareau (21:00:42): > @Caleb Lareau has joined the channel

2017-06-02

Leonard Goldstein (14:57:40): > @Leonard Goldstein has joined the channel

2017-06-05

Stephanie Hicks (13:53:52) (in thread): > Heard back on this, but unfortunately it won’t be live streamed this year.

2017-06-08

Cole Trapnell (12:37:17): > @Cole Trapnell has joined the channel

Samuela Pollack (15:55:13): > @Samuela Pollack has joined the channel

2017-06-09

Fanny Perraudeau (21:03:32): > @Fanny Perraudeau has joined the channel

2017-06-21

Michael Stadler (02:04:51): > @Michael Stadler has joined the channel

Panagiotis Papasaikas (03:13:48): > @Panagiotis Papasaikas has joined the channel

2017-06-23

Martin Aryee (13:28:19): > @Martin Aryee has joined the channel

2017-06-29

Steve Tsang (09:46:06): > @Steve Tsang has joined the channel

2017-07-26

Ju Yeong Kim (14:24:18): > @Ju Yeong Kim has joined the channel

2017-07-28

Leonardo Collado Torres (08:32:06): > @Leonardo Collado Torres has joined the channel

Ayshwarya Subramanian (13:21:14): > @Ayshwarya Subramanian has joined the channel

2017-07-30

Kevin Rue-Albrecht (15:58:05): > @Kevin Rue-Albrecht has joined the channel

2017-07-31

Radhika Khetani (10:04:19): > @Radhika Khetani has joined the channel

John Hutchinson (10:34:49): > @John Hutchinson has joined the channel

Lorena Pantano (12:19:52): > @Lorena Pantano has joined the channel

2017-08-02

Michael Steinbaugh (12:10:54): > @Michael Steinbaugh has joined the channel

2017-08-07

Sean Davis (13:48:00): > To invite others: https://bioc-community.herokuapp.com/

Sean Davis (13:48:09): > @Sean Davis pinned a message to this channel.

Peter Hickey (16:24:16): > for those who couldn’t make BioC2017, i wrote up my highlights from developer day https://twitter.com/PeteHaitch/status/894628520042921984 includes an overview of recent work on single-cell genomics in BioC, including SingleCellExperiment and DelayedArray and friends - Attachment (twitter): Attachment > A belated blog of my highlights from Developer Day at #bioc2017 http://peterhickey.org/blog/2017/08/07/bioc2017-developer-day.html #rstats

2017-08-10

Stuart Lee (20:09:38): > @Stuart Lee has joined the channel

Peter Hickey (21:14:20): > @Sean Davis, @Marcel Ramos Pérez: @Stuart Lee was unable to sign up via https://bioc-community.herokuapp.com/ - have there been any other reported issues?

Peter Hickey (21:15:51): > specifically, got an error on the app (missing_scope)

Di Cook (21:22:13): > @Di Cook has joined the channel

2017-08-11

Sean Davis (08:53:19): > Thanks, @Peter Hickey. I see the same thing, so we may need to update the app, @Marcel Ramos Pérez.

Marcel Ramos Pérez (08:54:21): > I’ll get to that today

Marcel Ramos Pérez (09:52:06): > @here The app should be functional now. Please let me know if there are any issues. Thanks.

2017-08-12

Sean Davis (15:17:07): > Hi, all. I created the #bioc_git channel to discuss all things bioc and git. If we don’t use it, it can go away, but I suspect there will be some interest.

Marcel Ramos Pérez (15:20:04): > Thanks @Sean Davis

2017-08-14

Meeta Mistry (14:37:07): > @Meeta Mistry has joined the channel

2017-08-17

Martin Morgan (16:35:35): > how am I supposed to tell people to join this group? Following Sean’s pointer to https://bioc-community.herokuapp.com/ takes me to an un-inviting page advertising that there is ‘nothing here yet’

Nitesh Turaga (16:37:40): > https://community-bioc.slack.com/messages/C6MVC96AZ

Nitesh Turaga (16:38:36): > oops that’s for #bioc_git

Nitesh Turaga (16:38:48): > https://community-bioc.slack.com/messages/C35G93GJH

Nitesh Turaga (16:38:52): > that is for #general

Marcel Ramos Pérez (16:39:28): > What group? Do you mean the Slack team (Bioc-community)?

Martin Morgan (16:44:57): > Yes, slack team bioc-community. If I follow Nitesh’s link and I’m not signed in, I’m asked for an email address and password to sign in, but confusingly told that if I have a particular email domain (the person I would like to invite does not have that email domain) I could create an account. So how would someone not at roswellpark.org or fredhutch.org join the team?

Marcel Ramos Pérez (16:45:46): > We have a link for public use

Marcel Ramos Pérez (16:45:53): > https://bioc-community.herokuapp.com/

Marcel Ramos Pérez (16:46:05): > Ah I see the issue

Marcel Ramos Pérez (16:46:11): > Something is wrong with the app

Martin Morgan (16:46:53): > …or not, it ‘works for me’ this minute, but obviously not a nice way to invite people…

Marcel Ramos Pérez (16:49:01): > Ah, that’s strange. After hitting refresh a few times it shows up. We set it up so that we didn’t have to invite each user with their email address individually

Marcel Ramos Pérez (16:55:58): > We could look into using a different platform for the APP ’cause it seems that the issue is with Heroku’s spin up time

Marcel Ramos Pérez (16:56:16): > https://github.com/rauchg/slackin

2017-09-01

Sean Davis (10:49:37): > I created a discussion channel for #pharmacogenomics in hopes of building an online community around drug response prediction, target prediction, repositioning, and related cheminformatics.

Kasper D. Hansen (11:06:13): > might be good to try to get Thomas Girke involved

Thomas Girke (12:35:31): > @Thomas Girke has joined the channel

2017-09-05

Benjamin Haibe-Kains (09:10:25): > nice, thanks @Sean Davis

2017-09-06

Kevin Horan (19:51:47): > @Kevin Horan has joined the channel

2017-09-12

Matt Ritchie (02:57:40): > @Matt Ritchie has joined the channel

2017-09-15

Alexander Bertram (10:19:58): > @Alexander Bertram has joined the channel

2017-10-06

Joshua Campbell (15:37:46): > @Joshua Campbell has joined the channel

David Jenkins (16:32:12): > @David Jenkins has joined the channel

2017-10-07

Evan Johnson (21:54:13): > @Evan Johnson has joined the channel

2017-10-10

Tyler Faits (10:38:57): > @Tyler Faits has joined the channel

2017-10-27

Guangchuang Yu (01:27:36): > @Guangchuang Yu has joined the channel

Nicholas Clark (11:01:54): > @Nicholas Clark has joined the channel

natedolson (11:05:59): > @natedolson has joined the channel

Laurent Gatto (11:51:14): > @Laurent Gatto has joined the channel

Stian Lågstad (13:26:15): > @Stian Lågstad has joined the channel

2017-10-30

cruiz (05:37:56): > @cruiz has joined the channel

2017-11-02

Guangchuang Yu (06:01:28): > The GO.db data source is still from half a year ago. > > | GOEGSOURCEDATE: 2017-Mar29

Guangchuang Yu (06:01:41): > same for org.Hs.eg.db

Guangchuang Yu (06:02:06): > Were these annotation pkgs not updated in the recent release?

Valerie Obenchain (08:30:40): > @Guangchuang Yu This question should be asked on bioc-devel. Show the package version you’re using, output of sessionInfo() etc.

2017-11-03

Nicholas Clark (15:23:28): > I want to add a bug fix to the release branch of my package. It is currently version 1.4.0 in the RELEASE_3_6 branch and version 1.5.0 in the master branch. Should I change the version to 1.5.1 in both RELEASE_3_6 and master or should it be 1.4.1 in the RELEASE_3_6 branch?

Sean Davis (15:24:09): > 1.4.1 in release. 1.5.1 in devel.

Nicholas Clark (15:30:12): > Okay. Do I have to “cherry-pick” the commit from the master branch like it says here? http://bioconductor.org/developers/how-to/git/bug-fix-in-release-and-devel/ Can I do different commits on the master and release branches or will that mess something up?

Nicholas Clark (15:30:56): > Because the changes would be slightly different - different versions in each commit

Sean Davis (15:32:55): > There are multiple ways to do the bug fix on multiple branches with git, but following the instructions in the link is a good way to go.

Nicholas Clark (15:33:11): > Got it. Thanks
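
Editor's sketch of the exchange above: one possible way to land the same bug fix on both branches with the version numbers Sean gives (1.4.1 in release, 1.5.1 in devel). This is not necessarily the exact sequence on the linked how-to page. The throwaway repository, the package name `examplePkg`, the file `R/add.R`, and the `set_version` helper are all hypothetical, and committing the fix separately from the version bumps is an assumption made here so that the cherry-pick does not touch DESCRIPTION.

```shell
#!/bin/sh
set -eu

# Hypothetical throwaway repository standing in for a Bioconductor package.
repo="$(mktemp -d)"
cd "$repo"
git init -q
git symbolic-ref HEAD refs/heads/master   # name the initial branch "master"
git config user.email "dev@example.org"
git config user.name "Example Dev"

# Hypothetical helper: rewrite DESCRIPTION with the given version.
set_version() {
    printf 'Package: examplePkg\nVersion: %s\n' "$1" > DESCRIPTION
}

# State at release time: version 1.4.0, with a bug in R/add.R.
set_version 1.4.0
mkdir R
printf 'add <- function(x, y) x - y\n' > R/add.R
git add .
git commit -q -m "Release 1.4.0"
git branch RELEASE_3_6

# Devel moves on to 1.5.0 after the release branch is cut.
set_version 1.5.0
git commit -q -am "Bump devel version to 1.5.0"

# 1. Fix the bug on master in a commit that does not touch the version.
printf 'add <- function(x, y) x + y\n' > R/add.R
git commit -q -am "Fix sign error in add()"
fix_commit="$(git rev-parse HEAD)"

# 2. Bump the devel version in its own commit.
set_version 1.5.1
git commit -q -am "Bump devel version to 1.5.1"

# 3. Cherry-pick only the fix onto the release branch, then bump it there.
git checkout -q RELEASE_3_6
git cherry-pick "$fix_commit" > /dev/null
set_version 1.4.1
git commit -q -am "Bump release version to 1.4.1"
```

Because the fix commit changes only `R/add.R`, the differing version lines in each branch's DESCRIPTION never collide; if the fix and the version bump were in one commit, the cherry-pick would likely need a manual conflict resolution in DESCRIPTION.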

2017-11-09

Parham Solaimani (11:32:01): > @Parham Solaimani has joined the channel

Parham Solaimani (11:41:29): > @Marcel Ramos Pérez @Martin Morgan thank you for the invitation.

2017-11-21

Leonardo Collado Torres (17:16:31): > How frequently do you find the answer to your own question when you are almost done posting the question? That just happened to me at https://support.bioconductor.org/p/103299/. Other times I simply haven’t posted the question. This time I decided to post it, just to show that it can happen

2017-11-22

Martin Morgan (08:47:32): > I think that’s a major benefit of trying to make a reproducible example when asking a question – in the process, you realize what the problem is!

2017-11-28

Simina Boca (14:19:29): > @Simina Boca has joined the channel

2017-11-29

Matthew McCall (09:28:24): > @Matthew McCall has joined the channel

2017-12-06

Kevin Wang (19:13:47): > @Kevin Wang has joined the channel

2017-12-07

Federico Marini (18:48:06): > @Federico Marini has joined the channel

Dr H (23:24:38): > @Dr H has joined the channel

2017-12-08

Charlotte Soneson (04:14:31): > @Charlotte Soneson has joined the channel

2017-12-11

Marcel Ramos Pérez (15:41:41): > set the channel topic: Link to join the slack team - https://bioc-community.herokuapp.com/

Simina Boca (16:28:43): > I have to say that that link did not work for me

Simina Boca (16:29:02): > @Stephanie Hicks was kind enough to invite me

Ricard Argelaguet (16:46:10): > @Ricard Argelaguet has joined the channel

Kevin Rue-Albrecht (17:00:46): > For what it’s worth, I also had to invite a couple of people last week who had the same issue

2017-12-12

Simina Boca (14:51:30): > I often use the Human Metabolome Database (HMDB - http://www.hmdb.ca/) to get putative metabolites that correspond to specific m/z values in untargeted experiments

Simina Boca (14:53:06): > It doesn’t seem like there’s a BioC package to do this linkage and it’s a pain to either a) go to HMDB and download results as a CSV file (which isn’t always 100% dependable as that option was down for days) or b) go through some custom Python script a collaborator wrote that requires me to switch to a Mac from my usual Windows setup

Simina Boca (14:53:51): > Any interest here in working together on seeing if this can get connected with BioC?

Sean Davis (14:55:37): > Does HMDB offer a programmatic API? It seems not.

Simina Boca (14:58:07): > I don’t see it - only the actual files: http://www.hmdb.ca/downloads

Sean Davis (15:00:31): > So, if licensing permits, you could certainly wrap up those files as a data package along with helper methods for access. Would that facilitate your work?

Simina Boca (15:03:08): > I think so

Simina Boca (15:03:19): > I guess we would have to make sure we keep up to date with the versions in that case?

Sean Davis (15:05:08): > It seems they release every few years, but keeping up with their versions makes sense, generally.

Simina Boca (15:06:00): > I should probably ask them about licensing and such

Simina Boca (15:06:09): > They’re supported by Canadian research grants

Simina Boca (15:06:20): > So in theory should be OK with this

Simina Boca (15:18:06): > Are there any best practices for data packages specifically? - Sorry if it’s somewhere obvious, I’ve only had experience with software packages so far

Martin Morgan (15:53:22): > Is this relevant? https://github.com/Bioconductor/Contributions/issues/493 @Vince Carey - Attachment (GitHub): hmdbQuery package submitted · Issue #493 · Bioconductor/Contributions > Update the following URL to point to the GitHub repository of the package you wish to submit to Bioconductor Repository: https://github.com/vjcitn/hmdbQuery Confirm the following by editing each …

Simina Boca (15:55:19): > Possibly!

Simina Boca (15:55:24): > Thank you! Will take a look!

Sean Davis (15:58:59): > Nice….

Simina Boca (16:03:14): > I may ask @Vince Carey about it :slightly_smiling_face:

2017-12-13

Vince Carey (20:40:58): > I would be happy to discuss this, @Simina Boca. Or file issues at the github repo.

2018-01-03

Stephane Ballereau (14:01:21): > @Stephane Ballereau has joined the channel

2018-01-09

Marcel Ramos Pérez (15:07:55): > @Simina Boca @Kevin Rue-Albrecht Sorry for the delay. The invite system should be fixed for the Heroku app. Let me know if there are any other issues.

2018-01-24

Derek Bazinet (rpci Dlar) (14:07:57): > @Derek Bazinet (rpci Dlar) has joined the channel

2018-01-27

Martin Morgan (15:46:56): > <!channel> We’re reaching our ‘free’ quota of 10,000 messages, after which older messages will be deleted. We (Bioconductor) could pay for our use and keep more messages, but personally I think of slack as fundamentally ephemeral, with more permanent solutions available to the community (the bioc-devel mailing list and the support site, for instance). I’d therefore propose that we continue with the free plan, and let old messages expire. Please add reactions for approval (:+1:) or disapproval (:-1:), and start a thread for any substantive comments.

2018-01-31

Kasper D. Hansen (21:44:18): > Any plans for updating Rsamtools / Rhtslib? There seem to be “recent” changes to the SAM format related to super long reads from nanopore, and we find that we are unable to fully parse alignments generated by minimap2, which is supposedly one of the better current aligners for that data (written by Heng Li)

Kasper D. Hansen (21:44:52): > So far we plan to work around it with pre-filtering on the command line, but it seems it might be worth spending some effort on this, given the rise of nanopore.

Kasper D. Hansen (21:45:19): > One issue is that the “flag” field now accepts bigger values than 2047

Kasper D. Hansen (21:45:45): > I don’t know enough about this to know if the alignment files we have are typical or weird

Kasper D. Hansen (21:46:17): > I am aware that this is not easy due to the many changes in htslib
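
On the flag values mentioned above: in the SAM specification the FLAG field is a bit mask, and bit 0x800 (2048) marks a supplementary alignment, so any record with that bit set has a flag above 2047. Whether this is the specific thing that trips up parsing here is not established in the thread. The following is only a minimal base R sketch for inspecting such values; `decode_sam_flag` and `sam_flag_bits` are hypothetical names, not part of Rsamtools or Rhtslib.

```r
# Bit names follow the SAM specification's FLAG table.
sam_flag_bits <- c(
  paired = 0x1, proper_pair = 0x2, unmapped = 0x4, mate_unmapped = 0x8,
  reverse = 0x10, mate_reverse = 0x20, first_in_pair = 0x40,
  second_in_pair = 0x80, secondary = 0x100, qc_fail = 0x200,
  duplicate = 0x400, supplementary = 0x800
)

# Hypothetical helper: return the names of the bits set in a single FLAG value.
decode_sam_flag <- function(flag) {
  stopifnot(length(flag) == 1L)
  bits <- as.integer(sam_flag_bits)
  is_set <- bitwAnd(rep(as.integer(flag), length(bits)), bits) != 0L
  names(sam_flag_bits)[is_set]
}

# 2064 = 2048 (supplementary) + 16 (reverse strand)
decode_sam_flag(2064)
```

A filter that assumes flags never exceed 2047 would miss or reject such records, which is one reason to check the decoded bits before pre-filtering on the command line.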

2018-02-01

Aaron Lun (04:15:35): > Yes, this has caused some grief on our end as well. Can’t manage to get BAM files for aligned nanopore data.

Martin Morgan (05:03:29): > Updating Rsamtools and Rhtslib are both on the radar, and both relatively big; Rhtslib might be more straight-forward on most platforms (swapping in current code to compile the library); Rsamtools should be updated to use Rhtslib. Rhtslib on Windows is more challenging, but documented in the package – for the original version, htslib wasn’t building on Windows and Nate assembled the appropriate tool chain / code changes to make that happen…

Martin Morgan (05:04:20): > see for instance https://github.com/Bioconductor/Rsamtools/pull/1, but the pull request requires updated library code - Attachment (GitHub): support BAMs with >65535 CIGAR operators by lh3 · Pull Request #1 · Bioconductor/Rsamtools > This PR enlarges bam1_core_t::n_cigar to 32 bits. It is a necessary ABI change to hold cigars longer than 65535 in memory. For typical BAM records, the PR changes nothing. For a record with >65535 …

Martin Morgan (06:29:30): > I’ll try to make this a priority for the next several weeks.

Guangchuang Yu (08:16:33): > is there an option to ignore case in AnnotationDbi::select? https://www.biostars.org/p/296321/#296351

Sean Davis (09:20:47): > Not a direct answer to your question @Guangchuang Yu, but if you want more flexibility in name matching, there are a lot of options. https://www.biostars.org/p/296321/#296377
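
One of the simpler name-matching options can be sketched in base R: normalize the case of both the query and the valid keys, then hand the canonical keys to the lookup. This is only an illustration under stated assumptions; `match_keys_ignore_case` is a hypothetical helper, and `valid_keys` stands in for whatever keys the database reports.

```r
# Hypothetical helper: map user-supplied identifiers onto the canonical keys
# of a database, ignoring case. Returns NA where nothing matches.
match_keys_ignore_case <- function(query, valid_keys) {
  idx <- match(toupper(query), toupper(valid_keys))
  valid_keys[idx]
}

# Toy key set standing in for the keys of a real annotation database.
valid_keys <- c("BRCA1", "TP53", "C1orf112")
matched <- match_keys_ignore_case(c("brca1", "Tp53", "c1ORF112", "nope"), valid_keys)
matched
```

With an OrgDb object, `valid_keys` could plausibly come from `AnnotationDbi::keys(org.Hs.eg.db, keytype = "SYMBOL")`, and the non-NA results would then be passed to `AnnotationDbi::select()`. Note that if two keys differ only by case, `match()` returns the first one.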

Aaron Lun (09:24:14) (in thread): > One of the other nice things about Rhtslib is its support for CRAM file input, which is not (I think) supported by Rsamtools. Admittedly, I haven’t heard of a lot of users clamouring for this, but perhaps it’s a case of “build it and they will come”.

Dror Berel (10:28:10): > @Dror Berel has joined the channel

Hervé Pagès (13:14:15): > Erick Matsen from the Hutch just posted this job opening on the support site: “Full-time bioinformatics instructor position at Fred Hutch” https://support.bioconductor.org/p/105557/

Martin Morgan (17:07:37): > Hmm, I wonder if there’s a ‘post-it-to-slack’ image / link that could be embedded, like the tweetie bird and facebook links on support site issues (e.g., after the ‘Follow via email’ drop-down on the link Hervé provides)?

2018-02-23

Nancy Liu (15:16:27): > @Nancy Liu has joined the channel

2018-02-26

Dario Righelli (16:17:51): > @Dario Righelli has joined the channel

Davide Risso (16:25:34): > Hi all! As you may know @Levi Waldron and I have been co-organizing an R/Bioconductor meetup in NYC which has a pretty good turnout… We took inspiration from the one in Boston that has been going on for a while (I think @Aedin Culhane and @Vince Carey organize that?). @Dario Righelli, a visiting student from Italy, asked me if he can have Bioconductor’s blessing to organize a Bioconductor meetup in Naples once he goes back. First of all, I wanted to relay his question to the group… second of all, is there interest in creating a #meetup channel here to discuss possible new meetups and/or exchange materials among the existing ones?

Kevin Rue-Albrecht (16:28:26): > UK-based here, but I like the idea of meetups and a dedicated channel to advertise them, hoping that I could occasionally have the blessing of my PI to join :slightly_smiling_face:

Aaron Lun (16:40:06): > What does this entail - weekly/monthly meetings? I’m sure a number of Cambridge (UK) people might be interested.

Levi Waldron (16:43:00): > We’ve been doing monthly hands-on workshop-style meetings, trying to make sure there’s always another scheduled event coming up: https://www.meetup.com/BiocNYC/ - Attachment (Meetup): New York City R/Bioconductor for Genomics (New York, NY) > This group will meet regularly to discuss topics related to the application and development of R and Bioconductor workflows and packages for data analysis and visualization of genomics data.

Aaron Lun (16:43:43): > Ah, there’s a classic figure.

Levi Waldron (16:43:58): > I had a special logo until meetup went to their new interface…

Levi Waldron (16:44:30): > but yes, I like this one

Davide Risso (16:46:50): > OK, given that there seems to be interest in at least discussing meet-ups, I went ahead and created the #meetups channel!

2018-02-27

Samuela Pollack (09:23:24): > @Samuela Pollack has joined the channel

2018-03-01

Peter Haverty (15:54:31): > @Peter Haverty has joined the channel

2018-03-04

Stian Lågstad (07:33:42): > Setting up Travis CI to build a Bioconductor package was easier than I thought it would be. I posted some simple instructions here: http://stianlagstad.no/setting-up-a-continuous-integration-pipeline-for-an-r-bioconductor-package-with-travis/ in case anyone else here would like to do the same. - Attachment (Stian Lågstad): Setting up a continuous integration pipeline for an R Bioconductor package with Travis > I’ve been thinking about setting up a continuous integration pipeline for the chimeraviz package for a while. Two things held me back: I thought it would be a hassle to make work. I was the o…

Stian Lågstad (08:28:24): > Related: Any tips for improving R CMD check / devtools::check() time? Sometimes the Travis build fails because of the 50min time limit.

Kevin Rue-Albrecht (08:29:57): > that usually happens when many packages need to be updated (e.g. new release or long gap between two Travis builds). > My usual fix is to relaunch the Travis build. Typically the second try is already enough to complete without timeout

Stian Lågstad (08:31:28): > I noticed that when it actually got finished with installing dependencies and got them cached, the next build succeeded. But yesterday I had 4-5 failing builds before it finally managed to cache dependencies

Kevin Rue-Albrecht (08:31:32): > Building a large set of packages from source can take a long time, and there isn’t much one can do about it. Make sure the .travis.yml is set to cache packages, by the way

Stian Lågstad (08:32:01): > Thanks :slightly_smiling_face: Good to know other people have experienced the same

Kevin Rue-Albrecht (08:34:14): > Yes, indeed. Just to give an example, just last week I had to throw in a fix to a package that I hadn’t touched in months because a breaking change was introduced in a dependency. Travis timed out because I basically had to reinstall all the dependencies from the new Bioc release branch

Stian Lågstad (08:35:45): > Hehe, a bit annoying, yeah. But Travis is supernice either way. Almost can’t believe it’s free.

Kevin Rue-Albrecht (08:37:18): > Same here :slightly_smiling_face:

2018-03-06

Stian Lågstad (16:00:23): > I wrote a short post about how I use the (super nice) Bioconductor Docker images: http://stianlagstad.no/developing-a-bioconductor-package-with-rstudio-and-docker/. Hope this is helpful for someone! (Rest assured, I don’t plan to post a lot like this here :) - Attachment (Stian Lågstad): Developing a Bioconductor package with RStudio and Docker > If you’re going to develop a Bioconductor package you’ll soon discover that your package has to work on both the development version and the release version of Bioconductor. This means …

2018-03-07

Elizabeth Purdom (14:34:48): > @Elizabeth Purdom has joined the channel

Nitesh Turaga (16:24:26): > Hi <!channel>, > > I realize a lot of the bioc-devel members hang out on this slack. > > We are cleaning up our Bioconductor organization on Github to make sure the organization stays secure, and that access to repositories is limited to members as needed. > > If anyone is removed from a repository, or has lost access to something, please reply to the email thread on the bioc-devel mailing list.

2018-03-16

Cotton Seed (11:27:00): > @Cotton Seed has joined the channel

Joey McMurdie (12:29:59): > @Joey McMurdie has joined the channel

2018-03-26

Jayaram Kancherla (14:07:39): > @Jayaram Kancherla has joined the channel

2018-03-27

Han Zhang (09:05:35): > @Han Zhang has joined the channel

2018-03-28

Laurel DiBrog (13:39:57): > @Laurel DiBrog has joined the channel

Hirak (14:18:55): > @Hirak has joined the channel

Rob Patro (14:25:08): > @Rob Patro has joined the channel

2018-03-30

Nitesh Turaga (16:17:00): > <!channel> https://stat.ethz.ch/pipermail/bioc-devel/2018-March/013090.html

2018-03-31

Aaron Lun (13:43:10): > Outstanding, thanks guys!

2018-04-22

Martin Morgan (17:17:17): > Check out Bioconductor core team jobs as programmer / analyst https://www.roswellpark.org/careers/administrative/programmeranalyst-5817 or senior programmer / analyst https://www.roswellpark.org/careers/administrative/senior-programmeranalyst-5656 Tell your friends!

2018-05-03

Kasper D. Hansen (13:47:13): > Do we have an invite link for this slack group?

Nitesh Turaga (13:47:39): > @Marcel Ramos Pérez will know.

Kasper D. Hansen (13:47:44): > for loyalgoff@jhmi.edu

Marcel Ramos Pérez (13:47:56): > It’s in the channel description

Marcel Ramos Pérez (13:48:00): > https://bioc-community.herokuapp.com/

Kasper D. Hansen (13:48:17): > thanks

Marcel Ramos Pérez (13:48:21): > It should work; I fixed it some time ago

Loyal (13:50:05): > @Loyal has joined the channel

Abbas Rizvi (16:12:24): > @Abbas Rizvi has joined the channel

Ezgi Karaesmen (16:12:32): > @Ezgi Karaesmen has joined the channel

2018-05-04

Peter Hickey (16:38:15): > The new ‘Extending the SummarizedExperiment class’ vignette is great (https://bioconductor.org/packages/devel/bioc/vignettes/SummarizedExperiment/inst/doc/Extensions.html). I’d consider myself a ‘middle-aged’ hand at this, and I’m still learning lots of great tips and tricks from reading the vignette. > Thanks @Aaron Lun and @Martin Morgan!

2018-05-09

Erik Wright (11:37:30): > @Erik Wright has joined the channel

2018-05-10

Edoardo Pasolli (12:27:20): > @Edoardo Pasolli has joined the channel

Nicola Segata (15:10:31): > @Nicola Segata has joined the channel

2018-05-11

Lori Shepherd (07:49:44): > Hello - any proteomics people have some suggestions for this support site post - https://support.bioconductor.org/p/108622/

Lori Shepherd (07:50:04): > Hello - any sequencing people have some suggestions for this support site post - https://support.bioconductor.org/p/108541/

2018-05-15

Aaron Lun (10:21:21): > Support site re-design looks nice. +1

Aaron Lun (10:46:08): > I wonder whether it is possible for “edit” mode to have more respect for markdown-specific stylings. For example, markdown quotes disappear when I want to edit the post (e.g., in https://support.bioconductor.org/p/108917/#108919), as does the pretty syntax highlighting in code chunks.

2018-05-16

Ruth Isserlin (14:05:04): > @Ruth Isserlin has joined the channel

James W. MacDonald (15:57:59): > @James W. MacDonald has joined the channel

2018-05-19

Rhonda Bacher (12:26:38): > @Rhonda Bacher has joined the channel

2018-05-21

Aedin Culhane (15:39:41): > Have you seen https://www.rpackages.io/? Anything Bioc can learn from its searchable interface to R packages? - Attachment (RPackages): RPackages - R Packages Search and Statistics > RPackages brings useful statistics and information about R packages.

Aaron Lun (15:40:14): > Definitely looks nice.

Aaron Lun (15:40:28): > It’s a shame those graphs don’t have packages on the nodes.

Aedin Culhane (15:40:35): > Has anyone tested their new IDE R code? https://www.pgm-solutions.com/rcode - Attachment (PGM Solutions): RCode - PGM Solutions > RCode is a powerful and modern IDE for developing in R.

Aaron Lun (15:41:31): > Ah, the rpackages.io search doesn’t include Bioconductor packages. Well, not mine at least

Aedin Culhane (15:42:05): > No it doesn’t seem to include Bioc

2018-05-22

Martin Morgan (09:26:01): > @Mike Smith @Hervé Pagès this https://support.bioconductor.org/p/108548/#109203 might have left my competence – seems like H5Fcreate() fails when HDF5Array loads (HDF5Array is creating a cache of some sort…) even though the directory exists. It happens on the user’s Windows system; best guess is something about writing to different file systems?

Lauren Fitch (11:30:15): > @Lauren Fitch has joined the channel

2018-05-23

Dario Righelli (09:42:09) (in thread): > no, i’m using rstudio, is it better?

2018-05-24

James Hawley (10:20:25): > @James Hawley has joined the channel

2018-05-31

Surajit Bhattacharya (14:39:01): > @Surajit Bhattacharya has joined the channel

2018-06-01

Aedin Culhane (14:39:57) (in thread): > I use RStudio also. That’s why I asked

2018-06-04

Michael Love (13:58:39): > @Michael Love has joined the channel

2018-06-05

Martin Morgan (08:28:11): > The conflicted package https://cran.r-project.org/package=conflicted seems like it’ll save some tears > > > suppressPackageStartupMessages({ library(org.Hs.eg.db); library(tidyverse) }) > > select(org.Hs.eg.db, "BRCA1", "GENENAME", "SYMBOL") > Error in UseMethod("select_") : > no applicable method for 'select_' applied to an object of class "c('OrgDb', 'AnnotationDb', 'envRefClass', '.environment', 'refClass', 'environment', 'refObject', 'AssayData')" > > library(conflicted) > > select(org.Hs.eg.db, "BRCA1", "GENENAME", "SYMBOL") > Error: select found in 2 packages. You must indicate which one you want with :: > * dplyr::select > * AnnotationDbi::select >

Kevin Rue-Albrecht (08:33:06): > Hadley read my mind (https://github.com/r-lib/conflicted): > “If you want to make this behaviour the default, you can load conflicted in your ~/.Rprofile (the easiest way to find and edit this file is with usethis::edit_r_profile()): > > if (interactive()) { > suppressMessages(suppressWarnings(require(conflicted))) > } > >”

Federico Marini (08:47:39): > The latest ggplot iterations also had something with exprs, if I remember correctly

Kevin Rue-Albrecht (08:49:22): > on a separate note, it reminds me that we’ll need to flesh out our NEWS file a bit more systematically, before we forget what we fixed and what we added

Kevin Rue-Albrecht (08:50:26) (in thread): > Where did you see that news? I’m looking at their NEWS file right now (https://cloud.r-project.org/web/packages/ggplot2/news.html) and a quick search doesn’t pick up any ‘exprs’

Federico Marini (08:50:33) (in thread): > Wrong channel? :stuck_out_tongue:

Kevin Rue-Albrecht (08:50:41) (in thread): > … or is it something about a future release?

Kevin Rue-Albrecht (08:50:49) (in thread): > oh damn

Federico Marini (08:50:51) (in thread): > https://github.com/tidyverse/ggplot2/issues/2509 - Attachment (GitHub): Exported function ggplot2::exprs conflicts with Bioconductor Biobase::exprs · Issue #2509 · tidyverse/ggplot2 > The recent export of exprs poses a real problem when used in combination with Biobase, which is basically systematically loaded when using any package from Bioconductor. I suspect this will affect …

Kevin Rue-Albrecht (08:51:40) (in thread): > I wonder why I thought you were posting on iSEE :wink:

Kevin Rue-Albrecht (08:53:39) (in thread): > oh wow.. that’s an interesting situation

Federico Marini (09:49:53): > @Martin Morgan: some other tears might end up being poured, still

Federico Marini (09:50:28): > I just started a Shiny app where indeed some “conflicts” are there

Federico Marini (09:50:41): > and the error-based mechanism just kills the shiny app

Federico Marini (09:51:48): > So putting those two lines in the .Rprofile might be good for many cases, yet bad for others

Yuwei Ni (10:24:30): > @Yuwei Ni has joined the channel

Martin Morgan (10:38:57): > @Federico Marini I think it’s one of those programming things where it’s better to fail early and hard than to persist with subtle bugs that only get revealed after you’ve announced your cure for cancer…

Federico Marini (10:39:25): > Not there yet :wink:

Federico Marini (10:40:40): > but it is good anyway to have such a system available. My point was more like a heads-up for occasional shiny users, who might be more affected than others if they “blindly” follow Hadley’s word to put that snippet in the .Rprofile

Martin Morgan (17:36:33): > Bioconductor conference travel awards announced; check your in-boxes - Attachment (BioC 2018): BioC 2018: Where Software and Biology Connect > Where Software and Biology Connect. July 25 - 27, Toronto, Canada.

Diya Das (19:45:48): > @Diya Das has joined the channel

2018-06-06

Kelly Street (12:54:01): > @Kelly Street has joined the channel

2018-06-11

Michael Love (20:36:16) (in thread): > is this something you think that Bioc users should best load, given the number of conflicts btwn Bioc and tidy

2018-06-12

Martin Morgan (12:43:46) (in thread): > Seems like there are a couple of fairly substantial issues with conflicted & other packages (e.g., Rcpp Function() previously used evalq() on the search path; dplyr uses, via C++ code, sort() on the search path; S4 generics correctly promoting base functions aren’t treated correctly; conflicted advises fixes to package code that the typical user wouldn’t have access to …). I think users in general should hold off for the moment, but that in the long term this will be helpful. I couldn’t actually tell from your question whether you thought using conflicted was a good idea or not?

Michael Love (21:22:09) (in thread): > good to hear your thoughts. my opinion is that, we’ll need at least something like this to resolve the mess unless we want to change our coding paradigm to do more of AnnotationDbi::select

2018-06-22

Jason Berndt (10:17:04): > @Jason Berndt has joined the channel

Albert Kuo (13:44:17): > @Albert Kuo has joined the channel

Martin Morgan (15:54:29): > I thought this https://github.com/Bioconductor/Rhtsget could be a fun weekend project if anyone is interested in lending a hand – a ‘streaming’ client for GA4GH data. - Attachment (GitHub): Bioconductor/Rhtsget > Rhtsget - Access GA4GH’s streaming API for read and variant retrieval

2018-06-25

Michael Love (09:57:20) (in thread): > @Martin Morgan cool! > > a side topic but GA4GH related: Rob and I are making efforts to integrate with GA4GH on the transcriptome hashing project. they’ve settled on a truncated version of SHA-512 (not the standard truncation of 512 but so be it) to identify sequence but i think we will be able to sync with them, and so make use of future APIs to identify transcriptomes (no such API exists now but they will likely provide this in the future, and I’ve been bugging them about how it will be very useful)

Elana Fertig (16:03:31): > @Elana Fertig has joined the channel

Alex Hopkins (16:04:44): > @Alex Hopkins has joined the channel

Michael Love (16:35:34): > :wave: @Elana Fertig

2018-06-26

Elana Fertig (08:36:46): > Hey @Michael Love and all! Thanks for the warm welcome, excited to see you all here!!! Please welcome also @Alex Hopkins from our group who’s doing some cool work with TCR-sequencing that we’re hoping to put up on Bioconductor!

Michael Love (09:02:39): > :wave: @Alex Hopkins

Michael Love (09:02:52): > i’ll confess ignorance, what’s TCR seq again?

Alex Hopkins (09:23:21): > Thanks @Michael Love, it’s T cell (and B cell) receptor sequencing

Alex Hopkins (09:23:23): > https://www.ncbi.nlm.nih.gov/pubmed/?term=24140071 - Attachment (ncbi.nlm.nih.gov): Immunosequencing: applications of immune repertoire deep sequencing. - PubMed - NCBI > Curr Opin Immunol. 2013 Oct;25(5):646-52. doi: 10.1016/j.coi.2013.09.017. Epub 2013 Oct 16. Review

Alex Hopkins (10:19:12): > @Elana Fertig and I (at Hopkins) are working on a new class for this data type, would anyone here be interested in such a thing?

Kevin Rue-Albrecht (10:35:15): > I’m too short on time and not directly involved in this kind of work myself, but I do know that there is significant interest in this field in the MRC WIMM (Oxford), e.g. https://www.imm.ox.ac.uk/research/units-and-centres/mrc-wimm-centre-for-computational-biology/groups/computational-immunology-group - Attachment (imm.ox.ac.uk): Koohy Group: Machine Learning and Integrative Approaches in Immunology — MRC Weatherall Institute of Molecular Medicine > We would like to understand the functional and molecular mechanisms of the immune system in various immunologically important conditions such as cancer, infection, autoimmune disease as well as ageing. We have a special interest in computational cancer immunotherapy such as antigen presentation, neo-antigen identification and T cell recognition of neo-antigens as well as interrogating the immune response to personalized vaccines from neo-antigens.

Elana Fertig (10:42:07): > @Kevin Rue-Albrecht is it on Bioconductor?

Elana Fertig (10:42:32): > we saw a lot of CRAN, but nothing Bioc compatible

Kevin Rue-Albrecht (10:44:04): > Well, I didn’t mean that they’re developers of packages, I certainly know that they would be users of such packages. Haven’t had the chance to ask them what programs (R, CRAN, Bioconductor, Python, … ) they’re using for their current projects

Martin Morgan (10:49:36): > If you’re looking for advice on representing the data then a good way to go might be with a public (or private with invite…) GitHub repository where you can post a little sample data and your thoughts on how to represent it…

Levi Waldron (10:53:17): > @Alex Hopkins @Elana Fertig I’m always interested in new data structures. There will be a “New Data Structures” SIG at Bioc2018, you could put it on the agenda by commenting on the Issue: https://github.com/Bioconductor/BioC2018/issues/8. - Attachment (GitHub): SIG: new data structures for Bioconductor · Issue #8 · Bioconductor/BioC2018 > From @lwaldron on October 22, 2017 4:26 This SIG will discuss recent and needed Bioconductor data classes. Some recent or in-testing data classes to discuss are: MultiAssayExperiment (for "glu…

Vince Carey (11:56:40): > We’ve had a little back and forth in email. https://github.com/ahopki14/tcrSeqR - Attachment (GitHub): ahopki14/tcrSeqR > tcrSeqR - An R package for analyzing TCR sequencing data

Vince Carey (11:56:58): > Question has been role of MultiAssayExperiment

Vince Carey (12:05:07): > While we are at it (should we make a new channel?) some of the material at AIRR may be relevant? http://airr.irmacs.sfu.ca/

Alex Hopkins (12:10:51): > That is very helpful, thanks @Vince Carey

Vince Carey (13:07:08): > It would be good to state here what seems to be lacking in the SummarizedExperiment for representing this data.

Raphael Gottardo (13:43:36): > We’re doing quite a bit of work on TCR/BCR seq, and would love to be involved. @Greg Finak I second @Vince Carey’s suggestion to carefully study the AIRR for standards.

Alex Hopkins (15:01:01): > Sure, @Vince Carey: the main thing seems to be that TCR data can be represented at the nucleotide level or amino acid level, and storing them as multiple assays is difficult because the dimensions are not the same (in SummarizedExperiment). And @Raphael Gottardo, we will be sure to check the AIRR as we look at this, thanks.

Valentin Voillet (16:40:07): > @Valentin Voillet has joined the channel

Rob Amezquita (17:52:27): > @Rob Amezquita has joined the channel

2018-06-27

Nicholas Cooley (11:52:06): > @Nicholas Cooley has joined the channel

2018-06-28

Vince Carey (11:35:40): > I had a look at some of the tsv files that were provided … is it the case that the nucleotide representation is always present, but the aminoAcid is only present part of the time? It seems that there is one assay with unique features given by the nucleotide sequence of TCR, and the aminoAcid representation of the sequence is metadata that is only available for a subset of results. There are events where a given aminoAcid sequence corresponds to multiple nucleotide sequences. I don’t think there is much downside to using MultiAssayExperiment for the two representations, but it might be too heavy if all you really need are operations on the nucleotide-level features to work at the amino acid level.

Kasper D. Hansen (14:30:47): > I am recommending a fast nucleotide -> aminoacid converter
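
The converter idea can be sketched in base R as follows. This is only an illustration of the approach, not the code anyone in the thread used: `translate_nt` is a hypothetical helper built on the standard genetic code, and the two example sequences are made up. For real work, Biostrings provides `translate()`.

```r
# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G
# with the third base varying fastest: TTT, TTC, TTA, TTG, TCT, ...
bases <- c("T", "C", "A", "G")
grid <- expand.grid(third = bases, second = bases, first = bases,
                    stringsAsFactors = FALSE)
codons <- paste0(grid$first, grid$second, grid$third)
aa <- "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
genetic_code <- setNames(strsplit(aa, "")[[1]], codons)

# Hypothetical helper: translate one nucleotide string in frame 1.
# A trailing partial codon is dropped; unrecognised codons become "X".
translate_nt <- function(nt) {
  nt <- toupper(nt)
  n_codons <- nchar(nt) %/% 3L
  if (n_codons == 0L) return("")
  starts <- seq(1L, by = 3L, length.out = n_codons)
  residues <- unname(genetic_code[substring(nt, starts, starts + 2L)])
  residues[is.na(residues)] <- "X"
  paste(residues, collapse = "")
}

# Two made-up nucleotide sequences that encode the same amino acids.
nt_seqs <- c("TGTGCCAGC", "TGCGCTAGT")
counts <- c(5, 3)
aa_seqs <- vapply(nt_seqs, translate_nt, character(1), USE.NAMES = FALSE)

# Collapse nucleotide-level counts to the amino acid level.
aa_counts <- tapply(counts, aa_seqs, sum)
```

Here both nucleotide sequences translate to the same amino acid sequence, so their counts are summed into a single amino-acid-level feature, which is the many-to-one relationship described above.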

2018-07-09

Alex Hopkins (10:32:30): > Thanks @Vince Carey and @Kasper D. Hansen (and sorry for the late reply). I might take a look at both options to see how they perform. The MultiAssayExperiment approach will use more memory but be faster for large experiments, while a converter will use less memory to store the object, but might get slow…

2018-07-10

Petr Smirnov (08:02:44): > @Petr Smirnov has joined the channel

2018-07-15

Daniel Giguere (21:09:44): > @Daniel Giguere has joined the channel

2018-07-17

Michael Love (16:03:03): > is it possible that R’s check is getting slower? I just noticed with a recent package i’m working on it takes 3:20 just to get to checking examples on my 2015 Mac. i have a lot of importFrom calls but that’s not really new for my packages, and i didn’t notice this before. i’ve got <1:40 left for examples, unit tests, and vignette. check finishes on my machine in 4:15 but not on the Windows single package builder. the vignette takes 25 seconds to create, so even if i pared the docs down to less of a real example, I wouldn’t gain enough to get myself under 5 minutes on the Windows builder

Kasper D. Hansen (16:04:27): > check is expanding in scope and thoroughness all the time. Not sure if that explains what you’re seeing and what you’re comparing against

Kasper D. Hansen (16:05:45): > I have just been looking at illuminaio again and that is a super lean package - no imports, little code etc.

Kasper D. Hansen (16:06:00): > 41s for the entire check

Kasper D. Hansen (16:06:08): > (on a macbook air)

Michael Love (16:07:05): > check takes 0:50 on my machine to just get through these: > > * checking whether the package can be loaded ... OK > * checking whether the package can be loaded with stated dependencies ... OK > * checking whether the package can be unloaded cleanly ... OK > * checking whether the namespace can be loaded with stated dependencies ... OK > * checking whether the namespace can be unloaded cleanly ... OK >

Kasper D. Hansen (16:07:20): > check checks for loading. That can be pretty slow by itself

Martin Morgan (16:08:03): > one culprit might be slow internet connection, because R CMD check connects to various repositories; I noticed this in Laurent’s build-a-package demo at CSAMA…

Kasper D. Hansen (16:08:24): > hmm, interesting

Michael Love (16:08:34): > oh i’m definitely on a slow connection

Kasper D. Hansen (16:08:48): > system.time(library(minfi)) takes 10s

Michael Love (16:09:14): > but also the Bioc Windows single package builder is slower

Michael Love (16:10:33): > on the single package builder check is taking: 5:12 for windows, 2:50 for linux, 3:20 for mac

Martin Morgan (16:10:58): > windows builds & checks 32 and 64 bit versions so is approximately 2x longer

Michael Love (16:12:08): > does the 5 min check rule still apply to the Windows machine? or is it enough to get it under 5 for the other two machines

Michael Love (16:15:37): > i think the problem is that this package builds on a lot of other packages, but not in an easily dispensable way

Martin Morgan (16:31:17): > I think@Lori Shepherdand your package reviewer can be more helpful about the rules.

2018-07-18

Michael Love (00:55:18): > Ok I’ll switch over to the pkg issue on GH. This may be idiosyncratic problem given the number of imports I have. Thanks for the info

Lori Shepherd (07:13:25): > For reference here - it’s generally the reviewer’s discretion concerning exceptions to warnings/errors but I think our general consensus on the team is if it is building under 5 on the other two OSes and the windows check is reasonable we let it through.

Michael Love (07:41:55): > Ok thanks for info @Lori Shepherd

Vince Carey (12:29:03): > What do we know about profiling R CMD check and library()? It could be very useful to know explicitly how the time is being used.
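
As a starting point for the library() half of that question, load time can be measured directly in base R. This is a minimal sketch, assuming a fresh session in which the package is not yet loaded; `time_library` is a hypothetical helper, and "tools" is used only because it ships with R.

```r
# Hypothetical helper: elapsed seconds needed to attach a package.
# A package whose namespace is already loaded returns almost instantly,
# so run this in a fresh R session for a meaningful number.
time_library <- function(pkg) {
  timing <- system.time(
    suppressPackageStartupMessages(library(pkg, character.only = TRUE))
  )
  timing[["elapsed"]]
}

# Swap in the package of interest, e.g. one with many imports.
elapsed <- time_library("tools")
elapsed
```

To see where the time goes rather than just how much there is, the same `library()` call can be wrapped between `Rprof()` and `Rprof(NULL)` and summarized with `summaryRprof()`; very fast loads may yield too few samples for a useful summary.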

2018-07-19

Sehyun Oh (13:58:10): > @Sehyun Oh has joined the channel

Chantal Ho (14:10:31): > @Chantal Ho has joined the channel

Brendan Innes (14:15:38): > @Brendan Innes has joined the channel

Jenny Drnevich (14:40:48): > @Jenny Drnevich has joined the channel

Nima Hejazi (15:16:57): > @Nima Hejazi has joined the channel

Qiang Hu (17:36:39): > @Qiang Hu has joined the channel

2018-07-20

Erik Drysdale (11:08:17): > @Erik Drysdale has joined the channel

2018-07-23

David Dilworth (08:58:56): > @David Dilworth has joined the channel

solomon shorser (14:00:33): > @solomon shorser has joined the channel

Robin Haw (14:02:19): > @Robin Haw has joined the channel

C. Mirzayi (please do not tag this account) (15:50:33): > @C. Mirzayi (please do not tag this account) has joined the channel

2018-07-24

Tom Belbin (09:59:16): > @Tom Belbin has joined the channel

Zhaleh Safikhani (16:01:30): > @Zhaleh Safikhani has joined the channel

Eric Milliman (17:56:44): > @Eric Milliman has joined the channel

2018-07-25

Domenick Braccia (07:00:58): > @Domenick Braccia has joined the channel

Phil Cheng (07:29:56): > @Phil Cheng has joined the channel

James MacDonald (08:35:30): > @James MacDonald has joined the channel

Jeff Johnston (08:58:45): > @Jeff Johnston has joined the channel

Kasper D. Hansen (09:07:56): > We have several channels which appear unused (or perhaps it’s just been a little while since anyone used them). While the topics are of interest I think we should archive them

Shian Su (09:19:05): > @Shian Su has joined the channel

Neke Ibeh (09:28:31): > @Neke Ibeh has joined the channel

Michael Steinbaugh (10:14:03): > For Bioc2018 where’s the Google Docs link for the lightning talks?

Miles (10:18:41): > @Miles has joined the channel

Leonardo Collado Torres (10:23:41): > https://twitter.com/RLadiesGlobal/status/1022031250759053314 - Attachment (twitter): Attachment > :loudspeaker: Join our new :sparkles: R-Ladies community Slack! :sparkles: > > It’s aiming for a safe & global space to discuss topics and share ideas around #rstats & the #rladies community! :purple_heart: > > We invite all non-cis male R users to sign-up via http://bit.ly/rladies-slack :rocket:

Derek Nedveck (10:42:21): > @Derek Nedveck has joined the channel

Sean Davis (10:52:59): > @Michael Steinbaugh: here https://docs.google.com/document/d/1hhd5X3_Tj2WJhtt9Ro_WjYgBJI5221tRWg_7ZyMj_AM/edit?usp=sharing - File (Google Docs): BioC 2018 Lightning Talks

Michael Steinbaugh (10:55:27): > @Sean Davis Thanks Sean!

Leonardo Collado Torres (10:58:41): > I added a few emojis & gifs: > > :beret-parrot: :cdsb: :christmas-parrot: :dealwithit-parrot: :dna: :fast_parrot: :fiesta_parrot: :hex: :jaccard: :parrotconga: :party_parrot: :r: :reverse_conga_parrot: :rladies: :ropensci: :sad-parrot: :shuffle-parrot: :trump_emoji: :bioc: :overleaf: :biorxiv: :unam:

Leonardo Collado Torres (10:59:46): > you can add more if you follow https://get.slack.help/hc/en-us/articles/206870177-Add-custom-emoji - Attachment (Slack Help Center): Add custom emoji > Emoji are fun, but they’re functional too! Use them to enhance your messages, react to activity, or with the Reacji channeler app to copy messages. Whatever your needs might be, you can customize y…

Sean Davis (11:22:36): > I’m particularly interested to see how these “enhance our messages” as noted above. :fast_parrot:

Sean Davis (11:52:28): > It’s working! My message is very enhanced!

Miles (13:31:30): > Are the slides from the lightning talks available somewhere?

Nitesh Turaga (13:33:51): > www.bit.ly/scale-bioc

Sara Keeble (14:17:38): > @Sara Keeble has joined the channel

Angeline Yasodhara (14:39:31): > @Angeline Yasodhara has joined the channel

James Taylor (15:46:40): > @James Taylor has joined the channel

Ben Johnson (16:00:12): > @Ben Johnson has joined the channel

Simon Coetzee (16:06:51): > @Simon Coetzee has joined the channel

Stephen Turner (16:17:46): > @Stephen Turner has joined the channel

BJ Stubbs (16:42:01): > @BJ Stubbs has joined the channel

Domenick Braccia (16:51:10): - File (Google Docs): Strategies for posting and answering support site questions

Leonardo Collado Torres (20:11:13) (in thread): > @Michael Love I forget if the suggestions we talked about a while ago in the bioc-devel mailing list were implemented or not. Do you know?

Michael Love (20:16:14) (in thread): > yes, they are in

Michael Love (20:17:01) (in thread): > https://github.com/Bioconductor/bioconductor.org/commit/c6e5c468da9fd5108204f590161640ea4e9df9d2 - Attachment (GitHub): posting guide revisions from M. Love · Bioconductor/bioconductor.org@c6e5c46 > Website for bioconductor.org

Michael Hoffman (21:21:56): > @Michael Hoffman has joined the channel

Michael Hoffman (21:23:10) (in thread): > My man!

Anthony (23:59:44): > @Anthony has joined the channel

2018-07-26

Kasper D. Hansen (00:10:39) (in thread): > I’m learning from the best

Meng-Chun (07:26:12): > @Meng-Chun has joined the channel

oclark (09:00:33): > @oclark has joined the channel

Farnush Farhadi (09:02:04): > @Farnush Farhadi has joined the channel

Rachael Phillips (09:25:07): > @Rachael Phillips has joined the channel

Yali Zhang (10:17:20): > @Yali Zhang has joined the channel

Aake v (10:20:07): > @Aake v has joined the channel

Michael Steinbaugh (10:30:58): > Is there a current recommended best practice for S4 object coercion with arguments? I’m using setAs(from, to) inside my package to provide as() method support, but this doesn’t allow ... for additional arguments

Andrea Mcewan (10:37:51): > @Andrea Mcewan has joined the channel

Peter Hickey (10:43:56) (in thread): > i think you might be stuck with as() being argument-less and you’ll have to create a dedicated function. > E.g., there’s an as(data.frame, "GRanges") method but there’s also makeGRangesFromDataFrame() to provide many more options for how this coercion takes place

Michael Steinbaugh (10:44:58) (in thread): > But using an approach like convert(from, to, ...) instead isn’t recommended?

Peter Hickey (10:45:41) (in thread): > i don’t know. as far as i know, there isn’t a convert() generic, but i can see the argument for one

Daniela Cassol (10:46:05): > @Daniela Cassol has joined the channel

Michael Steinbaugh (10:46:19) (in thread): > I’m going to stick with simple as() method support for now but it’d be cool to add some functionality like that in a BiocGeneric

Krithika Bhuvanesh (11:49:11): > @Krithika Bhuvanesh has joined the channel

Kasper D. Hansen (12:01:05): > So as() allows for automatic conversion. Having arguments philosophically destroys that

Kasper D. Hansen (12:01:30): > It’s like “I can imagine this object being this other class, but to do so I need more information”
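
The trade-off in this thread can be sketched in base R (methods package only). This is a minimal illustration, not anyone's package code: the classes Interval and Point and the function intervalToPoint() are hypothetical names invented for the example. The as() method commits to one default, while the dedicated function exposes the extra information as a named argument.

```r
library(methods)

# Hypothetical classes, used only for illustration.
setClass("Interval", slots = c(start = "numeric", end = "numeric"))
setClass("Point", slots = c(pos = "numeric"))

# Dedicated conversion function: the extra information needed for the
# conversion is an explicit, documented argument.
intervalToPoint <- function(x, where = c("mid", "start", "end")) {
    where <- match.arg(where)
    pos <- switch(where,
        mid = (x@start + x@end) / 2,
        start = x@start,
        end = x@end
    )
    new("Point", pos = pos)
}

# as() takes no extra arguments, so the coercion method has to pick
# a single default behaviour.
setAs("Interval", "Point", function(from) intervalToPoint(from))

iv <- new("Interval", start = 10, end = 20)
p_default <- as(iv, "Point")                 # always the "mid" default
p_end <- intervalToPoint(iv, where = "end")  # caller chooses
```

Defining the setAs() method as a thin wrapper around the dedicated function keeps the two entry points consistent.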

Matthew Oldach (12:53:08): > @Matthew Oldach has joined the channel

Destiny McNeece-Mullens (14:54:04): > @Destiny McNeece-Mullens has joined the channel

2018-07-27

Levi Waldron (09:25:26) (in thread): > For any generic already as widely used as as(), I think its arguments are not likely to change. convert(from, to, ...) might have some advantage of providing a go-to-function if it were adopted widely, but on the other hand the ... arguments are a little harder to use than a specific function with its own arguments, e.g. no pop-up tips, auto-complete, or argument matching, user can’t see the code just by typing the function name. Just to say IMO there are some trade-offs between having a widely-used generic vs. a normal function for a specific use.

Michael Steinbaugh (09:32:29) (in thread): > Thanks Levi that’s really helpful

Stephen Guest (09:58:29): > @Stephen Guest has joined the channel

Lily Wang (16:46:15): > @Lily Wang has joined the channel

2018-07-31

Frederick Tan (14:15:41): > @Frederick Tan has joined the channel

2018-08-01

Charity Law (18:47:14): > @Charity Law has joined the channel

2018-08-02

Chiaowen Joyce Hsiao (09:28:14): > @Chiaowen Joyce Hsiao has joined the channel

2018-08-13

Diya Das (18:44:30): > Has anybody tried to deploy to shinyapps.io since upgrading to R 3.5 / BiocManager? I’m getting a well-known error > > Error: Unhandled Exception: Child Task 542891378 failed: Error building image: Error fetching GenomicRanges (1.33.7) source. Error downloading package source. Please update your BioConductor packages to the latest version and try again: <BioconductorPackageSourc > In addition: Warning message: > 'BiocInstaller' and 'biocLite()' are deprecated, use the 'BiocManager' CRAN > package instead. > Execution halted > > …but I am using BiocManager in my script.

2018-08-14

Martin Morgan (08:14:51): > The message about BiocInstaller will show up whenever BiocInstaller is loaded, including by a package that Depends: or Imports: it and has not been updated…

Axel Klenk (10:45:10): > @Axel Klenk has joined the channel

Steve Niu (17:48:57): > @Steve Niu has joined the channel

2018-08-16

Marcus Kinsella (12:04:46): > @Marcus Kinsella has joined the channel

2018-08-17

branko (10:03:09): > @branko has joined the channel

2018-08-27

Malte Thodberg (05:53:51): > @Malte Thodberg has joined the channel

Juan R Gonzalez (14:58:41): > @Juan R Gonzalez has joined the channel

2018-08-29

rizoic (04:07:34): > @rizoic has joined the channel

2018-09-06

Aaron Lun (11:43:17): > Quick question: does anyone know how to get R to not try to build PDF vignettes during installation? Currently getting these errors in the install: > > processing 'displaylist.Rnw' > Error: compiling TeX file 'displaylist.tex' failed with message: > unable to run 'pdflatex' on 'displaylist.tex' > Execution halted > building/updating vignettes for package 'parallel' ... > processing 'parallel.Rnw' > Error: compiling TeX file 'parallel.tex' failed with message: > unable to run 'pdflatex' on 'parallel.tex' > Execution halted > building/updating vignettes for package 'utils' ... > processing 'Sweave.Rnw' > Error: running Sweave on vignette '/home/jmlab/software/R/R-3-5-branch_devel/src/library/utils/vignettes/Sweave.Rnw' failed with message: > chunk 2 > Error in texi2dvi(file = file, pdf = TRUE, clean = clean, quiet = quiet, : > unable to run 'pdflatex' on 'example-1.tex' > Execution halted > make[1]: ***** [vignettes-lattice] Error 1 > make[1]: Leaving directory `/home/jmlab/software/R/R-3-5-branch_devel/src/library' > make: ***** [vignettes] Error 2 > > I mean, it doesn’t hurt anything - there’s still a usable R binary - but I’d like to know if I can avoid these in the first place.

Aaron Lun (11:59:59): > It also seems that Ctrl-C’ing out of a BiocManager::install() call fails to remove the 00LOCK files in the R installation directory.

Kevin Rue-Albrecht (14:01:54) (in thread): > R CMD build --no-build-vignettes?

Kevin Rue-Albrecht (14:02:04) (in thread): > I can’t see the option in R CMD INSTALL indeed

Kevin Rue-Albrecht (14:03:36) (in thread): > In which case, I suppose you’d have to break it down into R CMD build --no-build-vignettes followed by R CMD INSTALL <path to build>

Sean Davis (17:01:38): > Any thoughts on this one? > > git clone git@git.bioconductor.org:packages/GenomicDataCommons > cd GenomicDataCommons > > In R, I run devtools::test(), I get an error in the test_legacy.R file. I run devtools::test() again in the same R session, the same tests run successfully.

Sean Davis (17:02:19): > The build system is picking up the same error. I hadn’t noticed it locally since I was running tests regularly.

Sean Davis (17:05:23): > And the build report: https://bioconductor.org/checkResults/3.8/bioc-LATEST/GenomicDataCommons/malbec1-checksrc.html

Sean Davis (17:11:33): - File (Plain Text): sessionInfo

Martin Morgan (17:23:03) (in thread): > Installation of R, right? you could maybe look at the default target in the top level Makefile > > all: Makefile Makeconf R docs recommended vignettes javaconf > > and guess that you could execute those one at a time omitting docs. Maybe.

Kevin Rue-Albrecht (17:27:41): > I’m just looking at it now out of curiosity (I’ve never used the package before), but I spotted some copy paste leftover while reading the doc: ?query: in section “Function”, three of the bullet points state the identical “convenience contructor for a GDCQuery for cases” (note also the typo of constructor) > [sorry for the spelling-nazi-beside-the-point comment!]

Sean Davis (17:28:21): > Thx.

Kevin Rue-Albrecht (17:29:35): > On a separate note, when I run the tests interactively, I get the following: > > > cache = gdc_cache() > Would you like to create a GDC Cache directory at /Users/kevin/Library/Caches/GenomicDataCommons > > 1: Yes > 2: No > > Do you know what the expected behaviour is when running non-interactively?

Sean Davis (17:30:57): > There is a !interactive() in there. In other words, the question is not asked.

Kevin Rue-Albrecht (17:32:05): > duh. sorry ^^

Sean Davis (17:32:44): > If you were misled, let me know.

Vince Carey (17:33:51): > I can’t test now but the fail-then-succeed sounds like something that happens with authentication complications?

Kevin Rue-Albrecht (17:34:18): > from your description my thought was that something downstream of the faulty line somehow changed the environment as a by-product, making it work the next time, but I can’t see anything obvious, and as I said, I haven’t used the package before. just curious about it :slightly_smiling_face:

Sean Davis (17:35:01): > Should be no authentication here.

Sean Davis (17:35:38): > Posted to Stackoverflow, also. https://stackoverflow.com/questions/52212397/how-do-i-debug-an-error-that-occurs-specifically-in-the-testthat-context - Attachment (Stack Overflow): How do I debug an error that occurs specifically in the testthat context? > I am trying to debug an error that arises when I test a package I have developed. To reproduce what I see: git clone git@git.bioconductor.org:packages/GenomicDataCommons cd GenomicDataCommons The…

Sean Davis (17:36:54): > Running the code outside the testthat context works fine the first time around, also.

Kevin Rue-Albrecht (17:38:50): > Hehe I was going to say that last thing myself. That’s what I initially tried even before the devtools::testthat

Kevin Rue-Albrecht (17:40:39): > oh.. wait a second, this time I got: > > > files_legacy_ids = files(legacy = TRUE) %>% results(size = 10) %>% ids() > Error in is.response(x) : Not Found (HTTP 404). >

Kevin Rue-Albrecht (17:40:50): > same for > > > cases_legacy_ids = cases(legacy = TRUE) %>% results(size = 10) %>% ids() > Error in is.response(x) : Not Found (HTTP 404). >

Kevin Rue-Albrecht (17:42:00): > Although obviously that’s not the same error as you initially reported (and my connection is a bit spotty right now)

Sean Davis (17:44:15): > Thanks. That is the same error that I see under testthat. Not sure why it is so reproducibly present and then absent. Your code above is in an interactive R session?

Kevin Rue-Albrecht (17:45:53): > yes, it is an interactive session

Kevin Rue-Albrecht (17:46:13): > I’ve restarted R, and ran again through the test files in alphabetical order

Kevin Rue-Albrecht (17:56:00): > wow.. I’ll have to give up here, but I went as far as response.GDCQuery where things are going beyond me in terms of response_handler(httr::content(.gdc_post(...

Sean Davis (17:57:30): > Thanks for the heroic efforts. Good to know that someone else sees similar behavior.

Sean Davis (18:00:52): > A little experiment proved fruitful. I added a small Sys.sleep(5) to the top of the problematic test file and the error disappears.

Sean Davis (18:01:25): > Off to work with the API maintainers to figure out what is going on on their end.

Kevin Rue-Albrecht (18:01:30): > Haha No worries. I wouldn’t call that heroic yet.. Wish I could help more, but as a general rule I stay away from unit testing things that depend on external connections

Sean Davis (18:01:47): > I would if I could!

Kevin Rue-Albrecht (18:01:48): > (burned myself unit testing biomart queries)

Sean Davis (18:02:33): > The API and underlying data have, in the past, changed without any notification, so I have to at least try.

Kevin Rue-Albrecht (18:02:51): > I was vaguely wondering whether the unit test - due to its non-interactive environment - is ‘too fast’ for whatever handshake the connection needs

Kevin Rue-Albrecht (18:03:14): > because it’s weird that it would work for all attempts >=2

Kevin Rue-Albrecht (18:03:28): > it feels like the connection needs a bit of time to warm up

Sean Davis (18:03:45): > One would still expect something other than an intermittent 404 error. That is the part I’ll target with the GDC team.

Kevin Rue-Albrecht (18:04:18): > Alright. my good deed of the day is done then. Good luck with the follow-up!

Sean Davis (18:04:32): > +5 karma points to you.

Kevin Rue-Albrecht (18:06:55): > Shameless advertising: feel free to ping the iSEE channel if there’s anything from the GDC that we can showcase in an app. I’ve used some TCGA from the ExperimentHub data to demo our largest data set yet (~8k data points), but maybe I’ve missed something neat in all that data! :wink:

Martin Morgan (18:26:22): > In new sessions, if I run devtools::test(filter="legacy") everything is fine. If I comment out the unit test before trouble, the final test in test_data.R, and run devtools::test(), everything is fine. If I revert to the original and run the tests a second time, I believe that the final test in test_data.R actually uses the cache rather than querying the server. So I think what’s going on is the download in the final test of test_data.R is leaving the connection in an uncertain state, and the legacy interface fails. I don’t know why.

Sean Davis (18:30:11): > Thanks, Martin. That is a variation I hadn’t tried. Some great hypotheses are being generated here…. :pensive:

2018-09-10

Marcel Ramos Pérez (15:47:45) (in thread): > Thanks Aaron for pointing this out. I’ll look into it.

Levi Waldron (18:06:54): > I’m organizing a planning committee for Bioc2019 in New York City (at NYU and Rockefeller University), June 24-26. There is lots to be done including peer review of proposed talks, workshops, and posters, developing the programme, adapting the web site (e.g. see http://bioc2018.bioconductor.org/), seeking sponsorship, promoting the conference, and creating next year’s workshop booklet (e.g. see https://bioconductor.github.io/BiocWorkshops/). If you would like to take part, let me know, and I will include you in a kick-off planning meeting within the next couple weeks. - Attachment (BioC 2018): BioC 2018: Where Software and Biology Connect > Where Software and Biology Connect. July 25 - 27, Toronto, Canada. - Attachment (bioconductor.github.io): The Bioconductor 2018 Workshop Compilation > This book contains all the workshops presented at the Bioconductor 2018 Conference

2018-09-13

Kayla Interdonato (11:09:10): > @Kayla Interdonato has joined the channel

2018-09-20

JiefeiWang (12:44:35): > @JiefeiWang has joined the channel

2018-09-21

Martin Morgan (06:19:25): > At the Bioc conference we used https://sli.do for some audience interaction. The ‘free’ version is quite restrictive, e.g., only 3 polls. Are there alternatives that people have had good experience with?

2018-09-24

Kim-Anh Lê Cao (21:43:16): > @Kim-Anh Lê Cao has joined the channel

2018-09-25

Juan R Gonzalez (08:48:27): > Hi all (sorry if something I write has been previously discussed, I was just added to the group some days ago). > 1. (Proposal) I’m developing a package to perform omic data integration using multivariate methods (generalized canonical correlation among others) using MultiDataSet and MultiAssayExperiment objects as input (DelayedArray will also be considered). In particular, I’m interested in the case of having missing individuals (you know that omic tables have individuals who have not been measured in a given dataset or are removed from the analysis after QC in a given table). Is there anyone doing similar things or wanting to collaborate by testing the package? (of course, the method for complete cases data analysis will also be implemented as a particular case). > 2. (Question) I want to implement the methods using RcppParallel. Does anyone know whether this approach is better than combining R functions and BiocParallel? To me, implementing the SVD, matrix multiplication, … at low level using parallel methods is simpler than using BiocParallel. Is there any problem for Bioconductor when not using BiocParallel? Thx!

Martin Morgan (09:32:34) (in thread): > For question 2, RcppParallel sounds like an appropriate solution for the level that you’re talking about. Perhaps for pure matrix algebra a different approach is to rely on the user desiring performance to have a parallel BLAS library https://cran.r-project.org/doc/manuals/r-release/R-admin.html#BLAS. BiocParallel is appropriate for high-level, lapply-style parallel evaluation; it can be important to figure out whether the process is already parallelized, so that one doesn’t over-commit. I don’t think there’s a standardized way to do that. - Attachment (cran.r-project.org): R Installation and Administration > R Installation and Administration

Malte Thodberg (11:38:28): > What’s the best place for minor suggestions and bugs for central Bioconductor packages (GenomicRanges, S4Vectors, BiocGenerics, etc)? > For example, I’ve been working a lot with the new GPos-objects, and have found many small quirky things when using GRanges-methods on GPos-objects.

Vince Carey (11:40:25): > Why not support.bioconductor.org? It sounds like an application/set of concerns that would be of pretty general interest given the centrality of GRanges and emerging interest in GPos.

Levi Waldron (12:22:45) (in thread): > Hi Juan, there’s discussion in the #bigdata-rep channel about low-level analysis of DelayedArray objects and things like SVD and cross-products. I guess that’s not quite what you’re talking about though, if your focus is on methods for multi’omic analysis.

Juan R Gonzalez (13:10:16) (in thread): > Thank you Levi. Actually, I’m implementing those types of operations using DelayedArray! (just programmed the inversion of a matrix using Cholesky decomposition). These are the basic algebra operations required to implement Generalized Canonical Correlation (e.g. a generalization of PCA). Just added to the channel!

2018-09-28

Saad Khan (12:19:49): > @Saad Khan has joined the channel

2018-10-01

Marisa Isabell Metzger (04:22:08): > @Marisa Isabell Metzger has joined the channel

2018-10-02

Nicholas Knoblauch (16:31:51): > @Nicholas Knoblauch has joined the channel

2018-10-06

Aaron Lun (13:15:24): > ARGH :sad-parrot: - File (PNG): Pasted image at 2018-10-06, 6:15 PM

Aaron Lun (13:16:06): > Windows 32. WHY?

Peter Hickey (21:13:50): > i’m also facing this :disappointed: I might have access to a Windows VM through work but i think it’ll be 64 bit (and also means learning how to spin up a VM and install everything required …) > any tips for debugging? any common sources of Windows 32-bit specific errors/bugs i should look at first?

Peter Hickey (21:15:31): > Right now I’m just skipping the offending test on Windows (testthat::skip_on_os()) …

Sean Davis (22:16:04): > Not sure if this helps, but AWS has windows 32-bit (2008 server) images, I think. https://aws.amazon.com/marketplace/pp/B007O1Y1QW/ref=mkt_ste_windows_amis - Attachment (aws.amazon.com): AWS Marketplace: Microsoft Windows Server 2008 Base

Peter Hickey (23:10:14): > thanks, Sean! I’ll take a look

2018-10-07

Davide Risso (02:56:34): > If you need to test a package that lives on github appveyor might help. Not sure about 32 vs 64 bit though

2018-10-09

Aaron Lun (08:12:01): > On another note; I’d like to change the email on my BioC support site account. Who do I have to talk to?

Martin Morgan (09:36:25): > yourself, I think – visit your user page, choose edit profile, change email address.

Aaron Lun (09:39:47): > oh - okay.

Aaron Lun (10:22:06): > Yes, I see it now. Hmm, I had thought this was harder - oh well.

Lucas Schiffer (10:28:02): > “Of course I talk to myself, sometimes I need expert advice”

Kevin Rue-Albrecht (10:40:30) (in thread): > I’ll probably cite/reference this one in the future ^^

Kevin Rue-Albrecht (10:43:25) (in thread): > For now it makes for a nice office poster

2018-10-10

Aaron Lun (08:23:37): > FYI, my Bioc support site avatar is now 3 times cuter than before.

Stephanie Hicks (20:38:58): > ::goes to bioc support site::

Stephanie Hicks (20:39:19): > awww! who is the creature?

2018-10-16

Gabriele Sales (08:55:34): > @Gabriele Sales has joined the channel

2018-10-22

Aaron Lun (15:43:52): > Does CRAN have dead maintainer policies?

Aaron Lun (15:44:05): > Just curious.

Levi Waldron (22:02:55): > I recall maintainers being threatened with package removal if they didn’t address warnings or errors, but I don’t know if there’s a written policy.

2018-10-23

Martin Morgan (02:11:41): > CRAN policy is at https://cran.r-project.org/web/packages/policies.html; the most relevant part is > > Packages will not normally be removed from CRAN: however, they may be archived, including at the maintainer's request. > > Packages for which R CMD check gives an 'ERROR' when a new R x.y.0 version is released will be archived (or in exceptional circumstances updated by the CRAN team) unless the maintainer has set a firm deadline for an upcoming update (and keeps to it). > > Maintainers will be asked to update packages which show any warnings or significant notes, especially at around the time of a new x.y.0 release. Packages which are not updated are liable to be archived. > - Attachment (cran.r-project.org): CRAN Repository Policy > CRAN Repository Policy

Aaron Lun (04:03:34): > Hm, okay.

Aaron Lun (09:40:44): > @Martin Morgan If I want to set a seed inside the function and reset it on exit, what would be the best way to do it? Is there a best way of doing it?

Aaron Lun (09:41:10): > This is mainly motivated by the observation that a lot of my function calls need to be preceded with set.seed, which is becoming a bit irksome.

Martin Morgan (09:53:55): > to avoid answering the question :wink: I wonder what the use case is for using set.seed in a function? Shouldn’t the function be robust to seed? I could understand for reproducibility in a work flow that a seed might be set in the script…

Kevin Rue-Albrecht (09:55:13): > > trick_seed <- function(fun){ > function(...){ > os <- .Random.seed > on.exit( assign( ".Random.seed", os, envir = globalenv() ) ) > set.seed(Sys.time()) > fun(...) > } > } > sample <- trick_seed(base::sample) > rbinom <- trick_seed(stats::rbinom) > > (https://github.com/romainfrancois/evil.R/blob/master/R/evil.R) - Attachment (GitHub): romainfrancois/evil.R > Evil tricks for R. Contribute to romainfrancois/evil.R development by creating an account on GitHub.

Aaron Lun (09:56:08): > Yes, it is for purposes of reproducibility. All my scripts have set.seed() calls preceding various random functions.

Kevin Rue-Albrecht (10:00:06): > (while you obviously don’t want to use the code above as is, it might give you pieces that you can reuse?)

Aaron Lun (10:01:16): > It’s something like that, but I remember there is some complication.
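
One complication with the save-and-restore pattern (possibly the one alluded to here, though the thread does not say) is that .Random.seed does not exist in a fresh session until the RNG is first used. A minimal base R sketch that handles both cases follows; with_local_seed() is a hypothetical helper name, and whether such a helper belongs in package code is exactly what the thread goes on to dispute.

```r
# Hypothetical helper: evaluate `expr` under a fixed seed, then restore the
# caller's RNG state so the global random stream is left untouched.
with_local_seed <- function(seed, expr) {
    had_seed <- exists(".Random.seed", envir = globalenv(), inherits = FALSE)
    if (had_seed) {
        old_seed <- get(".Random.seed", envir = globalenv(), inherits = FALSE)
    }
    on.exit({
        if (had_seed) {
            # Put the previous state back.
            assign(".Random.seed", old_seed, envir = globalenv())
        } else if (exists(".Random.seed", envir = globalenv(),
                          inherits = FALSE)) {
            # There was no state before the call, so remove the one created.
            rm(".Random.seed", envir = globalenv())
        }
    }, add = TRUE)
    set.seed(seed)
    force(expr)  # lazy evaluation: the expression runs here, after set.seed()
}

set.seed(1)
state_before <- .Random.seed
a <- with_local_seed(42, runif(3))
b <- with_local_seed(42, runif(3))
state_after <- .Random.seed
```

Here a and b are drawn under the same seed, and the global state is the same before and after the two calls.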

Kasper D. Hansen (10:13:51): > NONONONONONO

Kasper D. Hansen (10:13:58): > NEVER SET A SEED INSIDE A FUNCTION

Kasper D. Hansen (10:14:34): > AT LEAST NEVER NEVER NEVER IN A PACKAGE

Kevin Rue-Albrecht (10:15:06): > Indeed, better plant it in a pot and water it. Put it in sunlight, not in a package. :seedling:

Aaron Lun (10:16:28): > I was thinking something along the lines of BiocParallelParam’s RNGseed setting capabilities.

Kasper D. Hansen (10:16:56): > That sounds scary

Kasper D. Hansen (10:17:00): > I will have to review that

Kasper D. Hansen (10:17:15): > Especially random numbers and parallel operations are non trivial

Kasper D. Hansen (10:17:36): > Because there is typically no guarantee that streams with different seeds are independent

Kasper D. Hansen (10:17:55): > Which is an extremely common error to make

Aaron Lun (10:23:32): > Each worker should have its own stream, from what parallel::clusterSetRNGStream says.

Aaron Lun (10:23:37): > But anyway, we’re getting off the point.

Aaron Lun (10:24:33): > If it’s good enough for BiocParallelParam, why shouldn’t I provide an option to specify seeds in my parameter classes that perform random calculations?

Chen Meng (11:08:16): > @Chen Meng has joined the channel

Aaron Lun (11:36:57): > Moving on: I discovered the bpstart and bpstop commands. Is there any benefit from doing something like: > > if (!bpisup(BPPARAM)){ > bpstart(BPPARAM) > on.exit(bpstop(BPPARAM), add=TRUE) > } > > in my functions that accept a BPPARAM argument? I assume this starts up the backend (not entirely sure what that means) to avoid paying the start-up costs multiple times if I need to use multiple bp*apply functions within my own function?

Martin Morgan (11:39:50): > yes that idiom avoids the cost of starting the cluster. Also, the worker state is re-used across the function, so for instance the cost of loading GenomicFeatures is paid once.

Martin Morgan (11:46:03): > FWIW BiocParallel does not call set.seed() directly; it is called indirectly when a SnowParam (including MulticoreParam) cluster with RNGseed not NULL starts via parallel::clusterSetRNGStream(); the man page and parallel package vignette discuss use / consequences in more detail.
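
For context on the per-worker streams: parallel::clusterSetRNGStream() selects the "L'Ecuyer-CMRG" generator and hands each worker its own stream, rather than giving workers different seeds of the same generator. A minimal sketch of that stream mechanism without starting a cluster, using parallel::nextRNGStream(); the helper draw_from() and the seed value are illustrative only.

```r
library(parallel)

# Switch to the generator that supports multiple streams, remembering the
# previous settings so they can be restored afterwards.
old_kind <- RNGkind("L'Ecuyer-CMRG")
set.seed(2018)

stream1 <- .Random.seed             # state at the start of the first stream
stream2 <- nextRNGStream(stream1)   # start of the next, separate stream

# Illustrative helper: install a stream state globally, then draw from it.
draw_from <- function(stream, n) {
    assign(".Random.seed", stream, envir = globalenv())
    runif(n)
}

x1 <- draw_from(stream1, 3)
x2 <- draw_from(stream2, 3)
x1_again <- draw_from(stream1, 3)   # same stream state, same numbers

# Restore the previous generator settings.
do.call(RNGkind, as.list(old_kind))
```

Each stream is reproducible from its starting state, which is what makes a single seed sufficient to reproduce the draws on every worker.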

Aaron Lun (14:37:51): > Okay. That’s fair enough. After thinking about it, I’ve reverted back to my original position on this matter (https://support.bioconductor.org/p/110439/).

Aaron Lun (14:45:08): > Okay, final dev question of the day. I have a few internal S4 classes for which I have defined %*%, dim, dimnames, etc. These are not explicitly exported by my NAMESPACE, but it seems they show up in showMethods("%*%") and cause CHECK to be unhappy as they don’t have any documentation. Any thoughts on how to resolve this? I can hardly write documentation for methods for classes that shouldn’t be visible to the user!

Peter Hickey (19:23:04): > accidentally just learnt you can stick a SummarizedExperiment in the assay slot of another SingleCellExperiment :exploding_head:

Peter Hickey (19:23:11): > :turtle: :turtle: :turtle: all the way down

Shian Su (19:36:22): > I have written a function that takes two lists of functions and returns a list of the tensor product with composition as the operator. So the cartesian product of the two lists where the pairs of functions (f, g) are also composed to f o g and the elements are returned as a list.

Shian Su (19:36:34): > Now I come to the hardest part of programming: naming things, any suggestions for something less esoteric than composition_tensor_prod(fn_list1, fn_list2)?

Levi Waldron (19:38:48) (in thread): > I’ve learned the same lesson with SummarizedExperiment(eset)

Levi Waldron (19:39:26) (in thread): > I guess it falls under the “with great power comes great responsibility” heading

Shian Su (20:08:58): > Perhaps fn_outer_prod(fn_list1, fn_list2) sounds benign enough while still being accurate.
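
The function described above was not posted, so the following is only a guess at its shape: a minimal base R sketch that takes every pair (f, g) from the two lists and returns the compositions f o g as a flat list. The helper compose2() and the naming scheme for the result are assumptions made for the example.

```r
# Compose two functions: the result applies g first, then f.
compose2 <- function(f, g) {
    force(f)
    force(g)
    function(...) f(g(...))
}

# All pairwise compositions of fn_list1 with fn_list2, as a flat list.
fn_outer_prod <- function(fn_list1, fn_list2) {
    pairs <- expand.grid(i = seq_along(fn_list1), j = seq_along(fn_list2))
    out <- Map(
        function(i, j) compose2(fn_list1[[i]], fn_list2[[j]]),
        pairs$i, pairs$j
    )
    # Name each element "<outer>.<inner>" when both inputs are named.
    if (!is.null(names(fn_list1)) && !is.null(names(fn_list2))) {
        names(out) <- paste(names(fn_list1)[pairs$i],
                            names(fn_list2)[pairs$j], sep = ".")
    }
    out
}

fns <- fn_outer_prod(
    list(sqrt = sqrt, neg = function(x) -x),
    list(abs = abs, square = function(x) x^2)
)
res_sqrt_square <- fns$sqrt.square(-3)  # sqrt((-3)^2)
res_neg_abs <- fns$neg.abs(-3)          # -(abs(-3))
```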

Shian Su (22:50:58): > Is the use of tibble frowned upon over DataFrame for a potential Bioconductor package? I have a list-column containing matrices that will print much nicer in tibble form.

2018-10-24

Hervé Pagès (01:26:24): > @Aaron Lun Are you sure you don’t have something like exportMethods(dim) in your NAMESPACE? This exports all dim methods, even those defined for unexported classes. I don’t know a way of exporting specific methods only.

Hervé Pagès (02:04:15): > @Shian Su DataFrame is generally preferred over tibble. Maybe we can try to improve how your DataFrame displays. Please open an issue on GitHub under S4Vectors to describe your use case so Michael or I can take a look at it. Alternatively, if all your columns are lists of matrices, you could also use a matrix instead of a tibble or a DataFrame. It would be a matrix of matrices, i.e. a matrix of type list where each matrix element (m[[i, j]]) is itself a matrix. The display would probably not be as cute as tibble but maybe decent enough (and better than data.frame or DataFrame).
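
A minimal base R sketch of the "matrix of type list" idea, where each cell holds its own matrix. The row and column names (methodA, data1, and so on) and the cell contents are made up for illustration.

```r
# Four matrices of different shapes, to be stored one per cell.
cells <- list(
    matrix(1:4, nrow = 2),
    matrix(1:6, nrow = 2),
    matrix(1:9, nrow = 3),
    matrix(1:2, nrow = 1)
)

# matrix() accepts a list and fills the cells in column-major order.
m <- matrix(cells, nrow = 2, ncol = 2,
            dimnames = list(c("methodA", "methodB"), c("data1", "data2")))

cell_type <- typeof(m)                      # storage mode of the outer matrix
inner_dim <- dim(m[["methodB", "data1"]])   # [[i, j]] extracts the inner matrix
```

Single-bracket indexing (m["methodB", "data1"]) returns a list of length one, so double brackets are the way to get the matrix itself.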

Shian Su (02:10:53): > Just 1 column would be a matrix, I’m trying to write a benchmarking package and the data model I want to go with is something like this:

Shian Su (02:11:14): - File (Plain Text): Benchmarking tibble

Shian Su (02:17:57): > The idea is to be able to compose further operations onto this object, so for example a list of functions that compute metrics can be applied via a magic function to this tibble and it would acquire a new column called “metric” and result would be mutated to reflect each combination of “method” and “metric”.

Shian Su (02:28:06): > I know about SummarizedBenchmark but I think it’s mostly focused on single datasets with many single-step methods. I’m reluctant to use it for now because I’d have to make a list of SummarizedBenchmark and do a lot of wrangling to end up with a nice flat structure at the end. I think the tibble or potentially DataFrame is sufficiently informative that I don’t need to leverage more complicated structures.

Federico Marini (03:11:41) (in thread): > Inception!

Aaron Lun (05:57:08) (in thread): > Not as far as I know.

Lauren Fitch (15:46:02): > I remember seeing at Bioc-2017 there was a package demonstrated that contained pre-processed scRNA-Seq data. does anyone remember such a package?

Kevin Rue-Albrecht (15:46:22): > scRNAseq?

Lauren Fitch (15:48:34): > that might be it, thank you!

Vince Carey (15:49:57): > there is also a collection of preprocessed single-cell studies at http://imlspenticton.uzh.ch:3838/conquer/

Kevin Rue-Albrecht (15:50:34) (in thread): > No worries! It’s most likely that one, as far as I’m aware.

Aaron Lun (17:22:08): > and also DuoClustering2018.

Aaron Lun (17:22:30): > And TENxBrainData, and TENxPBMCData.

Martin Morgan (17:47:23): > maybe there is scope for updating biocViews terms to include the ‘SingleCellData’ term … http://bioconductor.org/packages/release/BiocViews.html#___SingleCellData

Levi Waldron (22:33:01) (in thread): > There’s also conquer: http://imlspenticton.uzh.ch:3838/conquer/

Levi Waldron (22:33:23) (in thread): > Oops, saw that Vince already mentioned conquer!

2018-10-26

Michael Love (15:31:36): > What channel do I use to humblebrag on new Bioc Stickers :)

Michael Love (15:33:49): > #sticker_joy

2018-10-28

Stephanie Hicks (07:40:16): > :joy:

Kevin Rue-Albrecht (07:44:38) (in thread): > Can’t wait to get my hands on a SummarizedExperiment now :smile:

2018-10-31

Ruizhu HUANG (04:37:43): > @Ruizhu HUANG has joined the channel

Federico Marini (16:15:30): > Since it’s Halloween today, I wonder whether Bioc releases should also have codenames

Federico Marini (16:15:41): > trick or treat would be too obvious in this case

Federico Marini (16:17:17): > “Did your package make it into dynamite plots rejection”?

Federico Marini (16:17:23): > but it would make release events somewhat more memorable :stuck_out_tongue:

Michael Love (17:27:47): > Thanks to the Bioc core team for the work involved in the release!

2018-11-01

Michael Love (08:55:55) (in thread): > No it got pushed back to marginal significance in subgroup

Federico Marini (16:06:26) (in thread): > In good company with it is significant if we remove that data point, and that as well

2018-11-04

Aaron Lun (08:27:49): > @channel Looking for interested volunteers to write a single-cell batch correction package. Happy to kick start the process by moving all MNN-related methods in scran to this new package.

Aaron Lun (11:41:10): > Oh, any ~~~sacrifices~~~ volunteers should reply to this message right here, and I’ll put together a google docs + repo to start things rolling.

Anthony (11:51:28) (in thread): > I’d like to volunteer! As a background, I’m a bioinformatics software developer

Peter Hickey (16:11:49) (in thread): > what’s the scope beyond moving MNN to its own package? to make existing (non-R?) methods available within BioC?

Aaron Lun (16:15:29) (in thread): > Ideally, yes, though I don’t expect myself to do any of that.

Peter Hickey (16:20:12) (in thread): > i’ll try to chip in. getting more experiments that need batch correction …

Tim Triche (17:14:26): > How about adapting BBKNN

Tim Triche (17:14:32): > It’s straightforward

Tim Triche (17:14:49): > I was considering porting it to R

Aaron Lun (17:15:03): > Sure, if you want to make a PR to this hypothetical repo, be my guest.

Tim Triche (17:15:23): > I thought the idea was to bolt it on to scran?

Tim Triche (17:16:19): > Is there a particular preference for e.g. SingleCellExperiment vs. other data structures?

Aaron Lun (17:16:54): > 1. No, it will be a separate repo, precisely because scran is getting far too bloated. Too many concepts fighting for my headspace when I look at that package.

Aaron Lun (17:17:27): > 2. SCEs will be our basic data structure, with possibly MAEs depending on how we play it.

Stephanie Hicks (20:50:53) (in thread): > happy to help here too

Shian Su (21:35:41): > Writing a C++-esque using() function, does anyone want to tell me why this is an obviously terrible idea?

Shian Su (21:37:47): - File (R): using() proposal

Shian Su (21:40:35): > I don’t know if this looks more or less confusing than counts <- SingleCellExperiment::counts

2018-11-05

Aaron Lun (04:19:42): > It’s probably a bad idea.

Aaron Lun (04:19:49): > Though I don’t really know what you want to do.

Aaron Lun (04:19:59): > Why don’t you just importMethodsFrom(SingleCellExperiment, counts)?

Shian Su (04:58:05): > Mainly for identically named functions from different packages, and to have a local explicit statement of where I’m using it from.

Aaron Lun (04:58:26): > Generics shouldn’t care.

Shian Su (05:00:04): > Real use case is not generics, just picked on SingleCellExperiment because it’s an example of a namespace I don’t want to type out every time.

Aaron Lun (05:01:33): > Your cure seems worse than the original problem.

Shian Su (05:03:51): > Maybe, that’s why I wanted opinions.

Peter Hickey (06:33:17): > not sure I understand either, but is this what conflicted::conflict_prefer() (https://conflicted.r-lib.org/) is supposed to help with? there’s a few other strategies described in its README - Attachment (conflicted.r-lib.org): An Alternative Conflict Resolution Strategy > R’s default conflict management system gives the most recently loaded package precedence. This can make it hard to detect conflicts, particularly when they arise because a package update creates ambiguity that did not previously exist. ‘conflicted’ takes a different approach, making every conflict an error and forcing you to choose which function to use.

Aaron Lun (08:46:50): > All interested parties for batch correction, please head to #sc-batch-correction @Peter Hickey @Anthony @Federico Marini @Tim Triche @Stephanie Hicks

Koen Van den Berge (15:57:06): > @Koen Van den Berge has joined the channel

Shian Su (18:22:53) (in thread): > Thanks Pete, I remember hearing about this at useR but totally forgot it existed. The point was to allow a package to use identically named functions from different packages in different functions with a local statement of what was used. So I don’t want to commit the whole package to a single package’s function, and I don’t want to pick one to be the default and other to be namespaced, for example dplyr::select() and biomaRt::select(). The “best practice” is to just namespace everything, but it gets a bit tedious, so I was thinking up ways to clean it up a little.
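
A minimal sketch of the "local statement" idea using nothing but a local alias, which is plain R scoping rather than the using() proposal itself (that file is not reproduced here). The function names moving_average and keep_positive are hypothetical, and stats::filter and base::Filter stand in for the dplyr and biomaRt functions mentioned above.

```r
# Local alias: the binding is visible only inside this function, and its
# first line documents which package's filter() is meant.
moving_average <- function(x, width = 3) {
  filter <- stats::filter
  as.numeric(filter(x, rep(1 / width, width)))
}

# A different function in the same file is free to bind another filter().
keep_positive <- function(x) {
  filter <- base::Filter
  filter(function(v) v > 0, x)
}

moving_average(1:5)
keep_positive(c(-1, 2, 3))
```

Neither alias leaks into the package namespace, so no package-wide default has to be chosen.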

2018-11-06

Avi Srivastava (10:36:30): > @Avi Srivastava has joined the channel

Rory Kirchner (10:47:21): > @Rory Kirchner has joined the channel

2018-11-09

Shian Su (00:35:16): > Any non-standard evaluation wizards here that can tell me whether the following is sufficient to suppress print() that people like to use in place of message?

Shian Su (00:35:27): - File (JavaScript): Untitled

Martin Morgan (06:39:08): > isn’t it better to get the people that like to use print not to do it?

Kasper D. Hansen (20:22:26): > Perhaps a little guide which lays out the way to do it, there are a lot of options.

Kasper D. Hansen (20:22:57): > Also, not that I think it is universally useful, I often have verbose being settable to an integer for extra verbosity
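
A small sketch of the integer verbose pattern mentioned above, using message() so that callers can silence it with suppressMessages(). The function name summarise_values and the specific levels are assumptions for illustration.

```r
# verbose = 0: silent; 1: progress notes; 2: extra detail.
summarise_values <- function(x, verbose = 0L) {
  if (verbose >= 1L) message("Summarising values")
  if (verbose >= 2L) message(sprintf("Input has %d elements", length(x)))
  mean(x)
}

summarise_values(1:3)                 # no output besides the result
summarise_values(1:3, verbose = 2L)   # two messages on stderr
suppressMessages(summarise_values(1:3, verbose = 2L))  # caller opts out
```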

2018-11-11

Shian Su (20:05:31) (in thread): > I’ve discovered a terrible side-effect to my code. Anything inside my suppressPrint() function cannot be debugged properly as the print is captured. Looks like oppression is not the answer today and I’m going to have to convince people of the evils of print() misuse.
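
For readers following along, a sketch of one way such a wrapper can be written with sink(). This is not the code from the attached file, which is not reproduced here; suppress_print and noisy are hypothetical names, and nullfile() assumes R 3.6.0 or later.

```r
# Hypothetical helper: evaluate expr while discarding anything written to stdout.
suppress_print <- function(expr) {
  sink(nullfile())
  on.exit(sink(), add = TRUE)   # always restore stdout, even on error
  force(expr)
}

noisy <- function(x) {
  print("this should have been a message()")  # stdout, so it is dropped
  message("messages go to stderr and still appear")
  x * 2
}

value <- suppress_print(noisy(21))
```

Because everything written to stdout is diverted while expr runs, interactive debugger output inside the wrapped call is hidden too, which matches the side effect described above.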

2018-11-12

Michael Love (12:30:51): > In interfacing with some Salmon files (the Gibbs or bootstrap replicates), we’ve been using con <- gzcon(file(filename, "rb")) and readBin(con, ...), and are also considering this for a new type of file (the matrix produced by alevin, a single cell quantification method). We are now considering if we should store the files differently. Does anyone have any feedback on the portability of this storage method over alternatives? (caveat is I have very little experience with these storage and portability questions)

Kasper D. Hansen (12:38:26): > A custom binary format will give you full control and potentially amazing performance, but you’re responsible for everything working. A common format will give you less control, usually less performance, but reduce your maintenance (and potentially development) overhead. I would tend to go with a common format, although that is of course going against the tradition of Bioinformatics as “the science of converting between file formats”.

Kasper D. Hansen (12:38:51): > Is this data where multiple processes need to be able to read and/or write simultaneously? That’s hard.

Michael Love (12:54:23): > @Avi Srivastava @Rob Patro multi-read/write?

Shian Su (18:28:28): > I think you generally want to stick to common formats unless you can show in benchmarks a significant performance improvement in the areas you care about. If it’s like <20% then the common formats could catch up in a few years from library updates or even just using different compression settings. Personally I compress fastq files at level 3 or 4 which takes 10% more space than level 7/8 but on my machine is twice as fast to read. There’s also third party support like pigz which gives you free parallelism.

Shian Su (18:38:39): > With regards to portability the big tripping point is probably endianness. You can see htslib having issues with it “recently” https://github.com/samtools/htslib/pull/99 - Attachment (GitHub): Support for mips/mipsel by azlicic · Pull Request #99 · samtools/htslib > Fix for issue #98 Avoid unaligned memory access on architectures that don’t support it. Fixed endianness related issues.
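
A round-trip sketch of the gzcon() plus readBin() pattern with the byte order stated explicitly, which is one way to sidestep the endianness concern. The file layout here (three 8-byte doubles) is invented for illustration and is not the Salmon or alevin format.

```r
path <- tempfile(fileext = ".gz")

# Write three doubles as 8-byte little-endian values into a gzip-compressed file.
out <- gzfile(path, "wb")
writeBin(c(1.5, 2.5, 3.5), out, size = 8, endian = "little")
close(out)

# Read them back through gzcon(), stating size and byte order explicitly
# rather than relying on the platform default.
con <- gzcon(file(path, "rb"))
vals <- readBin(con, what = "double", n = 3, size = 8, endian = "little")
close(con)

unlink(path)
```

readBin() and writeBin() default to endian = .Platform$endian, so fixing the value on both sides keeps the file readable regardless of the machine that wrote it.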

Pariksheet Nanda (18:56:56): > @Pariksheet Nanda has joined the channel

Rob Patro (21:29:25): > > Is this data where multiple processes need to be able to read and/or write simultaneously? That’s hard. > We’ve not had a need for this yet

Rob Patro (21:29:34): > We have multiple threads contributing output

Rob Patro (21:29:51): > but the writing to the gzipped stream is controlled by a mutex — so it’s safe

2018-11-13

Michael Love (07:44:17): > While we’ve had this code in tximport, it’s for the Gibbs replicates which I think not many users are generating yet, so there’s not much exposure to determine how wide scale the endian problem might be

Avi Srivastava (08:13:26): > I’d argue wrt current Alevin/Salmon usage the scale of impact might not be too high. I was also wondering how this was handled for BAM files but it looks like the issue for samtools which @Shian Su forwarded is for ARM (maybe Raspberry Pi). In my opinion, we should definitely keep an eye on the issues wrt the binary parser but it’s a long shot for a user to use Alevin on a Raspberry Pi.

Michael Love (08:14:23): > Haha

Michael Love (08:15:02): > Ok maybe proceed with current implementation. Thanks for advice Kasper and Shian

Rob Patro (08:15:37): > One random thought

Rob Patro (08:16:08): > A limitation of the current matrix format is lack of random access / reads

Rob Patro (08:16:32): > You have to decompress / load the whole thing to access it.

Rob Patro (08:16:46): > Is that a problematic limitation?

Kasper D. Hansen (08:24:35): > If you intend to use it for analysis, that’s critical. If you intend for the data format to be transient ie. it is piped to another processor which then summarizes the data for analysis, it doesn’t matter

Avi Srivastava (08:29:05): > I’d like to add, it indeed can be critical if the sparse matrix size is too big to handle (which obviously can increase very fast given we have 1.3M cells data) but currently the matrix size of the biggest dataset of 8k cells with some ~1 billion reads on the 10x website is ~100 MB.

Michael Love (08:29:54): > We are imagining that the alevin file gets piped to something like SingleCellExperiment for analysis I think

Michael Love (08:32:57): > Worst case if the current implementation doesn’t fit our needs in the future you can change the file format and tximport could detect which format it finds. So somewhat invisible to the user

Kasper D. Hansen (08:35:09): > @Avi Srivastava But that format does not allow random access, I believe. Or at least not easy random access.

Kasper D. Hansen (08:35:48): > I think I mean fast random access

Avi Srivastava (08:42:17): > I agree about the random access part, as @Rob Patro was saying we might have to load the matrix first but in my experience (which is limited to 10x and DropSeq) the size of the matrix is not too big to start with. However, I agree that it’s definitely worth thinking about random access as the size and scale of the single cell data is increasing dramatically.

2018-11-14

Aaron Lun (06:43:10): > If you’re dealing with droplet data, there might be some synergies to be had with the DropletUtils package. I don’t really know what you’re planning, but if you’re talking about a droplet-specific data structure, I’d be interested in putting some development effort into DropletUtils to match it.

Rob Patro (07:43:16): > :+1:

Michael Love (09:15:39): > I’ve put a minimal reader into tximport (based off of Avi’s code) which puts alevin into the tximport framework (txi$counts). Now I’m going to work on producing a SE-style object in tximeta, but I could use a more specific container like SCE

Rory Kirchner (10:49:47): > Awesome

Michael Love (10:56:04): > I’ll head back to #salmon2bioc for updates now. if anyone is interested feel free to join/post there

Aedin Culhane (14:44:20): > Hi, does anyone know of an OMIM -> Tissue (Uberon or Cell Ontology) mapping, or what approach would be easiest to extract this mapping?

2018-11-15

Aaron Lun (10:50:36): > Just overtook @Sean Davis for 4th spot on the BioC support site ranking. :party_parrot: :party_parrot:

Martin Morgan (10:52:38) (in thread): > let us know when you pass James… (ok, a little defensive, seeing the writing on the wall)

Dane Gellerup (14:31:57): > @Dane Gellerup has joined the channel

Lucas Schiffer (15:00:51) (in thread): > It must have been the new cat avatar that put your answers over the top!

2018-11-16

Hervé Pagès (19:42:35) (in thread): > Beware that the first 3 spots must go thru doping control at every conference!

2018-11-18

Michael Love (15:05:39): > Great to see the names of AnVIL investigators from here! Will this award be used to develop / expand on the ’Hubs? Just curious what’s in store

Stephanie Hicks (15:07:01): > @Michael Love sorry, I must have missed it. Who are the AnVIL investigators??

Michael Love (15:07:26): > https://www.genome.gov/27569268/genomic-analysis-visualization-and-informatics-labspace-anvil/

Michael Love (15:07:51): > > Department of Biology, Johns Hopkins University: James Taylor (contact PI), Jeffrey Leek (PI), Michael Schatz (PI), Enis Afgan (co-I), Kasper Hansen (co-I) > > Department of Biomedical Engineering, Oregon Health & Sciences University: Jeremy Goeks (PI), Kyle Ellrott (co-I) > > Huck Institute of the Life Sciences, Pennsylvania State University: Anton Nekrutenko (PI) > > Department of Biostatistics and Bioinformatics, Roswell Park Cancer Institute: Martin Morgan (PI) > > Department of Medicine, Brigham & Women’s Hospital: Vincent Carey (PI) > > Institute for Implementation Science in Population Health, City University of New York: Levi Waldron (PI)

Michael Love (15:08:20): > That’s the data science piece

Stephanie Hicks (15:09:17): > ooooh yes I did hear about that, but didn’t connect the acronym “AnVIL” :sad-parrot:

Martin Morgan (16:13:09): > Probably the initial efforts will be to play well with the emerging AnVIL architecture, so I imagine a package implementing data discovery & access in the context of AnVIL, rather than our Hubs. Also I imagine that work will develop ‘scalable’ AnVIL computing, which might translate into ‘best practices’ for incorporation of R tools into workflow description language and conversely exploiting highly scalable services that might become available (analogous to say the use of BigQuery to navigate the TCGA).

Martin Morgan (16:15:44): > One thing about the Hubs that I find interesting is that our current, file-based, approach is not that different from the sort of access discussed in the context of AnVIL, for instance there is no attempt in AnVIL to make a ‘big database’ of, e.g., called variants for flexible range-based queries, ‘just’ a collection of e.g., VCF files that could be processed by various tools.

Michael Love (18:15:41): > The one-big-database advocates got stuck discussing format? :rolling_on_the_floor_laughing:

Michael Love (18:22:16): > The Hubs are very useful for data discovery, so more things in that vein, newer datasets prepackaged to be used in Bioc, sounds great. Hubs are great also for teaching: fewer steps to getting real data and playing with it

2018-11-19

James Taylor (13:30:36): > @Levi Waldron and I were discussing something like hubs but driven by metadata (in parseable files of course). We would potentially use that for generating query UIs and such.

James Taylor (13:30:47): > one-big-db bad, but need some kind of schemas :wink:

Sean Davis (13:39:56): > @James Taylor and @Levi Waldron, I’d be interested in talking more about data models/schemas. Levi and I have discussed this a bit with respect to curatedMetagenomicData. There are several other datasets that could serve as use cases, including those that will ultimately go into AnVIL.

Levi Waldron (13:59:51): > I understood from the kick-off meeting that their top priorities are around a few massive data-generating projects, although I’d like to make our curated Hub resources sharable through the AnVIL too.

Levi Waldron (14:00:08): > @Sean Davis and @James Taylor yes I’d love to discuss with you.

Michael Love (16:35:08): > the query UIs are great, i used these a lot in a Comp Bio course to show, ok so in the lecture we worked with IMR90, but look at all these other files you could have used, and you can query this either programmatically or with the popup HTML

2018-11-21

sven (10:14:19): > @sven has joined the channel

Fabiola Curion (12:47:03): > @Fabiola Curion has joined the channel

2018-11-22

Aaron Lun (14:18:46): > Has there been a recent change to the win 64 builders?

Aaron Lun (14:19:03): > I did nothing to BiocNeighbors (and its upstream packages didn’t change either) but yet…

Aaron Lun (14:19:31): > ARGGHH - File (PNG): Pasted image at 2018-11-22, 7:19 PM

Martin Morgan (14:47:43): > At the top of http://bioconductor.org/checkResults/3.9/bioc-LATEST/ you can see that R was updated quite recently 2018-11-18 and of course the dependent package versions can change. Also one might be suspicious of rounding error for numerical routines. What’s the unit test doing? (this is mostly a rhetorical question since I won’t be able to look at this in any detail for several days…)

Aaron Lun (14:58:08): > It tests for my usage of the Annoy library vs RcppAnnoy’s usage of the same libraries. Annoy is an approximate nearest neighbor search algorithm, provided as a header-only C++ library within RcppAnnoy. Both RcppAnnoy and BiocNeighbors use the exact same Annoy header files during compilation of our respective C++ code. So the only difference between the packages is how we call the Annoy C++ functions. But I’m pretty sure these were working last week, and neither Dirk nor I have changed anything since.

Aaron Lun (15:15:49): > The funny thing is that it’s only failing for a very small subset of points, so there’s nothing grossly wrong.

Aaron Lun (15:20:07): > The most likely candidate is something broken with the memory mapping on windows… I wouldn’t be surprised.

Aaron Lun (15:20:31): > But then again, that shouldn’t have changed in the past week, nor should it be affected by any changes in Rdevel.

Martin Morgan (15:22:39): > failing to initialize variables hence reading arbitrary values is a typical problem in code that has intermittent failure, as I’m sure you’re aware; it’s probably the most common non-obvious development issue identified by cross-platform builds

Aaron Lun (15:23:55): > Hm. I guess I’ll throw it into valgrind tomorrow and see what pops up.

Aaron Lun (15:24:17): > I’d be surprised, but I suppose these bugs are always surprising.

Aaron Lun (15:25:04): > Everyone else’s Windows errors for the latest build seem to be connection related.

2018-11-23

Aaron Lun (10:07:44): > Hm. Bemusing. A stripped down version builds and checks on Rhub’s Rdevel windows builders.

Aaron Lun (10:12:59): > I wonder if I could try it out on the Single Package builder?

Aaron Lun (10:26:29): > R CMD check’ing BiocNeighbors with valgrind on linux is 100% clean.

Aaron Lun (12:10:56): > Anyone who wants to try it out: https://github.com/LTLA/AnnoyWinFail - Attachment (GitHub): LTLA/AnnoyWinFail > A dummy repository for testing out the failure of the BiocNeighbors with Annoy linkage on the BioC Windows machines. Damn these Windows machines. - LTLA/AnnoyWinFail

Aaron Lun (17:55:14): > The other possibility is that my usage of the Annoy library writes the index to file, whereas the RcppAnnoy call does not. If the file was somehow corrupted for some reason, you would get differences between the two calls.

Aaron Lun (18:35:38): > Well, it failed again today, so it doesn’t look like a one-off issue. :confounded:

Aaron Lun (19:32:05): > I wonder whether it would be possible to have a Bioconductor/Debugging repository where developers can raise issues that will trigger jobs in the single package builder, in the same way that Bioconductor/Contributions triggers jobs for new submissions. This would allow us to reproduce BioC build errors exactly, especially for these machine-specific bugs.

2018-11-24

Martin Morgan (12:23:55): > Using the SPB would be great, if the SPB were exactly like the daily builders, but unfortunately they are not, and one might at least sometimes go down a rabbit hole rather than identifying the problem…

Aaron Lun (12:30:25): > Well, I am flummoxed.

Aaron Lun (12:35:34): > I guess a valgrind check on the windows machines might be informative.

Aaron Lun (12:36:22): > I could also modify the code to avoid making the memory mapped file - this would confirm whether the mmap is the issue.

Aaron Lun (12:40:15): > I’d rather not do that on the master of BiocNeighbors itself, though.

2018-11-26

Lada Koneva (08:20:42): > @Lada Koneva has joined the channel

James MacDonald (10:07:07) (in thread): > Wait. Is that what the blood test was for?

2018-11-27

Qian Liu (14:22:20): > @Qian Liu has joined the channel

2018-11-28

Ludwig Geistlinger (14:39:33): > https://www.biorxiv.org/content/early/2018/10/25/452532 - Attachment (bioRxiv): A comprehensive analysis of the usability and archival stability of omics computational tools and resources > Developing new software tools for analysis of large-scale, biological data is a key component of advancing computational, data-enabled research. Scientific reproduction of published findings requires running computational tools on data generated by such studies, yet little attention is presently allocated to the usability and archival stability of computer code encapsulated as computational software tools. Scientific journals require data and code sharing, but none currently require authors to guarantee software usability and long-term archival stability of newly published tools. We developed an accurate estimation of the accessibility of computational biology software tools by performing an empirical analysis of usability and archival stability of 24,490 omics software resources published from 2000 to 2017. We found that 26% of all omics software resources are currently not accessible through URLs published in the paper. Among the tools selected for our comprehensive and systematic usability test, 49% were deemed “difficult to install,” and 28% of the tools failed to be installed due to problems in the implementation. Moreover, for papers introducing new software, we found that the number of citations significantly increased when authors provided an easy installation process for published software. We propose for incorporation into journal policy several practical solutions for increasing the widespread usability and archival stability of published bioinformatics software.

Lucas Schiffer (15:20:54) (in thread): > And what could be easier than BiocManager::install()?

2018-11-29

Aaron Lun (05:32:31): > ARGH 10X changed the format of all of their files.

Kevin Rue-Albrecht (06:18:45): > ?!?

Kevin Rue-Albrecht (06:19:02): > web link or didn’t happen

Aaron Lun (06:24:21): > https://support.10xgenomics.com/single-cell-gene-expression/software/pipelines/latest/output/matrices

Aaron Lun (06:24:25): > See the blue box.

Aaron Lun (06:24:31): > And that’s the easiest part.

Aaron Lun (06:25:12): > For example, TENxMatrix is probably broken, because the row indices in indices are now zero-indexed instead of being one-indexed as before.

Kevin Rue-Albrecht (06:28:54): > Thanks for sharing the info! :+1:

Aaron Lun (06:34:31): > Actually, it looks like TENxMatrix still works.

Kevin Rue-Albrecht (06:34:59): > What doesn’t break it… nevermind

Aaron Lun (06:35:14): > But many, many things in DropletUtils will break with 3.0.

Aaron Lun (06:56:02): > On that note, is there a way to gzip an existing file in base R? Without importing R.utils::gzip?

Kevin Rue-Albrecht (07:03:17): > Can GEOquery::gunzip help? The code looks simple enough to “borrow”

Kevin Rue-Albrecht (07:04:26): > Actually, I just noticed the “Details” section of the man page: > > This function was stripped out of R.utils due to breaking some stuff on the bioconductor build machine.

Kevin Rue-Albrecht (07:06:08): > Oh wait, it’s the opposite process (gzip) that you asked for. Sorry! Seems like only gunzip got re-packaged in GEOquery

Sean Davis (07:13:58): > I don’t think there is something in base R to do gzip. That GEOquery stuff is ancient, so there may be better alternatives at this point.

Sean Davis (07:15:41): > I have no idea how much work would be involved, but the pigz library seems like it might be useful for fast gzip compression and decompression. https://github.com/madler/pigz - Attachment (GitHub): madler/pigz > A parallel implementation of gzip for modern multi-processor, multi-core machines. - madler/pigz

Aaron Lun (08:09:12): > Hm.

Aaron Lun (08:09:21): > Well, I’ll just use R.utils for now.

Shian Su (08:15:52): > What’s the reason for not wanting to use it?

Aaron Lun (08:16:56): > Nothing really, I just didn’t want to drag in another import for a single function.

Aaron Lun (08:17:07): > … if there was already something in base. But there isn’t, so I’ll just do it.
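
For reference, base R has no one-call gzip() function, but its gzfile() connections can compress an existing file by streaming bytes through them. A minimal sketch, where gzip_file and its chunk argument are hypothetical names chosen for illustration:

```r
# Hypothetical helper: copy src into a gzip-compressed dest, chunk by chunk,
# so large files do not need to fit in memory at once.
gzip_file <- function(src, dest = paste0(src, ".gz"), chunk = 1e6) {
  input <- file(src, "rb")
  on.exit(close(input), add = TRUE)
  output <- gzfile(dest, "wb")
  on.exit(close(output), add = TRUE)
  repeat {
    bytes <- readBin(input, what = "raw", n = chunk)
    if (length(bytes) == 0L) break
    writeBin(bytes, output)
  }
  invisible(dest)
}

# Usage on a throwaway file.
src <- tempfile(fileext = ".txt")
writeLines(c("alpha", "beta", "gamma"), src)
dest <- gzip_file(src)
```

Unlike R.utils::gzip, this sketch leaves the original file in place and does no overwrite checking, so it is a starting point rather than a drop-in replacement.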

Aaron Lun (09:16:15): > Well. It’s done.

Aaron Lun (09:16:30): > All I/O functions in DropletUtils will automatically switch between version 2/3 output.

Jacob Munro (17:22:42): > @Jacob Munro has joined the channel

Carmel Maher (17:27:11): > @Carmel Maher has joined the channel

Shila Ghazanfar (17:40:01): > @Shila Ghazanfar has joined the channel

Xueyi Dong (19:21:46): > @Xueyi Dong has joined the channel

Luyi Tian (21:38:35): > @Luyi Tian has joined the channel

Peter Hickey (22:18:58): > @Aaron Lun is getting a huge shoutout and praise for his tremendous work in Bioconductor from @Stephanie Hicks in her keynote at #BioCAsia in Melbourne. > A huge thanks from all of us that benefit from your incredible work, Aaron!

Martin Morgan (22:29:37): > For the question ‘does he sleep’, (https://twitter.com/BelindaPhipson/status/1068339789928128512) searching this slack reveals probably yes, for instance on Sept 6 https://community-bioc.slack.com/archives/C8BJLSP8T/p1536220598000100 - Attachment (twitter): Attachment > Does Aaron Lun ever sleep? https://www.bioconductor.org/packages/release/workflows/html/simpleSingleCell.html #BioCAsia #abacbs2018 - Attachment: Attachment > So I went to sleep with earplugs.

2018-11-30

Kevin Rue-Albrecht (04:54:50): > I could swear I once used a function to convert seqlevels between Ensembl and UCSC, I’ve looked through rtracklayer, GenomicFeatures and GenomicRanges, but can’t find it again.

Kevin Rue-Albrecht (04:55:36): > GenomeInfoDb::seqlevelsStyle()!!!

Kevin Rue-Albrecht (04:55:47): > Praise the almighty Google

Kevin Rue-Albrecht (04:56:26): > (sorry @Aaron Lun, he’s still faster than you on occasion :stuck_out_tongue:)

Federico Marini (08:25:13): > Does google also sleep? :joy:

Kevin Rue-Albrecht (08:28:44): > moving this to #random

Aaron Lun (09:56:57): > Ha!

2018-12-04

Aaron Lun (09:09:40): > @Martin Morgan Back to our BiocNeighbors windows check discussion, this thing pops up: > > Error in x$.self$finalize() : attempt to apply non-function >

Aaron Lun (09:16:16): > Whether or not this is the cause of my actual problem is debatable.

Aaron Lun (09:22:13): > My prime candidate for the cause of the discrepancy just got ruled out by the latest build, so I’m down to compilation flags and optimization levels.

Aaron Lun (09:24:11): > tokay is running O3 while the other two machines are on O2. Don’t know if that makes a difference or not.

Aaron Lun (10:02:47): > Wait, hold on. I’ve been using int while RcppAnnoy has been using int32_t. This might affect the memory mapping on windows.

Kasper D. Hansen (10:28:35): > O3 vs O2 gives an error on Rgraphviz using GCC 8. Just FYI that optimization might matter as well

Aaron Lun (10:31:24): > Though on second look, the 32-bit tokay builds also use O3 and they work fine for BiocNeighbors.

Martin Morgan (11:06:04) (in thread): > what happens is reference classes have finalizers. Finalizers are code chunks that are run when there are no more references to the reference class, for instance at the end of an R session. Unfortunately, the exact timing of when the finalizer runs is not well-defined, and in this case the finalizer is being run after the package code has been detached. A solution is to more aggressively manage the objects that use the finalizer, so that the finalizer is run before the package code is unloaded / write the finalizer so that it does not use package code; I’m not close enough to your package to know what the problem is in greater detail; it could actually be BiocParallel, but also I think Rcpp uses reference classes; you can use options(error=debug) to enter the code and figure out what .self is for some hints.

Aaron Lun (11:13:38) (in thread): > How would I force the finalizer to run? rm() and then gc()?

Aaron Lun (11:17:22) (in thread): > I’m pretty sure that it’s some unfortunate interaction between the Rcpp classes I’m using for testing and testthat’s tendency to detach packages.

Martin Morgan (11:35:10) (in thread): > yes detaching packages can be problematic; yes rm() and gc(), maybe in .onUnload() (for detaching packages, even though it’s not exactly correct; .onDetach() would leave the references until packages importing your package are done).
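
A toy illustration of the rm() then gc() sequence discussed in this thread, using a plain environment with reg.finalizer() as a stand-in for a reference class object. The names events, note_cleanup and res are invented for the example, and this does not reproduce the BiocNeighbors failure itself.

```r
events <- character()

# Defined at top level so the finalizer's closure does not itself hold a
# reference to the object being finalized.
note_cleanup <- function(obj) {
  events <<- c(events, "finalizer ran")
}

res <- new.env()
reg.finalizer(res, note_cleanup, onexit = TRUE)

rm(res)          # drop the last reference to the object
invisible(gc())  # the collection finds it unreachable and runs the finalizer

events
```

Without the explicit gc(), the finalizer would run at some later collection or at session exit (because of onexit = TRUE), which is the ill-defined timing described above.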

Chance Hohensee (12:15:04): > @Chance Hohensee has joined the channel

2018-12-05

Aaron Lun (05:29:41): > Oh GREAT. scran also fails, now on 32-bit windows.

Aaron Lun (05:30:34): > And it fails in an incomprehensible place, too. A high-level test with no other failures in the low-level tests.

Rachael Parks (18:55:31): > @Rachael Parks has joined the channel

Aaron Lun (19:08:42): > And it just gets better. ERROR on unix, TIMEOUT on windows, OK on mac.

Aaron Lun (19:08:49): > … are the build machines okay?

Aaron Lun (19:09:53): > Geez. What a stressful way to start my holiday. Looks like there’ll be plenty to think about on my 24 hour flight.

2018-12-06

Robert Ivánek (07:43:51): > @Robert Ivánek has joined the channel

Ólavur Mortensen (08:04:09): > @Ólavur Mortensen has joined the channel

Stevie Pederson (10:25:38): > @Stevie Pederson has joined the channel

2018-12-07

Mark Robinson (02:59:12): > @Mark Robinson has joined the channel

Francisco Romero (13:24:03): > @Francisco Romero has joined the channel

2018-12-08

Aaron Lun (17:50:45): > Has anyone tried to compile packages with AVX support?

Shian Su (18:59:57): > Support or requirement? I was under the impression that you usually just write simple AVX compatible loops and let the compiler sort it out.

Aaron Lun (19:01:16): > I have to pass compile flags to clang/gcc to get it to use AVX. PKG_CPPFLAGS=-mavx, to be precise.

Aaron Lun (19:02:01): > This technically requires a configure to set up the Makevars… for which autotools apparently has a macro to check for AVX, but I’m not sure how to use it.

Shian Su (19:04:35): > Yeah not sure, I was playing with using brand spanking new C++17 std::filesystem a few months back and it turned out to require different flags under gcc and clang. Was too much of a hassle to work out.

Aaron Lun (19:05:22): > It’d be nice if we could get it up and running, it halves the runtime of a nearest neighbours search without cost.

Aaron Lun (19:06:36): > Perhaps this might be a good thing for R itself to check for (in the same manner that it checks for openMP). You would essentially get true (CPU-level) vectorization at the C/C++ level to accompany the conceptual R-level vectorization.

Shian Su (19:06:41): > Is -march=native sufficient?

Aaron Lun (19:08:31): > Possibly. Don’t know how that will interact on other compilers. Or windows.

Shian Su (19:10:42): > Should be fine on gcc and clang. Leave it out for Makevars.win and dismiss less mainstream compilers?

Aaron Lun (19:11:09): > Windows builds use GCC anyway, so that would be okay.

Shian Su (19:15:09): > I think the OpenMP way of handling it would probably be ideal though. Less chance of incompatible binaries.

Shian Su (19:16:23): > Then again I think OpenMP does something fundamentally different to instruction level tuning.

Aaron Lun (19:17:24): > Well, AVX would only work for Intel and some AMD chips, so it’s not exactly portable.

Aaron Lun (19:17:48): > openMP is more standard so I can understand why R config supports it.

Shian Su (19:18:04): > I imagine some weird case like on our WEHI servers where the packages are shared. Then maybe one of the servers has a chip a few years older than the others and suddenly can’t run a package.

Shian Su (19:18:56): > Should be opt-in somehow like the special OpenMP flags, so people know they are sacrificing portability for speed.

Aaron Lun (19:21:44): > If you have an older chip on a worker node, would the machine instructions in a binary compiled on the head node work at all?

Aaron Lun (19:21:49): > Regardless of AVX or not?

Shian Su (19:27:51): > Assuming they are just x86 instructions without fancy additions I would think so. But compilers and machine instructions are 3 levels of abstraction below what I actually understand.

Aaron Lun (19:28:43): > Hm.

Shian Su (19:29:03): > I think AVX should be fine, been around since 2011. But -march=native might enable a bit too much, so you’re probably going to have to go back to specifically detecting AVX somehow.
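
A minimal sketch of one way to do the "specifically detecting AVX" step at compile time, assuming GCC or Clang: both predefine the `__AVX__` macro when AVX code generation is enabled (for example by `-mavx`, or by an `-march` setting that implies it). The function names `compiled_with_avx` and `simd_label` are hypothetical and not taken from any package in this thread.

```cpp
#include <string>

// Returns true when this translation unit was compiled with AVX code
// generation enabled. __AVX__ is predefined by GCC and Clang in that case.
// This reports what the compiler was told to emit, not what the CPU
// running the binary actually supports.
constexpr bool compiled_with_avx() {
#if defined(__AVX__)
    return true;
#else
    return false;
#endif
}

// Hypothetical helper: a label that could be logged next to numerical
// results, so that differences between builds can be traced to flags.
inline std::string simd_label() {
    return compiled_with_avx() ? "avx" : "no-avx";
}
```

Because the check is resolved at compile time, it does not protect against the shared-library scenario described above, where a binary built with AVX ends up on an older chip; that would need a run-time CPU check instead.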

Shian Su (19:30:34): > About to go driving and VicRoads says I can’t read GCC documentation and drive. Good luck!

Aaron Lun (19:30:43): > lol

Aaron Lun (19:53:48): > Actually, manually unrolling my euclidean_distance loop gives me almost the same speedup.

Shian Su (21:09:14): > For micro-optimisations like this I recommend having a look at quick-bench.com.

Shian Su (21:11:52): > If I recall correctly compiler loop unrolling should be enabled at O3. I know this because the flag enabled is called -floop-unroll-and-jam, and I assume is a dessert at C compiler themed restaurants.

Aaron Lun (21:28:56): > Not sure what I should be looking at here.

Aaron Lun (21:40:40): > Well, anyway, perhaps I spoke too soon. The unrolled distance function was only faster when I sequentially added squared distances for every 1st, 2nd, 3rd, 4th element to separate sum variables (4 in total). Trying to add them to a single sum variable reverted performance back to what we had before. I think with the first approach, the compiler was able to automatically vectorize the addition of every set of 4 elements, while with the second approach, it could not.
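
The two variants described above can be sketched roughly as follows. This is an illustration under assumptions, not the actual code from the thread: the function names are hypothetical, `double` is assumed as the element type, and the squared distance is returned to keep the example short.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Single accumulator: each addition depends on the previous one, so the
// compiler cannot reorder the sum unless relaxed floating-point flags
// allow it.
double squared_distance_single(const std::vector<double>& x,
                               const std::vector<double>& y) {
    assert(x.size() == y.size());
    double sum = 0.0;
    for (std::size_t i = 0; i < x.size(); ++i) {
        const double d = x[i] - y[i];
        sum += d * d;
    }
    return sum;
}

// Four accumulators: elements 0,4,8,... go to s0, elements 1,5,9,... to
// s1, and so on. The four running sums are independent of each other,
// which gives the compiler room to vectorize the loop body.
double squared_distance_unrolled(const std::vector<double>& x,
                                 const std::vector<double>& y) {
    assert(x.size() == y.size());
    const std::size_t n = x.size();
    double s0 = 0.0, s1 = 0.0, s2 = 0.0, s3 = 0.0;
    std::size_t i = 0;
    for (; i + 4 <= n; i += 4) {
        const double d0 = x[i] - y[i];
        const double d1 = x[i + 1] - y[i + 1];
        const double d2 = x[i + 2] - y[i + 2];
        const double d3 = x[i + 3] - y[i + 3];
        s0 += d0 * d0;
        s1 += d1 * d1;
        s2 += d2 * d2;
        s3 += d3 * d3;
    }
    // Leftover elements when n is not a multiple of 4.
    double tail = 0.0;
    for (; i < n; ++i) {
        const double d = x[i] - y[i];
        tail += d * d;
    }
    return ((s0 + s1) + (s2 + s3)) + tail;
}
```

The two functions add the same terms in a different order, so for general inputs their results may differ in the last bits even though both are valid answers.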

Aaron Lun (21:43:51): > This poses a more general problem in that it’s not possible to obtain exactly the same distance if the order of addition changes. The difference is minuscule but it is problematic in my use-case where I’m planning to use the distances in the t-SNE calculation; minor differences in the distances would result in large differences in the output. So, for example, you would get reproducibly different results depending on whether you compiled on a machine with or without AVX… which is not good.

Shian Su (22:22:13): > Quick-bench lets you benchmark different code snippets. Like microbenchmark in R, it’s based on Google benchmark but the web interface lets you easily swap compilers and optimisations.

Aaron Lun (22:23:33): > I don’t see how it avoids the compiler optimizing away things that are known at compile-time

Shian Su (22:25:11): > There are macros in google’s framework that take care of it. Not super sure when it needs to be used.

Aaron Lun (22:25:54): > In any case, when I set up the AVX to enforce the addition order of the squared values, it slows back down again.

Aaron Lun (22:26:01): > So maybe it’s more trouble than it’s worth.

Shian Su (22:26:03): > Why should the addition not be associative? Or is that not what you’re implying?

Aaron Lun (22:26:47): > They’re floats, addition is not associative.

Aaron Lun (22:27:55): > As in, I do not get the exact same numbers down to individual bits. This is important for error-propagating procedures like t-SNE, where errors in the 16th dec place yield completely different results after 1000 iterations.
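
A small sketch of why the grouping matters, assuming IEEE 754 doubles and no relaxed floating-point flags such as `-ffast-math`. The function names are hypothetical.

```cpp
// Sum three values with the grouping made explicit.
double sum_left_first(double a, double b, double c) {
    return (a + b) + c;
}

double sum_right_first(double a, double b, double c) {
    return a + (b + c);
}
```

With `a = 1e20`, `b = -1e20`, `c = 1.0`, the first grouping cancels the large terms before adding 1, while the second loses the 1 entirely because it is far below the spacing between representable doubles near 1e20. Less extreme inputs such as 0.1, 0.2 and 0.3 typically differ only in the last digit, which is the kind of discrepancy that an iterative method can then amplify.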

Shian Su (22:29:18): > Sounds like a good paper to write telling people to never use tSNE.

Aaron Lun (22:29:19): > Perhaps I won’t worry about it and just forcibly recalculate all the distances in scater::runTSNE.

Aaron Lun (22:29:58): > Probably umap would have the same issue. Most non-linear dimensionality reduction algorithms probably do, I guess the space is just too lumpy to converge to a single point reliably.

Shian Su (22:32:02): > Personally I’d just explain why this method is unstable, explain the part of the method that IS stable and worth interpreting, and not try to cast the illusion that it’s a stable method.

Aaron Lun (22:34:24): > There is nothing in the method that is quantitatively stable.

Shian Su (22:36:27): > The neighbourhoods should be relatively stable, I know they might not be, but then why are we bothering basing scientific discovery on what is effectively a random number generator?

Shian Su (22:47:17): > If you really want to go down that rabbit hole you might want to check if order of operations in C++ are even guaranteed.

Aaron Lun (22:50:39): > I know they are for a fact, because I get exactly identical output from Rtsne compared to the original bhtsne code. See https://github.com/LTLA/RtsneTestSuite. - Attachment (GitHub): LTLA/RtsneTestSuite > Tests the Rtsne package against the original bhtsne implementation. - LTLA/RtsneTestSuite

Aaron Lun (22:51:14): > And I don’t use t-SNEs for scientific discovery. It’s just a way of generating pretty plots for Figure 1.

Aaron Lun (22:52:53): > In any case, we’ve gone off-topic a bit. The other major reason for wanting exact output is to facilitate rigorous unit tests. Currently, I can expect_identical for the output with different parameter settings. This wouldn’t be remotely possible otherwise due to the error propagation of the algorithm.
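
The distinction between an exact test and a tolerance-based one can be sketched at the C++ level as below. This is only a rough analogue of an exact-identity check, not how testthat implements it; the function names are hypothetical.

```cpp
#include <cassert>
#include <cmath>
#include <cstddef>
#include <cstring>
#include <vector>

// True only if both vectors have the same length and every element has
// exactly the same bit pattern.
bool bitwise_identical(const std::vector<double>& a,
                       const std::vector<double>& b) {
    if (a.size() != b.size()) return false;
    if (a.empty()) return true;
    return std::memcmp(a.data(), b.data(), a.size() * sizeof(double)) == 0;
}

// True if both vectors have the same length and every pair of elements
// differs by at most tol in absolute value.
bool approx_equal(const std::vector<double>& a,
                  const std::vector<double>& b, double tol) {
    assert(tol >= 0.0);
    if (a.size() != b.size()) return false;
    for (std::size_t i = 0; i < a.size(); ++i) {
        if (std::fabs(a[i] - b[i]) > tol) return false;
    }
    return true;
}
```

A change in summation order that shifts a value by one unit in the last place passes `approx_equal` with any reasonable tolerance but fails `bitwise_identical`, which is why an exact test only works when the arithmetic itself is reproducible.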

Shian Su (22:57:12): > Right, in that case I agree that AVX is probably not worth the trouble.

Aaron Lun (22:59:55): > I should probably forcibly disable AVX for RcppAnnoy to avoid potential cases where some wiseguy sets up -mavx on their personal Makevars.

Aaron Lun (23:01:22): > This would result in differences in behaviour… wait. Holy crap.

Aaron Lun (23:01:40): > Maybe this is the reason why BiocNeighbors doesn’t agree with RcppAnnoy despite them both having the same headers.

Aaron Lun (23:02:10): > Tokay2 uses -march=native, which presumably activates AVX for BiocNeighbors.

Aaron Lun (23:02:27): > But RcppAnnoy is probably taken from CRAN as a pre-built binary, where it might be compiled without AVX.

Shian Su (23:05:10): > I personally would like to thank you for standing at the gates of numerical hell to protect the rest of us from this nightmare.

2018-12-09

Shian Su (15:34:47) (in thread): > Did this end up being the problem?

Aaron Lun (15:37:08) (in thread): > don’t know, hasn’t rebuilt yet.

2018-12-10

Guy Horev (07:58:55): > @Guy Horev has joined the channel

Aaron Lun (19:58:08): > Comparison of tokay1 and tokay2 indicates some differences in compilation flags: -O3 -march=native -mtune=native vs -O2 -mtune=generic.

Aaron Lun (19:59:43): > Is this a change in the BioC or Rtools defaults?

Shian Su (21:21:08): > What are the magnitudes of these discrepancies @Aaron Lun?

Aaron Lun (22:22:26): > Rare and minor, see http://bioconductor.org/checkResults/devel/bioc-LATEST/BiocNeighbors/tokay2-checksrc.html

Aaron Lun (22:27:42): > which makes it all the more puzzling.

Shian Su (23:00:51): > So do you know how CRAN actually build their binaries?

Aaron Lun (23:08:01): > Not specifically. Only what I can see on Rhub, but I expect that’s not the same.

Aaron Lun (23:14:31): > Rhub has -O2 and -mtune=native, I think.

Shian Su (23:55:01): > IIRC C++ programmers complain about MinGW containing some ancient versions of GCC

Aaron Lun (23:55:37): > I don’t think that’s the problem, unless Rtools degraded between tokay1 and tokay2.

Shian Su (23:58:56): > I’m generally not a fan of GCC < 5.0

2018-12-11

Shian Su (00:02:05): > I think that was the era where they were implementing C++11 and many bugs crept in. It’s where the wisdom of not using O3 came from. 4.8.2 specifically used to error on some scPipe code because they didn’t follow the C++ spec properly.

Shian Su (00:03:52): > Very annoying because 4.8-4.9 are so widely used today.

Shian Su (00:10:36): > https://stackoverflow.com/a/49119902/3999260 gives some context on the state of C++11 in 4.x GCC releases. - Attachment (Stack Overflow): Is it safe to link C++17, C++14, and C++11 objects > Suppose I have three compiled objects, all produced by the same compiler/version: A was compiled with the C++11 standard B was compiled with the C++14 standard C was compiled with the C++17 standa…

Aaron Lun (01:20:38): > Funnily enough it works on the 32-bit library.

Shian Su (01:22:14): > 32-bit of which library?

Aaron Lun (01:22:43): > Compiled in 32-bit mode.

Aaron Lun (01:22:49): > BiocNeighbors; see the check results.

Shian Su (01:25:46): > I see, only thing I know of is int sometimes being 64 bit.

Shian Su (01:27:30): > Might the RNG be different?

Aaron Lun (01:28:26): > I thought of that too, but the RNG seed and integer types are hard-coded.

Shian Su (01:37:27) (in thread): > Can you explain the precision difference between RcppAnnoy and BiocNeighbor to me?

Aaron Lun (01:44:04) (in thread): > If I could, I would have solved the bug.

Kevin Rue-Albrecht (04:15:51): > Time for a #compilation channel?

Aaron Lun (05:11:54): > We could do that. #bioc-builds

Kevin Rue-Albrecht (05:14:49): > better name indeed

2018-12-12

Lorena Pantano (15:26:20): > <!channel> Would anybody who has experience with FFPE DNA/RNASeq analysis (or who knows somebody who does) like to give a talk during a satellite meeting at the ABRF conference? https://conf.abrf.org/sw4-use-ffpe-next-generation-sequencing-considerations-bedside-bench

Christian P. Larsen (16:08:14): > @Christian P. Larsen has joined the channel

2018-12-13

Kasper D. Hansen (09:12:28): > I have worked together with Ben Larman on an assay called LISH for doing RNA in FFPE samples. The proof-of-concept work is extremely promising, but it’s not like it is a commodity (yet). I think Ben could give a great talk on the assay-wise issues with FFPE and RNA

Lorena Pantano (10:05:01): > oh! please @Kasper D. Hansen, can you put us in contact?

Kasper D. Hansen (10:57:07): > I will. Could you DM me your email?

Kasper D. Hansen (10:57:14): > The proof-of-concept paper is this

Kasper D. Hansen (10:57:14): > https://www.ncbi.nlm.nih.gov/pubmed/28854731 - Attachment (ncbi.nlm.nih.gov): Multiplexed analysis of fixed tissue RNA using Ligation in situ Hybridization. - PubMed - NCBI > Nucleic Acids Res. 2017 Aug 21;45(14):e128. doi: 10.1093/nar/gkx471.

Kasper D. Hansen (10:57:47): > Ben is great but he is 100% assay-development and biology. He is not what we would call an analysis person

Kasper D. Hansen (10:58:20): > That is what the satellite meeting seems to be about, though (i.e. assays and not analysis)

Michael Lawrence (12:40:40): > @Michael Lawrence has joined the channel

Michael Lawrence (13:27:25): > If anyone here wants a job helping us at Genentech build cool stuff on top of Bioconductor, please let me know, thanks.

Rena Yang (13:51:43): > @Rena Yang has joined the channel

Wenhua Ren (17:46:22): > @Wenhua Ren has joined the channel

2018-12-14

Konstantinos Geles (Constantinos Yeles) (08:52:36): > @Konstantinos Geles (Constantinos Yeles) has joined the channel

2018-12-16

Lluís Revilla (07:51:31): > @Lluís Revilla has joined the channel

2018-12-17

Vladimir Kiselev (06:59:26): > Hello, > > We have a warning related to defining an S4 object: Warning: undefined slot classes in definition of "MyObject": index(class "Rcpp_ModuleClass") More details are here: https://stackoverflow.com/questions/53814548/rcpp-class-wrapped-in-s4-object-warning Could anyone please help? - Attachment (Stack Overflow): Rcpp Class wrapped in S4 object warning > I have an R package that utilises an Rcpp Module that contains an Rcpp class that is exposed to the R namespace. The package builds OK and runs fine. Here is the S4 object definition: #’ The main…

Aaron Lun (07:16:48): > Collation order?

Vladimir Kiselev (08:07:43): > what is collation order?

Aaron Lun (08:09:00): > If your S4 class is defined before the Rcpp module is loaded, the module class won’t be visible during setClass.

Aaron Lun (08:09:27): > R files are collated in alphanumeric order so if you want to change this you need to set Collate: in the DESCRIPTION.

Vladimir Kiselev (08:10:03): > ok, many thanks, will check now

Vladimir Kiselev (11:11:19): > @Aaron Lun do you have any working example of this thing?

Michael Lawrence (14:13:56): > "Rcpp_ModuleClass" does not appear to be an Rcpp class. Maybe you mean just "Module"? See getClasses(getNamespace("Rcpp")).

Michael Lawrence (14:30:13): > Btw, the comment in that SO post is incorrect. Reference classes are S4 classes. It’s interesting how such misconceptions perpetuate.

Michael Lawrence (14:32:42): - File (R): Untitled

Aaron Lun (19:51:52) (in thread): > InteractionSet

2018-12-18

Vladimir Kiselev (05:33:33): > many thanks @Michael Lawrence! This is what we get:
> 
> > library(scfind)
> > new('EliasFanoDB')
> Error in getClass(Class, where = topenv(parent.frame())) :
>   "EliasFanoDB" is not a defined class
> > new('Rcpp_EliasFanoDB')
> C++ object <0x7fc47fe66f50> of class 'EliasFanoDB' <0x7fc47fc8b110>
> > getClass(new('Rcpp_EliasFanoDB'))
> C++ object <0x7fc47fe6be90> of class 'EliasFanoDB' <0x7fc47fc8b110>

Vladimir Kiselev (05:37:42): > ModuleClass in the SO question corresponds to EliasFanoDB from the example above

Vladimir Kiselev (05:39:20) (in thread): > thanks Aaron!

Michael Lawrence (12:24:18): > I’m not very familiar with Rcpp, but you probably need to do something to trigger the definition of that module class prior to defining your wrapper class.

2018-12-19

Leo Lahti (04:37:00): > @Leo Lahti has joined the channel

Levi Waldron (14:33:21) (in thread): > Sorry for the slow reply @Lorena Pantano. You might ask Svitlana Tyekucheva, scientist at DFCI in Giovanni Parmigiani’s group.

Levi Waldron (14:35:31) (in thread): > Also, Shuji Ogino’s group has done such work on NHS/HPFS CRC FFPE blocks, not sure who is doing his analysis though.

Levi Waldron (14:36:37) (in thread): > I can put you in touch if helpful.

Lorena Pantano (14:45:18) (in thread): > please do! really appreciate this!

Lorena Pantano (14:45:54) (in thread): > @Levi Waldron you can use my work email lpantano@hsph.harvard.edu

Levi Waldron (17:16:53): > Some years ago I remember hearing about a measure of how much quantile normalization changed the measurements per sample. ie an outlier sample whose measurements changed a lot in quantile normalization would have a large value, a typical sample would have a small value. Anyone remember the name of that statistic / method?

Federico Marini (17:24:45): > Does this have to do with the quantro method?

Federico Marini (17:25:29): > That is the closest I could try to get from the description, but maybe it is just something very different

Tim Triche (17:54:50): > That sounds like Quantro based on what I remember from reviewing it. Like an adaptive quartile norm

Stephanie Hicks (20:06:33): > @Levi Waldron quantro is a test to see if there are global changes in distribution of your samples. It uses the quantiles and asks what’s the variation between groups relative to within groups, but takes into account the whole distribution. If var(between groups) > var(within groups) then you have global changes in your data -> assumptions of quantile normalization (and any other global normalization method) are violated. https://genomebiology.biomedcentral.com/articles/10.1186/s13059-015-0679-0 - Attachment (Genome Biology): quantro : a data-driven approach to guide the choice of an appropriate normalization method > Normalization is an essential step in the analysis of high-throughput data. Multi-sample global normalization methods, such as quantile normalization, have been successfully used to remove technical variation. However, these methods rely on the assumption that observed global changes across samples are due to unwanted technical variability. Applying global normalization methods has the potential to remove biologically driven variation. Currently, it is up to the subject matter experts to determine if the stated assumptions are appropriate. Here, we propose a data-driven alternative. We demonstrate the utility of our method (quantro) through examples and simulations. A software implementation is available from http://www.bioconductor.org/packages/release/bioc/html/quantro.html .

Tim Triche (22:30:54): > so qsmooth then?

Tim Triche (22:31:00): > that was the one I remembered reviewing

Tim Triche (22:31:24): > https://github.com/stephaniehicks/qsmooth - Attachment (GitHub): stephaniehicks/qsmooth > Smooth quantile normalization (qsmooth) is a generalization of quantile normalization, which is an average of the two types of assumptions about the data generation process: quantile normalization …

2018-12-28

Rene Welch (12:41:35): > @Rene Welch has joined the channel

2018-12-30

Evan Biederstedt (14:37:32): > @Evan Biederstedt has joined the channel

2019-01-03

Aedin Culhane (13:12:01): > Is there an R API for Google App Scripts/G-Suite?

Sean Davis (13:16:55): > I don’t think so. Here is where I usually start looking: https://cloudyr.github.io/packages/index.html - Attachment (cloudyr.github.io): the cloudyr project > Here’s the list of R packages that we are currently working on.

Sean Davis (13:21:02): > With a combo of googleAuthR and API documentation, it shouldn’t be too much work to implement basic functionality if you need it.

Aedin Culhane (16:50:32): > Currently checking out https://github.com/gsuitedevs/apps-script-samples - Attachment (GitHub): gsuitedevs/apps-script-samples > Apps Script samples for G Suite products. Contribute to gsuitedevs/apps-script-samples development by creating an account on GitHub.

2019-01-04

Francisco Romero (00:17:38): > Does anybody know of a Bioconductor-focused postdoctoral position?

Stephanie Hicks (10:58:35): > Any particular area of genomics @Francisco Romero for Bioconductor software development?

Lori Shepherd (13:44:48) (in thread): > occasionally people will post opportunities on the support site as well - https://support.bioconductor.org/t/Jobs/

Kasper D. Hansen (14:59:50): > Also, is it a PD where you want to do some amount of Bioconductor-related work or should it exclusively be Bioconductor? I (and many others) have positions of the first kind.

2019-01-07

Amali Thrimawithana (21:44:56): > @Amali Thrimawithana has joined the channel

2019-01-10

Aaron Lun (03:24:43): > Wow, we can blog on the support site.

Aaron Lun (03:25:04): > Gotta think of my first blog topic.

Aaron Lun (03:25:13): > https://support.bioconductor.org/p/116724/

Shian Su (03:29:44): > Another source of Bioc support point income?

Aaron Lun (05:07:06): > I could use it to rant about single-cell analysis. God knows how much I have to complain about.

Lluís Revilla (05:08:45): > I think this is inherited from https://biostars.com but it is discouraged as they got removed from the top bar

Lluís Revilla (05:09:24): > BTW perhaps in the support forum it could make a reference to https://cran.r-project.org/package=reprex to make the questions reproducible

Aaron Lun (05:10:44): > I think I’d need a “The opinions in this post are the author’s own and do not reflect the position of the wider Bioconductor community.”

Aaron Lun (05:10:57): > Because oh boy, I have a lot of opinions.

Aaron Lun (05:13:03): > Ah, Martin removed the blog tag…

Aaron Lun (05:13:50): > There goes my weekly “Aaron’s Life” series.

Lluís Revilla (05:14:15): > But it is still shown under the “Post Type*” when asking a question

Aaron Lun (05:14:20): > Where you get a weekly dose of Aaron’s ~~opinions~~ facts.

Aaron Lun (05:14:55): > Gee, some of the post types seem sorta pointless.

Aaron Lun (05:15:04): > What the hell’s a “Page”?

Aaron Lun (05:16:13): > Or a “Forum”?

Martin Morgan (05:17:02): > The support site went through a major overhaul ‘behind the scenes’; there are some regressions that will be addressed, including restricting the types of posts. So you’ll have to find some other forum for your blog pages.

Aaron Lun (05:17:25): > Aw, bummer.

Aaron Lun (05:18:23): > Maybe I could get AAAS to give me a domain name a la https://blogs.sciencemag.org/pipeline/

Federico Marini (05:40:24): > You should aim for your space on Forbes a la Steven Salzberg :slightly_smiling_face:

Aaron Lun (05:44:18): > In any case, the answer header is a bit messed up, e.g., “Answer: A: Error in installation of scran”.

Martin Morgan (05:48:27): > Issues should be reported on https://github.com/Bioconductor/support.bioconductor.org/issues

Sowmya S Manian (06:14:48): > @Sowmya S Manian has joined the channel

Shiwani (06:17:07): > @Shiwani has joined the channel

Aaron Lun (06:18:39): > Done.

2019-01-11

Aaron Lun (07:47:02): > How are people getting syntax highlighting on their code blocks on the support site? I’m doing ```r but this isn’t working.

Aaron Lun (07:48:02): > Compare my ugly mono-color code blocks in the answer to the cool highlights in the question. https://support.bioconductor.org/p/116740/

Lori Shepherd (07:48:51) (in thread): > what if you don’t include the r? just ```

Aaron Lun (07:50:12) (in thread): > Hm - that sounds plausible. I’ll try the next time I post something. Old Github habits die hard, it seems.

Lori Shepherd (07:52:23) (in thread): > Keep me posted on if that works or not

Aaron Lun (07:54:41) (in thread): > Just went back and edited my answer; yep, it was the “r”.

Aaron Lun (07:54:50): > It’s pretty again! TL;DR - don’t use “r”.

Federico Marini (11:34:20): > cool one @Lori Shepherd!

Aaron Lun (20:35:23): > Any csaw users on this channel who could do me a favour?

2019-01-12

Francisco Romero (15:30:12) (in thread): > thanks

2019-01-14

Rosalba Giugno (03:53:45): > @Rosalba Giugno has joined the channel

Antonino Aparo (05:06:00): > @Antonino Aparo has joined the channel

2019-01-15

Franziska Haertner (03:43:36): > @Franziska Haertner has joined the channel

2019-01-19

Alexey Uvarovskii (06:42:03): > @Alexey Uvarovskii has joined the channel

Aaron Lun (11:06:46): > Community question: csaw has a user’s guide that cannot be reasonably compiled as part of the build system (takes about an hour). I currently compile it externally and update the PDF in the package repository. This is unappealing because (i) it involves adding a somewhat large (800 kb) binary file to the source code, and (ii) it is not dynamically rebuilt with new package versions. Are there any other options? For example, I could create a workflow package that only exists to compile the user’s guide.

Kasper D. Hansen (11:12:25): > but then the guide would not be part of the package

Kasper D. Hansen (11:13:36): > Anyway, I don’t have a solution. It is in principle the same with tests: for some analysis applications it is an enormous amount of work to get realistic analysis examples which run very fast. And it might not be possible. That is a tough issue

Kasper D. Hansen (11:13:48): > I guess solutions are

Kasper D. Hansen (11:15:06): > 1) make everything run fast on small examples > 2) compile the user’s guide externally > 3) split the user guide into two: one is a guide and the other is a more realistic analysis example. Then the guide could be compiled as part of the package and the (longer) analysis example could be compiled externally either by yourself or as a workflow

Kasper D. Hansen (11:15:23): > All of this is work

Martin Morgan (11:28:01): > external compilation is usually a disaster, for us mere mortals, because of the surprising ease with which bit rot sets in. Thinking of the vignette as illustrative rather than definitive seems to be the way to go, with ExperimentData or workflow packages for more comprehensive analysis. > > Of course there are examples that contradict the mere mortal limitations, like the limma user guide, but the combination of single-minded discipline and focus of both maintainer and package, and limited dependencies really make that the very rare exception.

2019-01-20

Aaron Lun (05:11:29): > I don’t mind putting in a little in-package vignette demonstrating how all of the functions are supposed to work. But - I mean, I’ve already got the user’s guide written, so I’d like some automated build of it somewhere. Now that all the relevant BAM files are available via an ExperimentData package, it would seem that the best way to go would be to create a separate workflow package for the UG?

2019-01-21

Aaron Lun (05:21:33): > Done.

Kasper D. Hansen (11:34:10): > @Michael Lawrence, @Martin Morgan I am being bitten by the fact that pmax() does not support long vectors. I don’t think I can make a fix to this in base R, but is this something I should ask about on R-devel or not? I guess I am asking about what state long vectors are in. Is this, for example, just an oversight (i.e. would a report be appreciated), or is there no goal of simple functions universally supporting long vectors?

Martin Morgan (12:19:25): > I’d suggest asking on R-devel.

Kasper D. Hansen (12:37:56): > done

2019-01-22

Kasper D. Hansen (11:20:27): > There was some interest in someone (me, us) compiling a list of basic functions which do not support long vectors (such as pmax()). This seems a great short term goal. With a list of this, it seems like people are interested in addressing the deficiencies.

2019-01-23

Eliza Duvall (10:31:09): > @Eliza Duvall has joined the channel

2019-01-24

Steve Lianoglou (13:45:22): > @Steve Lianoglou has joined the channel

Ming Tang (19:33:55): > @Ming Tang has joined the channel

F (19:35:29): > @F has joined the channel

Jingxin Fu (19:38:05): > @Jingxin Fu has joined the channel

Kylie Bemis (19:40:42): > @Kylie Bemis has joined the channel

Jiwon Lee (19:43:01): > @Jiwon Lee has joined the channel

2019-01-26

Ying Xu (07:49:53): > @Ying Xu has joined the channel

2019-01-28

Sukalyan Banga (11:20:47): > @Sukalyan Banga has joined the channel

Nathan Sheffield (11:50:35): > @Nathan Sheffield has joined the channel

2019-01-29

Leandro Roser (07:00:51): > @Leandro Roser has joined the channel

Michael Love (16:46:34): > question about BiocCheckGitClone, it doesn’t like .log files. But I want to include log files in this experiment data package. Should I rename these files to file.log.txt to get past the checker?

Hervé Pagès (17:13:12): > IMO BiocCheckGitClone should allow .log files as long as they are somewhere under inst/. Any other location is suspect and probably a leftover, especially if in vignettes/. In any case, BiocCheckGitClone should report a warning and not an error for this (maybe that’s what it does already, I don’t know).

Michael Love (17:41:29): > This is in inst/extdata and inst/scripts

Michael Love (17:41:49): > http://bioconductor.org/spb_reports/oct4_buildreport_20190129163332.html

Shian Su (18:45:24): > Has anyone assembled any guidance on the various ways to access genomic annotation on Bioconductor and how they differ from each other?

Nitesh Turaga (19:02:41): > Hi @Michael Love, BiocCheckGitClone doesn’t seem to allow .log files as of the current version:
> 
> BiocCheck:::checkBadFiles
> function (package_dir)
> {
>     ## Extensions which are not allowed
>     hidden_file_ext = c(".renviron", ".rprofile", ".rproj", ".rproj.user",
>         ".rhistory", ".rapp.history", ".o", ".sl", ".so", ".dylib",
>         ".a", ".dll", ".def", ".ds_store", "unsrturl.bst", ".log",
>         ".aux", ".backups", ".cproject", ".directory", ".dropbox",
>         ".exrc", ".gdb.history", ".gitattributes", ".gitmodules",
>         ".hgtags", ".project", ".seed", ".settings", ".tm_properties")
> 
>     ## check in the entire package structure including /inst
>     fls <- dir(package_dir, ignore.case = TRUE, recursive = TRUE,
>         all.files = TRUE)
>     .......
> }
> 
> So, maybe renaming to file.log.txt like you said will bypass this.

Michael Love (19:07:41): > ok will do. thank you

Ludwig Geistlinger (19:42:42) (in thread): > You mean something like https://bioconductor.github.io/BiocWorkshops/introduction-to-bioconductor-annotation-resources.html ? - Attachment (bioconductor.github.io): The Bioconductor 2018 Workshop Compilation > This book contains all the workshops presented at the Bioconductor 2018 Conference

Shian Su (19:48:25) (in thread): > I think a workshop like that one caused me to think “Wow, there are so many ways to get annotation on BioC, which one should I choose?” So I was thinking of a higher level overview to help guide someone through that decision.

Ludwig Geistlinger (19:49:53) (in thread): > You mean like a cheat sheet?

Ludwig Geistlinger (19:51:35) (in thread): > Also: https://www.bioconductor.org/packages/release/workflows/html/annotation.html - Attachment (Bioconductor): annotation > Annotation resources make up a significant proportion of the Bioconductor project. And there are also a diverse set of online resources available which are accessed using specific packages. This walkthrough will describe the most popular of these resources and give some high level examples on how to use them.

Ludwig Geistlinger (19:53:05) (in thread): > Also: https://www.bioconductor.org/help/course-materials/ and then type annotation in the search window

Shian Su (19:54:23) (in thread): > Thanks! I think the annotation workflow about covers it.

2019-01-30

Jul (00:24:36): > @Jul has joined the channel

2019-01-31

Daniel Sink (18:29:04): > @Daniel Sink has joined the channel

2019-02-01

Juan Monroy-Nieto (18:27:33): > @Juan Monroy-Nieto has joined the channel

2019-02-05

Yoon, Tae-Hyun (10:56:35): > @Yoon, Tae-Hyun has joined the channel

2019-02-06

Nigel Delaney (14:16:14): > @Nigel Delaney has joined the channel

Nigel Delaney (14:23:32): > Quick question for the community here, does bioconductor have any “landing pages” for particular subject areas? In particular, I’ve noticed that there are a lot of packages in bioconductor that can do wonderful things with single-cell data, e.g. https://bioconductor.org/help/search/index.html?q=single+cell/. I’m writing documentation for a website and we’d like to point people to bioconductor, and was hoping to find something intermediate between a general link to the page and an enumeration of all the possible packages.

Nitesh Turaga (14:24:36): > No “landing pages” as such, at least not by the usual definition of landing pages, but there is http://bioconductor.org/packages/release/BiocViews.html#___DifferentialExpression and other such fields. BiocViews groups packages based on functionality.

Nitesh Turaga (14:25:22): - File (PNG): Screen Shot 2019-02-06 at 2.24.42 PM.png

Nitesh Turaga (14:25:30): - File (PNG): Screen Shot 2019-02-06 at 2.24.53 PM.png

Nigel Delaney (14:30:45): > Ah thank you! I see there is an Assay->SingleCellWorkflow with two entries, I wonder if at this stage it would also make sense to have an AssayDomain->SingleCell category

Nigel Delaney (14:34:32): > Ah, I see there is Technology->SingleCell to cover that

2019-02-07

Jianhong (09:46:32): > @Jianhong has joined the channel

2019-02-08

Aaron Lun (09:15:10): > Woah, BioC package submissions really are an endless stream.

Aaron Lun (09:15:51): > Seems similar to the feeling of working at a post office.

Kasper D. Hansen (09:26:19): > Or following your Github account

Aaron Lun (09:28:52): > ha

Patrick Kimes (13:26:34): > @Patrick Kimes has joined the channel

Stephanie Hicks (16:47:59): > hi @Patrick Kimes!

2019-02-10

Aaron Lun (09:34:18): > What’s the etiquette for posting on the main R mailing lists? I think I’ve found a bug in Matrix, wondering whether I should go to R-help or R-devel.

Aaron Lun (10:22:25): > Well, whatever, it’s on R-help now.

2019-02-13

Vladimir Kiselev (12:03:39): > Is there a repository of compiled Bioconductor packages for ubuntu, something like this for CRAN packages: https://launchpad.net/~marutter/+archive/ubuntu/c2d4u3.5 - Attachment (launchpad.net): cran2deb4ubuntu_3.5 : Michael Rutter > R packages for Ubuntu LTS. Based on CRAN Task Views.

Marcel Ramos Pérez (13:40:41): > Hi Vladimir, @Vladimir Kiselev we have the BiocViews page: http://bioconductor.org/packages/release/BiocViews.html#___Software If you’re interested in just a list of software packages, you can go to https://git.bioconductor.org/

Vladimir Kiselev (14:00:16): > thanks @Marcel Ramos Pérez, maybe I didn’t explain well. It looks like when I run BiocManager::install some packages have to be compiled and it can take a lot of time if there are a lot of them. Whereas, when I run apt-get install r-cran-... all of those packages are already precompiled so it is really very fast

Vladimir Kiselev (14:02:20): > so, do you know if there is any repo of pre-compiled bioc packages for ubuntu users?

Marcel Ramos Pérez (15:28:34): > I’m not aware of a repository of pre-compiled packages. I imagine that this would be an issue with mainly Linux users as other platforms use binary packages. You could also try our Bioc Docker containers (https://bioconductor.org/help/docker/) which are shipped with a subset of packages.

Jianhong (17:16:53): > @Vladimir Kiselev Did you try conda? e.g. conda install -c bioconda bioconductor-genomicfeatures

Vladimir Kiselev (17:20:58): > yes, I’ve tried r packages in conda and never had a good experience with them. Don’t know what the problem was, but I was catching a lot of segmentation fault errors after loading any library. Also I don’t think conda will solve my issue because resolving a conda environment usually takes ages.

Shian Su (17:24:18): > Are you compiling packages in parallel via setting options(Ncpus = n) at the moment?

Vladimir Kiselev (17:26:50): > no, I didn’t know about that. The problem might be that I am installing them in a docker image (https://github.com/cellgeni/notebooks-base/blob/acc3a01dc6476c2d2d863e5d3d705d74ac6426e2/Dockerfile#L159) using a free build on quay.io and not sure they provide more than 1 cpu. Will check that, many thanks
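
A minimal sketch of the parallel-install suggestion above. It assumes the relevant option is `Ncpus`, which `install.packages()` reads as the default for its `Ncpus` argument, and that `BiocManager::install()` delegates to `install.packages()`. It falls back to a single job when the core count cannot be detected, as may happen on a single-CPU build host. The package name in the commented-out call is only an illustration.

```r
# Use all but one detected core; detectCores() can return NA on some systems.
cores <- parallel::detectCores()
n_jobs <- if (is.na(cores)) 1L else max(1L, cores - 1L)

# install.packages() reads the default of its 'Ncpus' argument from this option.
options(Ncpus = n_jobs)

# Illustrative only (needs network access), so left commented out:
# BiocManager::install("GenomicFeatures")
```

On a one-CPU builder this sets `Ncpus` to 1, so it is harmless there but also gives no speed-up.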

2019-02-14

Martin Morgan (09:14:01) (in thread): > conda / bioconda have improved quite a bit in the last release; one common source of the problem you indicate above was (is?) mixing conda and ‘standard’ package installations; once you opt for conda you’re stuck with it (convenience, but also lagging in devel, perhaps incomplete coverage, …)

Martin Morgan (09:16:01): > It would be interesting to explore providing pre-built binaries in a bucket / image, and simply attaching the bucket / image to your docker container & updating .libPaths(). @Levi Waldron and / or @Nitesh Turaga might have pointers to this approach

Martin Morgan (14:40:29): > Who knew that R had an official blog? Here’s an interesting (in a highly technical kind of way) post on ‘staged install’ coming in the not too distant future https://developer.r-project.org/Blog/public/2019/02/14/staged-install/index.html.

Levi Waldron (16:01:31) (in thread): > Agreed. I’d like to work on making pre-compiled libraries based on the Docker images available, to be held and updated daily in a google bucket. I could imagine a couple ways to use them: (a) mounting the volume if you’re on the same cloud provider, or (b) maybe an option or alternative to BiocManager::install() that downloads from the binary library if you’re using a valid Docker image?
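
A rough sketch of the "attach a pre-built library and update .libPaths()" idea, under stated assumptions: the directory name is hypothetical, and a temporary directory stands in for a mounted bucket or image so that the snippet runs anywhere. It only shows the R side, not how the bucket would be built or mounted.

```r
# Hypothetical location of a mounted bucket or image holding built packages;
# a temporary directory stands in for it here.
prebuilt_lib <- file.path(tempdir(), "bioc-prebuilt")
dir.create(prebuilt_lib, showWarnings = FALSE, recursive = TRUE)

# Put the pre-built library first so library() searches it before the
# default locations; the existing paths are kept after it.
.libPaths(c(prebuilt_lib, .libPaths()))

# The first entry is now the pre-built library (R normalizes the paths).
first_path <- .libPaths()[1]
```

Packages in such a library would need to have been built for the same R version and platform as the container that attaches it.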

Martin Morgan (16:32:20) (in thread): > I’m not really sure what else install.packages does, other than place the package in a particular installation directory; I think something about collating help pages, which might not be relevant for some use cases (e.g., workflows).

Nitesh Turaga (21:47:55) (in thread): > I think a straightforward solution would be a simple R package, which does three things: > 1. Install packages in a temp folder on a machine using a certain version of R > 2. Create bucket on GCS > 3. Sync temp folder with GCS. > > Ideally, it would aim to make these buckets more maintainable, and in some way run a cron job to just keep updating the buckets automatically. > > I’d be interested to hear both of your thoughts on this.

Levi Waldron (22:04:34) (in thread): > I could imagine two options: 1. k8s with a web hook that triggers an install and sync to the GCS bucket, or 2. Local host cron update (in Docker) and sync using the GCS client. There’s also a choice of whether to install from scratch like that, or just mount the bucket and run update(), which would be much cheaper but require something different at release time. I like the property of 1. of being totally hardware independent.

2019-02-15

Vladimir Kiselev (04:23:36) (in thread): > Thanks all for your replies, interesting discussion. I’ve also been told about docker multi-stage builds, where one can copy binaries from multiple docker images to your current image while building it:https://docs.docker.com/develop/develop-images/multistage-build/ - Attachment (Docker Documentation): Use multi-stage builds > Multi-stage builds are a new feature requiring Docker 17.05 or higher on the daemon and client. Multistage builds are useful to anyone who has struggled to optimize Dockerfiles while keeping…

Vladimir Kiselev (04:24:26) (in thread): > example docker file is here:https://github.com/BioContainers/containers/blob/master/interproscan/5.30-69.0/Dockerfile

Priyanka Raina (06:09:16): > @Priyanka Raina has joined the channel

2019-02-16

Huajin Wang (10:33:31): > @Huajin Wang has joined the channel

2019-02-18

Lukas Weber (05:40:25): > @Lukas Weber has joined the channel

Geet (10:24:43): > @Geet has joined the channel

Martin Morgan (13:07:16): > @channel Several research groups affiliated with Bioconductor received seed funding to develop software for access and analysis of Human Cell Atlas data. We’re having a ‘virtual symposium’ to summarize our “work in progress” on 20 February, 12pm Eastern time. Please feel free to join! > > More information: https://docs.google.com/document/d/1xjUWJ5-WLFyuAmDPQrHNIzD_gkD4IVxquwZlJrdFCG8/edit?usp=sharing

Shian Su (18:05:34): > https://www.timeanddate.com/worldclock/fixedtime.html?msg=Webcast&iso=20190220T12&p1=179&ah=2 for international times

Hani Kim (18:54:51): > @Hani Kim has joined the channel

Diego Diez (21:05:37): > @Diego Diez has joined the channel

2019-02-19

Daniel Huebschmann (03:31:37): > @Daniel Huebschmann has joined the channel

Robert Ivánek (04:09:25) (in thread): > Hi Martin, do you plan to record it? That would be great.

Catalina Vallejos (08:15:03): > @Catalina Vallejos has joined the channel

Martin Morgan (08:56:00) (in thread): > @Robert Ivánek Yes, but this will be the first larger conference call & recording attempt so…

Johnson Zhang (10:18:25): > @Johnson Zhang has joined the channel

Kasper D. Hansen (11:15:26): > I am trying to track down an issue where I have test errors with BiocSingular (everything updated). I am suspecting it is because of some issue with dependencies.

Kasper D. Hansen (11:15:54): > To do so I am testing dependencies using BiocGenerics:::testPackage(). Here is the surprising output:

Kasper D. Hansen (11:16:32): > > > BiocGenerics:::testPackage("DelayedArray") > automatic block size set to 24 bytes (was 1e+08) > automatic block size set to 40 bytes (was 24) > automatic block size set to 100 bytes (was 40) > > <LOTS OF SIMILAR OUTPUT REMOVED> > > automatic block size set to 30000 bytes (was 100) > automatic block size set to 1e+08 bytes (was 30000) > Error in normarg_perm(perm, dim(seed)) : 'perm' must be an integer vector > Error in validObject(.Object) : invalid class "DelayedAperm" object: > 'perm' cannot be an empty vector > Error in validObject(.Object) : invalid class "DelayedAperm" object: > only dimensions equal to 1 can be dropped > Error in validObject(.Object) : invalid class "DelayedAperm" object: > all non-NA values in 'perm' must be >= 1 and <= 'length(dim(a))' > Error in validObject(.Object) : invalid class "DelayedAperm" object: > only dimensions equal to 1 can be dropped > Error in .normalize_dimnames(dimnames, seed_ndim) : > the supplied dimnames must be a list > Error in .normalize_dimnames(dimnames, seed_ndim) : > the supplied dimnames must have one list element per dimension in the > array-like object > Error in .normalize_dimnames(dimnames, seed_ndim) : > the supplied dimnames must have one list element per dimension in the > array-like object > Error in validObject(.Object) : invalid class "DelayedDimnames" object: > each list element in 'x@dimnames' must be NULL, or a character vector > of length the extent of the corresponding dimension, or special value > -1 > Error in validObject(.Object) : invalid class "DelayedDimnames" object: > each list element in 'x@dimnames' must be NULL, or a character vector > of length the extent of the corresponding dimension, or special value > -1 > Error in seed(x) : > seed() is not supported on a DelayedArray object with multiple seeds at > the moment. Note that you can check the number of seeds with nseed(). > You can use 'seedApply(x, identity)' to extract all the seeds as a > list. 
> Error in (function (classes, fdef, mtable) : > unable to find an inherited method for function 'is_noop' for signature '"DelayedNaryIsoOp"' >

Kasper D. Hansen (11:17:32): > continued > > Error in seed(x) : > seed() is not supported on a DelayedArray object with multiple seeds at > the moment. Note that you can check the number of seeds with nseed(). > You can use 'seedApply(x, identity)' to extract all the seeds as a > list. > Error in (function (classes, fdef, mtable) : > unable to find an inherited method for function 'is_noop' for signature '"DelayedNaryIsoOp"' > Error in seed(x) : > seed() is not supported on a DelayedArray object with multiple seeds at > the moment. Note that you can check the number of seeds with nseed(). > You can use 'seedApply(x, identity)' to extract all the seeds as a > list. > Error in (function (classes, fdef, mtable) : > unable to find an inherited method for function 'is_noop' for signature '"DelayedNaryIsoOp"' > Error in seed(x) : > seed() is not supported on a DelayedArray object with multiple seeds at > the moment. Note that you can check the number of seeds with nseed(). > You can use 'seedApply(x, identity)' to extract all the seeds as a > list. > Error in (function (classes, fdef, mtable) : > unable to find an inherited method for function 'is_noop' for signature '"DelayedNaryIsoOp"' > Error in seed(x) : > seed() is not supported on a DelayedArray object with multiple seeds at > the moment. Note that you can check the number of seeds with nseed(). > You can use 'seedApply(x, identity)' to extract all the seeds as a > list. 
> Error in (function (classes, fdef, mtable) : > unable to find an inherited method for function 'is_noop' for signature '"DelayedNaryIsoOp"' > Error in match.fun(OP) : 'NULL' is not a function, character or symbol > Error in match.fun(OP) : > 'list(NULL)' is not a function, character or symbol > Error in get(as.character(FUN), mode = "function", envir = envir) : > object 'not-an-existing-function' of mode 'function' was not found > Error in new_DelayedNaryIsoOp("<=", array(dim = 4:2), array(dim = 2:4)) : > non-conformable array-like objects > Error in normalizeNindex(Nindex, seed) : > 'Nindex' must be a list with one list element per dimension in 'x' > Error in normalizeNindex(Nindex, seed) : > 'Nindex' must be a list with one list element per dimension in 'x' > Error : subscript contains out-of-bounds indices > Error : subscript contains invalid names > Error : subscript contains out-of-bounds ranges > Error : subscript contains out-of-bounds ranges > Error in new_DelayedUnaryIsoOpStack(.TEST_SAS3, NULL) : > 'OPS' must be a list > Error in FUN(X[[i]], ...) : > 'OPS[[1L]]' is not a function, character or symbol > Error in get(as.character(FUN), mode = "function", envir = envir) : > object 'not-an-existing-function' of mode 'function' was not found > automatic block size set to 1e+07 bytes (was 1e+08) > automatic block size set to 1e+08 bytes (was 1e+07) > > > RUNIT TEST PROTOCOL -- Tue Feb 19 11:13:58 2019 > ********************************************************************************************* > Number of test functions: 42 > Number of errors: 0 > Number of failures: 0 > > > 1 Test Suite : > DelayedArray RUnit Tests - 42 test functions, 0 errors, 0 failures > Number of test functions: 42 > Number of errors: 0 > Number of failures: 0 >

Kasper D. Hansen (11:18:14): > Why on earth do I stare at tons of errors and yet the test protocol reports 0 failures / 0 errors?

Kasper D. Hansen (11:22:26): > anyway, deleting everything and starting from scratch, so I won’t be able to reproduce anything

Aaron Lun (11:31:54): > FYI, the gobbledy gook from > > automatic block size set to 24 bytes (was 1e+08) > automatic block size set to 40 bytes (was 24) > automatic block size set to 100 bytes (was 40) > > is due to block resizing during matrix multiplication to get enough and/or correctly-sized chunks sent off to each worker.

Philipp Wahle (11:38:23): > @Philipp Wahle has joined the channel

Adonis Cedeno (11:48:17): > @Adonis Cedeno has joined the channel

Kasper D. Hansen (11:52:59): > Actually I get the same output with a fresh installation in R-devel and Bioc devel

Kasper D. Hansen (11:54:06): > I removed all packages, by nuking the site-library I had (where everything apart from recommended packages resides). Then ran BiocManager::install() and repeated the testing

Kasper D. Hansen (11:54:55): > By nuking I mean rm -Rf

Aaron Lun (12:08:35): > Yeah, that looks pretty messed up. What’s the Rdevel revision?

Kasper D. Hansen (12:10:16): > > R Under development (unstable) (2019-02-18 r76122) -- "Unsuffered Consequences" > Copyright (C) 2019 The R Foundation for Statistical Computing > Platform: x86_64-apple-darwin18.2.0 (64-bit) >

Luiz Antonio de Jesus Rocha (12:13:20): > @Luiz Antonio de Jesus Rocha has joined the channel

Aaron Lun (14:14:48): > I can confirm that I see the same errors. Latest Rdevel r76128, HEAD of DelayedArray on Github (edacd94).

Aaron Lun (14:17:23): > Though having looked at these closer, they are probably not errors, but deliberately triggered exceptions.

Aaron Lun (14:17:41): > That’s why they don’t get reported as errors by RUnit, because they are wrapped in checkException.

Aaron Lun (14:18:33): > It is a bit misleading, though. testthat’s less noisy about it.
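
To illustrate why expected errors can be printed while the test still passes, here is a base-R sketch. `check_exception()` and `bad_perm()` are hypothetical stand-ins written for this example, not RUnit's or DelayedArray's actual code. The assumption is that RUnit's `checkException()` behaves similarly: it evaluates the expression inside `try()`, which prints the error message unless told to be silent, and the check succeeds precisely because an error occurred.

```r
# Hypothetical stand-in for an "expect an error" check; not RUnit's own code.
check_exception <- function(expr, silent = FALSE) {
  # try() forces the lazily evaluated argument; on error it prints the
  # message to stderr unless silent = TRUE, then returns a "try-error".
  result <- try(expr, silent = silent)
  if (!inherits(result, "try-error")) {
    stop("expected an error, but the expression succeeded")
  }
  invisible(TRUE)
}

# Hypothetical function under test that rejects non-integer input.
bad_perm <- function(perm) {
  if (!is.integer(perm)) stop("'perm' must be an integer vector")
  perm
}

noisy <- check_exception(bad_perm("a"))                 # error text is printed
quiet <- check_exception(bad_perm("a"), silent = TRUE)  # nothing is printed
```

Both calls count as passing checks; only the first one leaves an "Error in ..." line in the output, which is the kind of noise seen in the log above.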

Shian Su (17:11:00): > As a side note, how do people cleanly manage release and devel versions of BioC and R on the same machine?

Nitesh Turaga (17:11:27): > One very easy way is to just use Docker images

Nitesh Turaga (17:11:53): > No installation needed for R. Just install docker, and you are set.

Nitesh Turaga (17:12:33): > http://bioconductor.org/help/docker/

Shian Su (17:14:59): > Thanks! Looks like I finally have to learn Docker. Any non-sudo suggestions?

Nitesh Turaga (17:15:23): > Docker is the ultimate non-sudo suggestion:smile:

Nitesh Turaga (17:15:37): > If you had to install R yourself, it would be a lot more hassle.

Shian Su (17:17:06): > I was under the impression I needed sudo to use Docker. Can I just grab and mount an image without it?

Nitesh Turaga (17:18:06): > Do you have docker installed on your machine for your user?

Shian Su (17:20:07): > I can organise single time permission from IT, and I’m happy if I don’t need sudo after initial setup.

Fah Sathirapongsasuti (17:23:44): > @Fah Sathirapongsasuti has joined the channel

Martin Morgan (17:47:11) (in thread): > I build my R’s from source; they are installed under ~/bin/R-3-8-branch/, ~/bin/R-devel, etc. I maintain separate libraries for each Bioc release. The default R library contains only base and recommended packages, everything else is in an R / Bioc-version specific library. Maybe this alias provides some hints > > $ alias bioc-3.9 > alias bioc-3.9='R_LIBS_USER=~/Library/R/3.6/Bioc/3.9 ~/bin/R-devel/bin/R --no-save --quiet' > > In R, .libPaths() contains as the first directory the version-specific location where packages will be installed; the second directory contains base packages.

Shian Su (17:51:35) (in thread): > This along with a distinct Rprofile sounds like it should be sufficient and straightforward.
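
One possible shape for the "distinct Rprofile" part, as a sketch only: a small helper that builds a library path following the layout in the alias above. The function name and the default root are assumptions made for this example; the commented-out lines show how a version-specific .Rprofile might use it.

```r
# Build a path of the form <root>/<R major.minor>/Bioc/<Bioc version>.
# The function name is hypothetical; the layout mirrors the alias above.
bioc_lib_path <- function(bioc_version,
                          r_version = paste(R.version$major,
                                            sub("\\..*$", "", R.version$minor),
                                            sep = "."),
                          root = "~/Library/R") {
  file.path(root, r_version, "Bioc", bioc_version)
}

lib <- bioc_lib_path("3.9", r_version = "3.6")

# In a version-specific .Rprofile one might then create and use it:
# dir.create(lib, recursive = TRUE, showWarnings = FALSE)
# .libPaths(c(lib, .libPaths()))
```

Setting `R_LIBS_USER` in the shell, as in the alias, achieves the same effect without touching the profile.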

Kasper D. Hansen (19:17:49) (in thread): > when Bioc-devel goes with R-devel I just have an R-devel and a R version installed. In the other 6 months I have (say) R-3.5 and R-3.5.x. The “x” signifies a newer Bioc

Kasper D. Hansen (19:18:45): > Ok, thanks @Aaron Lun if these are indeed expected exceptions. The output is very confusing though

Nitesh Turaga (20:06:00): > @Shian Su http://ropenscilabs.github.io/r-docker-tutorial/

2019-02-20

Mike Smith (05:19:52) (in thread): > Can you integrate them both with RStudio? I always end up predominantly working in whichever version I’ve currently got first in my $PATH and then using the command line to explicitly run the other for R CMD check etc

Kellie Kravarik (11:35:24): > @Kellie Kravarik has joined the channel

Wanding Zhou (12:09:11): > @Wanding Zhou has joined the channel

2019-02-21

Dario Righelli (06:06:17) (in thread): > For R-studio, on Ubuntu I created an R-studio-devel “icon” which automatically runs the installed R-devel version.

Dario Righelli (06:07:31) (in thread): > So, if I need R (stable) I use the classic Rstudio, otherwise the Rstudio-devel

Martin Morgan (06:09:26) (in thread): > The linux / macOS solution I use is to create rstudio aliases, e.g., > > $ alias rstudio-3.9 > alias rstudio-3.9='RSTUDIO_WHICH_R=~/bin/R-devel/bin/R R_LIBS_USER=~/Library/R/3.6/Bioc/3.9 /Applications/RStudio.app/Contents/MacOS/RStudio' >

Dario Righelli (06:18:25) (in thread): > I made a similar solution by creating an sh script within (sorry don’t know how to put code here): > —- > #!/bin/bash > > export RSTUDIO_WHICH_R=/usr/local/lib/R-devel/bin/R > > rstudio > —- > > And then creating an ubuntu launcher which runs this sh script.

Martin Morgan (06:47:24) (in thread): > use triple back tick ‘fences’ for code (“`” instead of “-”)

Jianhong (08:20:15) (in thread): > maybe you want to give Singularity a try. But in my experience, lots of Docker images cannot be directly converted into Singularity images.

Dario Righelli (08:43:42) (in thread): > > almost the same as R-markdown! > > thanks @Martin Morgan

Steve Tsang (13:44:59): > @Steve Tsang has joined the channel

2019-02-23

Levi Waldron (17:19:14) (in thread): > I’ve made Docker and Singularity images based on the Bioconductor devel and release Docker images with a bunch of system dependencies added, with a shell script and aliases to make it easier to get started, at GitHub.com/waldronlab/bioconductor_devel. Yes Singularity is likely what you can get on a cluster or other administered computer.

2019-02-24

Shian Su (17:30:05) (in thread): > Awesome, thanks Levi!

Ning Leng (23:26:02): > @Ning Leng has joined the channel

2019-02-27

Veer Singh Marwah (11:03:11): > @Veer Singh Marwah has joined the channel

2019-03-05

john (10:26:48): > @john has joined the channel

john (10:33:49): > Seeking free-lance bioanalytic programmer (New York or Boston area preferred)

Levi Waldron (17:34:28) (in thread): > I am moving towards just using docker to run devel and release side by side, gradually working out the kinks that appear in routine usage… I’m getting there.

Levi Waldron (17:36:01) (in thread): > (and I’ll try to make it single-command easy to use, stay tuned…)

Ana Beatriz Villaseñor Altamirano (20:55:51): > @Ana Beatriz Villaseñor Altamirano has joined the channel

Alejandro Reyes (21:01:28): > @Alejandro Reyes has joined the channel

César Miguel Valdez Córdova (21:09:50): > @César Miguel Valdez Córdova has joined the channel

Joselyn Chávez (21:54:02): > @Joselyn Chávez has joined the channel

2019-03-06

Ben Story (06:44:50): > @Ben Story has joined the channel

Hyun-Hwan Jeong (15:44:53): > @Hyun-Hwan Jeong has joined the channel

2019-03-11

Johannes Rainer (01:05:19): > @Johannes Rainer has joined the channel

mirna (16:58:46): > @mirna has joined the channel

2019-03-12

Lambda Moses (01:23:47): > @Lambda Moses has joined the channel

Norman Ding (21:33:41): > @Norman Ding has joined the channel

2019-03-13

Charlotte Soneson (03:43:21): > In case someone here is interested in a general, end-to-end bulk RNA-seq (snakemake-based) workflow with strong Bioc focus (for example, all data and results saved in SingleCellExperiment objects), you could check out our new ARMOR :slightly_smiling_face:. Comes with a small dataset that can be used for teaching. Of course, feedback is welcome. https://github.com/csoneson/armor. Preprint: https://www.biorxiv.org/content/10.1101/575951v1

Sean Davis (13:59:49): > I had made a note to myself to address the use of ORCIDs (https://orcid.org/) for authors in DESCRIPTION files. It looks like including ORCID is now recommended by CRAN when applicable. Are folks doing this for Bioc packages? If not, should we make an attempt to encourage them? See last item on this checklist: https://cran.r-project.org/web/packages/submission_checklist.html - Attachment (cran.r-project.org): Checklist for CRAN submissions > Checklist for CRAN submissions

Federico Marini (14:44:27): > I am putting mine in the authors field - as of now it renders quite raw on the Bioc pages

Federico Marini (14:44:37): > but it would be nice to have it as on CRAN

Kasper D. Hansen (16:09:23): > I think it makes sense to try to support this. Anything involved in better / easier credit for authors is a good idea
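
For reference, a sketch of how an ORCID can be attached to an author entry, assuming the `Authors@R` field of DESCRIPTION is used and holds a `person()` call with a named `comment`. The author shown is a placeholder: Josiah Carberry is the fictitious researcher used in ORCID's own examples, and the email address is made up.

```r
# Placeholder author entry; in DESCRIPTION this call would follow "Authors@R:".
author <- person(
  given = "Josiah", family = "Carberry",
  email = "josiah.carberry@example.org",   # made-up address
  role = c("aut", "cre"),
  comment = c(ORCID = "0000-0002-1825-0097")
)

# The identifier is stored as a named element of the 'comment' field.
orcid <- author$comment[["ORCID"]]
```

How the identifier is rendered on a package landing page is up to the site that builds the page.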

2019-03-14

Vince Carey (12:52:36) (in thread): > Very interesting @Charlotte Soneson. I am wondering about the strength of commitment to snakemake. Through the AnVIL project we have started to work with WDL and CWL, as workflow languages that are supported at dockstore.org, which is GA4GH-endorsed. Also, do you envision a Bioconductor workflow package emerging from an ARMOR run – perhaps along the lines of @Aaron Lun’s simpleSingleCell workflow?

Charlotte Soneson (13:16:34) (in thread): > Thanks@Vince Carey. For snakemake - we chose it since we were familiar with it, since it integrates nicely with conda environments (and has lots of other nice features) and since it’s reasonably widespread. I haven’t looked in detail at other workflow languages, but I’ll check it out. As for the workflow, if there is an interest in having an additional one (in addition to the already existing bulk DGE and DTU ones), we could consider that. However, I’m not sure it would be the right form - the simpleSingleCell workflow is all R-based whereas the R parts of ARMOR are kind of covered by the existing workflows (apart from the iSEE visualization), and we don’t actually run them from within R.

Vince Carey (13:47:16) (in thread): > I understand. Snakemake is popular in our lab too.

2019-03-15

Lluís Revilla (04:56:22): > I added the ORCID; it would be nice to have it rendered with the logo in Bioconductor too

Lori Shepherd (07:15:27): > I’ll look into the rendering of the landing pages; I’ve updated the package guidelines to include it: http://bioconductor.org/developers/package-guidelines/#description

Sean Davis (07:38:29): > Thanks,@Lori Shepherd, for updating the package guidelines. The ORCID instructions there look good to me.

Lori Shepherd (07:39:24): > I’ll look into the rendering on the landing pages soon

2019-03-17

gamzeaydilek (07:12:04): > @gamzeaydilek has joined the channel

2019-03-22

Joan (09:20:49): > @Joan has joined the channel

2019-03-24

Yoon, Tae-Hyun (02:53:59): > #introductions

2019-03-28

bioinfodirtyjobs (06:43:59): > @bioinfodirtyjobs has joined the channel

Thomas Schwarzl (11:45:58): > @Thomas Schwarzl has joined the channel

2019-03-31

Helena L. Crowell (02:37:32): > @Helena L. Crowell has joined the channel

MuthannaWaeli (07:04:27): > @MuthannaWaeli has joined the channel

2019-04-02

saskia (02:05:54): > @saskia has joined the channel

Dario Strbenac (07:03:44): > @Dario Strbenac has joined the channel

2019-04-03

Levi Mangarin (10:45:27): > @Levi Mangarin has joined the channel

KeLiu (13:47:00): > @KeLiu has joined the channel

Tao Liu (16:51:08): > @Tao Liu has joined the channel

2019-04-04

Lori Shepherd (10:57:29): > @channel The ORCIDs are now displayed on the landing pages as a link to the site

Lluís Revilla (12:32:59): > As a minor note, on CRAN the parentheses around the ORCID logo disappear

Lluís Revilla (12:33:12): > if it is not too complicated it would look nicer :sweat_smile:

Sean Davis (13:27:14) (in thread): > Wow,@Lori Shepherd. Nine fast parrots and two conga parrots. I think that might be a new record.:smile:

Stephanie Hicks (14:57:45) (in thread): > i’ll make that 10 fast parrots!

Ruoxi Liu (21:28:49): > @Ruoxi Liu has joined the channel

2019-04-05

Dave Tang (02:46:03): > @Dave Tang has joined the channel

Lori Shepherd (09:16:45) (in thread): > Should be nicer in next build of the website - think i removed the ugly ()

Lluís Revilla (09:26:47) (in thread): > Great! Thanks for the changes!:ok_hand:

2019-04-06

Joan Gibert (13:25:13): > @Joan Gibert has joined the channel

FeiZhao (20:15:37): > @FeiZhao has joined the channel

2019-04-09

Sean Davis (09:36:49): > The Bioconductor F1000Research Gateway call-for-submissions is open for this year. The “soft” deadline is July 5 to allow folks presenting workshops or other material at the conference (or not) to incorporate feedback from the conference. https://f1000research.com/gateways/bioconductor/about-this-gateway

Craig (11:58:35): > @Craig has joined the channel

Kevin Stachelek (18:06:46): > @Kevin Stachelek has joined the channel

2019-04-10

Sridhar N (12:05:47): > @Sridhar N has joined the channel

Hervé Pagès (17:05:30) (in thread): > 15 fast parrots today (just added mine)

2019-04-13

Martin Morgan (19:25:16): > I changed the ‘name’ of this channel to ‘community-bioc’, to match the url; I’m not sure whether that breaks all kinds of things, and apologize if it does…

2019-04-15

Tiago Lubiana (10:08:28): > @Tiago Lubiana has joined the channel

Almut (10:59:21): > @Almut has joined the channel

Jon Bråte (12:23:30): > @Jon Bråte has joined the channel

Michael Love (19:14:12): > Cool presentation on Bioc essentials https://twitter.com/bioconductor/status/1117790518958796802?s=21 - Attachment (twitter): Attachment > #rstats / @Biocondcutor #biomenbf2019 #biomenbf Bioconductor presentation at https://docs.google.com/presentation/d/e/2PACX-1vS25aIVGDGCAJMsG1VNIxKaUbU7tsefGwMzAIxBLtUJG8ZO44m0zjYRvajli18xSbzwCgFrU_Zlb9kq/pub?start=false&loop=false&delayms=3000&slide=id.p

2019-04-17

Mitch (02:32:07): > @Mitch has joined the channel

Zhi Yang (18:06:28): > @Zhi Yang has joined the channel

2019-04-18

Yoon, Tae-Hyun (02:27:43): > #bioc2019

Yoon, Tae-Hyun (02:29:19): > #biocasia

Vince Carey (07:19:05): > Is anyone interested in discussing how to limit the verbosity of package startup messages? I know about suppressPackageStartupMessages and frequently use it in vignettes. It is common to have a few screensful of startup message shoot by when a package is attached. It would often be just as useful to have a single line: “x/y packages newly attached/loaded; use sessionInfo() for details.” I have a version of library() in my .Rprofile so that > > > library(MultiAssayExperiment) > 13/8 packages newly attached/loaded, see sessionInfo() for details. > > library(DESeq2) > 1/53 packages newly attached/loaded, see sessionInfo() for details. > > Information on conflicts is lost and perhaps should be concisely summarized.

Vince Carey (07:31:48): > FWIW https://gist.github.com/vjcitn/9cb7373c5fe2ee8d514260e5b9dd910c has the code
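
For readers who do not follow the link, here is an independent sketch of the idea. It is not the code from the gist. The function name `quiet_library()` is hypothetical, and the approach (compare `search()` and `loadedNamespaces()` before and after attaching) is one assumed way to obtain the two counts. Conflict information is not captured.

```r
# Hypothetical wrapper: attach a package quietly and report a one-line summary.
quiet_library <- function(pkg) {
  attached_before <- search()
  loaded_before <- loadedNamespaces()

  # Suppress the usual startup chatter while attaching.
  suppressPackageStartupMessages(library(pkg, character.only = TRUE))

  n_attached <- length(setdiff(search(), attached_before))
  n_loaded <- length(setdiff(loadedNamespaces(), loaded_before))
  message(sprintf(
    "%d/%d packages newly attached/loaded, see sessionInfo() for details.",
    n_attached, n_loaded
  ))
  invisible(c(attached = n_attached, loaded = n_loaded))
}

# A base package is used here so the sketch runs without extra installs.
res <- quiet_library("tools")
```

Unlike the version described above, this wrapper takes the package name as a string rather than masking `library()` itself.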

Vince Carey (07:56:12): > Apropos conflicts, how would one summarize > > > conflicted::conflict_scout() > 0 conflicts > > library(MultiAssayExperiment) > 13/8 packages newly attached/loaded, see sessionInfo() for details. > > conflicted::conflict_scout() > 166 conflicts: > * `%in%` : [S4Vectors] > * `aggregate` : [S4Vectors] > * `AIC` : [stats4] > * `anyDuplicated` : [BiocGenerics] > * `anyMissing` : matrixStats, Biobase > ... >

Kasper D. Hansen (08:53:28): > I agree we (R) need to do something about the startup messages. They are too verbose and are just being ignored by users

Kasper D. Hansen (08:53:37): > and too intimidating for new users

Federico Marini (09:01:00): > Your proposal is quite nice@Vince Carey

Federico Marini (09:01:33): > I don’t know the process for getting it into the core code of R, but I would not dislike it at all!

Martin Morgan (09:15:11): > maybe capture the messages and provide a way to retrieve them? replacing sessionInfo() above with startupMessages()?

Vince Carey (09:47:10): > Yes, that sounds like a project. It would be good to stem the loss of information in my approach. Does startupMessages() exist? I don’t see it, and I wonder where the information reported is recorded by R (if it is in fact recorded as opposed to simply dumped).

Martin Morgan (09:57:24): > startupMessages() is just a twinkle in the universe, it does not yet exist. I think it would have to be captured using withCallingHandlers() and a (package) global variable of some sort > > .messages <- local({ > value <- NULL > list(append = function(message) { > value <<- c(value, message) > }, get = function() { > cat(value, sep="\n") > }) > }) > > startupMessages <- .messages$get > > library <- function(...) { > withCallingHandlers({ > base::library(...) > }, packageStartupMessage = function(m) { > .messages$append(conditionMessage(m)) > invokeRestart("muffleMessage") > }) > } >

Federico Marini (10:05:12): > Maybe the tidyverse team could be already working on such a thing/thinking of doing it?

Federico Marini (10:05:27): > Recently they did a major rewrite of the traceback system

Marko Zecevic (10:29:38): > @Marko Zecevic has joined the channel

2019-04-19

Aaron Lun (20:19:23): > @Martin Morgan I don’t suppose you have any neat hacks to make FastqStreamer + yield run faster?

Aaron Lun (20:19:35): > e.g., if we don’t need the qualities.

2019-04-20

Martin Morgan (03:33:26) (in thread): > I don’t think there’s anything exposed; using a large readerBlockSize (# bytes read at a time) is obviously important. Also the slowest part is parallelized using OpenMP with a mechanism for determining number of threads (an internal call to .set_omp_threads) in ?FastqStreamer, which of course requires OpenMP to be available.

Aaron Lun (03:39:14) (in thread): > Okay, thanks. It looks like this is auto-set to max out the threads, so I presume that no developer action is required to take advantage of this - other than ensuring openMP is available.

Aaron Lun (03:40:15) (in thread): > I don’t know whether loading the qualities (and possibly names, for some files with long names) would take up a lot of time, but if it does, one might consider an option to skip them if only the sequences are of interest.
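
A small sketch of streaming a FASTQ file with an explicit `readerBlockSize`, assuming the ShortRead package is installed (the code is skipped otherwise). The two-read file is made up for the example, and the block size shown is only illustrative; suitable values depend on the file and the machine.

```r
# Write a tiny made-up FASTQ file so the sketch is self-contained.
fastq_path <- tempfile(fileext = ".fastq")
writeLines(c("@read1", "ACGT", "+", "IIII",
             "@read2", "GGCC", "+", "IIII"), fastq_path)

n_reads <- NA_integer_
if (requireNamespace("ShortRead", quietly = TRUE)) {
  # n = records per chunk; readerBlockSize = bytes read from disk at a time.
  streamer <- ShortRead::FastqStreamer(fastq_path, n = 1e6,
                                       readerBlockSize = 1e8)
  n_reads <- 0L
  repeat {
    chunk <- ShortRead::yield(streamer)   # next chunk of reads
    if (length(chunk) == 0L) break        # an empty chunk means end of file
    n_reads <- n_reads + length(chunk)
  }
  close(streamer)
}
```

Each chunk still carries qualities and read names; as noted in the thread, no option to skip them appears to be exposed.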

Dario Strbenac (22:30:05): > Should I add this feature request to VariantAnnotation’s issue tracker? > > I wonder if set operation functions for objects created by readVcf could be incorporated in VariantAnnotation? For example, union and intersection. It is a bit longer than simply using such functions on the rowRanges of the object, because different ALT alleles need to be considered separately, so a standard, tested implementation might be useful. For example, intersection(DStissueSNVs, DScellCultureSNVs).

Martin Morgan (22:43:53) (in thread): > Tested pull requests would certainly be welcome!

Dario Strbenac (22:55:40) (in thread): > I might try it in the next couple of months.
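
The reason ALT alleles complicate the set operations requested above can be shown on toy data. This is a sketch in base R, not VariantAnnotation code: the data frames and the `variant_keys()` helper are hypothetical, and it assumes both call sets already represent each variant in the same normalized form.

```r
# Toy call sets; a real VCF would be read with VariantAnnotation.
tissue <- data.frame(chrom = c("chr1", "chr1", "chr2"),
                     pos   = c(100L, 200L, 300L),
                     ref   = c("A", "G", "T"),
                     alt   = c("C,T", "A", "G"),
                     stringsAsFactors = FALSE)
culture <- data.frame(chrom = c("chr1", "chr1", "chr2"),
                      pos   = c(100L, 200L, 300L),
                      ref   = c("A", "G", "T"),
                      alt   = c("T", "C", "G"),
                      stringsAsFactors = FALSE)

# One key per (site, ALT allele), splitting multi-allelic records first.
variant_keys <- function(v) {
    alts <- strsplit(v$alt, ",", fixed = TRUE)
    n <- lengths(alts)
    paste0(rep(v$chrom, n), ":", rep(v$pos, n), "_",
           rep(v$ref, n), "/", unlist(alts))
}

shared <- intersect(variant_keys(tissue), variant_keys(culture))
either <- union(variant_keys(tissue), variant_keys(culture))
```

All three positions overlap, yet only two variants are shared: at chr1:200 the two call sets report different ALT alleles, which a range-only intersection would miss.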

2019-04-21

Tom Gleeson (06:28:44): > @Tom Gleeson has joined the channel

2019-04-22

Vince Carey (10:20:42): > I am finding that devtools::document() in the current rhdf5client source throws a > > Error in getDLLRegisteredRoutines.DLLInfo(dll, addNames = FALSE) : > must specify DLL via a "DLLInfo" object. See getLoadedDLLs() > > There is a workaround via preceding the call to devtools::document() with a pkgbuild::compile_dll(). Is there some method of dll registration or roxygen coding that we are missing?

2019-04-23

Aaron Lun (01:49:59): > I don’t get these problems if I install the package from the directory, i.e., R CMD INSTALL rhdf5client (or wherever your source directory is), followed by devtools::document() within the directory. I would guess that document() is trying to load the shared library from the same directory, and failing if you haven’t tried to compile it at least once by trying to install the package from source.

darlanminussi (12:47:03): > @darlanminussi has joined the channel

Michael Love (19:05:48): > This thread is a case study in why to use accessor functions https://twitter.com/claire__malley/status/1120748364461744128?s=21 - Attachment (twitter): Attachment > object@raw.data and object@data are now object@assays$RNA@counts, object@assays$RNA@data. :skull: or they want you to use GetAssayData(object = object, slot = "counts").

Michael Love (19:06:01): > I feel for both sides

Aaron Lun (19:07:19): > Or they could have just joined us with SingleCellExperiment.

Michael Love (19:11:22): > Boom.

Aaron Lun (19:12:03): > @Davide Risso did offer.

Michael Love (19:12:21): > I think I knew that

Michael Love (19:13:32): > Well here, they could still learn from Bioc and generate an update function

Michael Love (19:14:10): > https://www.rdocumentation.org/packages/BiocGenerics/versions/0.18.0/topics/updateObject

Michael Love (19:14:42): > Does anyone know them? Point them to the updateObject code for SE?

Aaron Lun (19:15:05): > Well, I know Rahul somewhat well. Less so for the developers doing the grunt work.

Aaron Lun (19:15:29): > But I don’t depend on them, and they don’t depend on me, so I keep my nose out of it by and large.

Aaron Lun (19:16:21): > You might say that’s a pretty mercenary attitude. And you’d be right.

Aaron Lun (19:16:42): > Didn’t go to SF just for the weather.

2019-04-24

Davide Risso (03:42:02) (in thread): > That’s good since summer is about to begin…

2019-04-25

Jake Taylor-King (06:28:06): > @Jake Taylor-King has joined the channel

Brendan Innes (19:36:14) (in thread): > I imagine they do have an update function; they did between v1 -> v2. Doesn’t really help with backwards compatibility for existing code, but it’s something.

2019-04-26

Michael Love (15:05:56) (in thread): > oh i see. yeah breaking code is only solved with accessors (that you don’t change)
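
The point about accessors can be illustrated with a small S4 sketch. The classes `ToyV1` and `ToyV2`, the generic `toyCounts()`, and the converter `upgradeToy()` are all hypothetical names invented for this example; they only mimic the kind of slot move described in the tweet and the role an updateObject()-style function would play.

```r
library(methods)

# Version 1 of a hypothetical container keeps counts in a top-level slot.
setClass("ToyV1", representation(raw.data = "matrix"))
# Version 2 moves them inside a nested list, breaking `object@raw.data`.
setClass("ToyV2", representation(assays = "list"))

# The accessor is the stable contract; only its methods know the layout.
setGeneric("toyCounts", function(object) standardGeneric("toyCounts"))
setMethod("toyCounts", "ToyV1", function(object) object@raw.data)
setMethod("toyCounts", "ToyV2", function(object) object@assays$RNA$counts)

# An updateObject()-style converter from the old layout to the new one.
upgradeToy <- function(old) {
    new("ToyV2", assays = list(RNA = list(counts = old@raw.data)))
}

m <- matrix(1:4, nrow = 2)
old <- new("ToyV1", raw.data = m)
upgraded <- upgradeToy(old)
```

User code that calls `toyCounts()` keeps working across both versions, whereas code that reaches into `@raw.data` breaks as soon as the slot moves.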

2019-04-29

Simina Boca (19:20:27): > When sending an email to bioc-devel is it not OK to include screenshots?

Simina Boca (19:20:43): > I sent one on Friday and got an automated reply back “The message’s content type was not explicitly allowed”

Simina Boca (19:21:21): > Wasn’t sure if that referred to the screenshot of the git error that I included or if it was something else (not sure what else it could be though)

Martin Morgan (19:27:53): > The attachment types accepted by the bioc-devel mailing list are pretty weird; the best bet is to send simple plain-text messages without attachments.

Simina Boca (19:29:23): > Thank you!

Simina Boca (19:29:47): > I could swear I used screenshots before but maybe I didn’t

2019-04-30

Anna Quaglieri (23:26:43): > @Anna Quaglieri has joined the channel

2019-05-01

dave_sevenbridges (17:04:50): > @dave_sevenbridges has joined the channel

dave_sevenbridges (17:05:58): > Hello everyone. Thank you for inviting me into this Bioconductor community. I am a community engagement ~~~manager~~~ at Seven Bridges Genomics.

2019-05-02

Sean Davis (06:37:32): > Hi,@dave_sevenbridges. Good to hear from you.

Kasper D. Hansen (09:33:35): > Looking forward to being managed

dave_sevenbridges (10:03:13): > yeah the “manager” part of the job title is not ideal because I don’t actually manage anyone. I just started at SB in April so maybe they will let me modify it.

Kasper D. Hansen (10:08:57): > It seems to be a common title (I have heard it before) in companies. I am just making fun of it.

dave_sevenbridges (10:19:18): > is there a channel for BioC 2019?

Marcel Ramos Pérez (10:19:34): > Yes, #bioc2019

2019-05-05

Kevin Wang (01:06:19): > Hi, does anyone have an idea of when the R3.6 branch of Bioconductor Docker will be released? Thanks!

Firas (10:55:42): > @Firas has joined the channel

2019-05-06

Luka (10:55:38): > @Luka has joined the channel

Vince Carey (12:08:47): > @Nitesh Turaga do you have a comment on R3.6-based docker availability? Thanks

Nitesh Turaga (12:15:52): > If you are talking about the bioconductor_full images, I’m hoping to get that done within the coming few days. If it is regarding bioc_docker images, maybe @Lori Shepherd has a better idea of when they can happen.

Lori Shepherd (12:31:41): > I’ll be working on this the next few days

Kevin Rue-Albrecht (14:20:56): > Unrelated: what is going on with the ExperimentHub on devel? :grimacing: http://bioconductor.org/checkResults/devel/bioc-LATEST/ExperimentHub/malbec1-buildsrc.html > > unknown key 'EH166' >

Martin Morgan (14:24:01): > The hubs seem to have a problem in devel; @Lori Shepherd is aware of this

Lori Shepherd (14:25:04): > Yes I’m looking into the issues with both AnnotationHub and ExperimentHub in devel …

Kevin Rue-Albrecht (14:30:42): > Ok. Good luck and thanks! :+1:

Haibo Liu (15:20:51): > @Haibo Liu has joined the channel

2019-05-08

Selvi G (09:35:09): > @Selvi G has joined the channel

Kevin Rue-Albrecht (18:00:04) (in thread): > Thanks again!

2019-05-11

Martin Morgan (13:15:28): > Is sparklyr the choice for using R in Spark? Is there a containerized deployment of Spark + R? Tagging @Sean Davis just in case…

Sean Davis (13:54:07): > sparklyr is for Spark in R. SparkR is the R API for R in Spark.

Sean Davis (13:56:16): > For many purposes, sparklyr will probably suffice. It also provides functionality to drop back to Scala if needed.

Sean Davis (14:55:01): > Spark, as a high-level processing suite, uses the DataFrame as its data structure. That maps nicely to tidy dataset approaches; sparklyr maps the underlying Spark DataFrame operations to familiar tidy R idioms.

Sean Davis (15:07:22): > I’ll add that the various APIs (scala, python, R, …) are not all feature-complete and the helpers that are included vary as well. The R API is the least rich right now.

2019-05-13

Dario Strbenac (03:15:15) (in thread): > It turns out that it’s remarkably complicated because there are multiple correct ways to represent a variant https://www.nature.com/articles/s41587-019-0054-x/figures/2

Corina Lesseur (10:46:42): > @Corina Lesseur has joined the channel

2019-05-14

Mamie Wang (13:51:12): > @Mamie Wang has joined the channel

2019-05-15

Stephanie Hicks (10:12:34): > https://twitter.com/joachimgoedhart/status/1128399392090857473?s=21 - Attachment (twitter): Attachment > #Preprint #Protip: > When you include an image on the first page of the pdf submitted to @biorxivpreprint it’s shown on the webpage right under the abstract - examples: > https://www.biorxiv.org/content/10.1101/578575v1 > https://www.biorxiv.org/content/10.1101/160374v1 https://pbs.twimg.com/media/D6jhQV4XkAY-vys.jpg

Martin Morgan (17:23:52): > Chan Zuckerberg open source software support https://twitter.com/cziscience/status/1128693937130991623 - Attachment (twitter): Attachment > Open source software is :key: to science. Many of the packages, libraries + applications crucial to biomedicine are built by researchers who volunteer their time + effort to make these tools available. We’re excited to announce support for #opensource tools https://bit.ly/2Q4G1zR

Dario Strbenac (20:00:28): > It’s nice that they put software maintenance as a key aspect, not just developing more and more new methods.

Kasper D. Hansen (20:13:46): > Agreed

Krithika Venkataraman (21:53:03): > @Krithika Venkataraman has joined the channel

2019-05-16

Martin Morgan (11:44:14): > Bioconductor has for many years had an ad hoc Technical Advisory Board to provide guidance on technical issues related to the project. In February, 2019, we formalized governance of the advisory board. More information is available at https://bioconductor.org/about/technical-advisory-board. > > An important outcome of the updated governance is provision for more transparency and participation by the Bioconductor community. Look for an opportunity to nominate new members to the Board in the coming weeks. Monthly meeting minutes will also be made available (from the link above), starting with our May, 2019 meeting. > > See also the support site NEWS posting https://support.bioconductor.org/p/121130/. > > Martin Morgan > Bioconductor

Aaron Lun (12:16:52): > :+1:

Aaron Lun (12:17:56): > though gender balance could do with some work.

Stephanie Hicks (13:34:51): > Thanks @Martin Morgan!

Kevin Rue-Albrecht (15:00:23) (in thread): > > Grants are not permitted to individuals; only to organizations. > Any idea/experience how that would work roughly?

Kevin Rue-Albrecht (15:01:49) (in thread): > Actually, it gets clearer in one of the subsequent bullet points > > If an application does not come from an organization eligible to receive and distribute funds (e.g., an academic institution), the applicant may designate a fiscal sponsor (e.g., NumFOCUS, Code for Science & Society, or others).

Martin Morgan (16:28:26) (in thread): > Almost all academic grants are to the institution where the ‘grantee’ works, rather than to the grantee, even though the grantee speaks as though it is ‘their’ grant. In some ways this makes a lot of sense – all the responsibility for, e.g., responsible fiscal management is on the institution. So the ‘fiscal sponsor’ would really be relevant only to the truly independent researcher…

Kevin Rue-Albrecht (16:30:44) (in thread): > Riiiiight.. I really need to resurface from the science and attend a grant writing workshop one of these days to pick up a thing or two :sweat_smile:

Kevin Rue-Albrecht (16:31:09) (in thread): > (Almost forgot: Thanks!)

2019-05-17

Astrid Deschenes (15:55:11): > @Astrid Deschenes has joined the channel

2019-05-18

Varun Ramani (06:45:04): > @Varun Ramani has joined the channel

2019-05-20

Assa (05:26:08): > @Assa has joined the channel

Sean Davis (06:44:22): > To provide yet another mechanism for marketing your papers and preprints related to Bioconductor, I started a channel, #papersandpreprints. Post your own work or that of your colleagues.

2019-05-21

Juan Rebollo (17:31:03): > @Juan Rebollo has joined the channel

Jeff Gentry (21:30:09): > @Jeff Gentry has joined the channel

2019-05-22

Martin Morgan (14:23:38): > Only 9 registrations left for the main conference! https://bioc2019.bioconductor.org - Attachment (BioC 2019): BioC 2019: Where Software and Biology Connect > Where Software and Biology Connect. June 24 - 27, New York City, USA.

Sridhar N (17:53:15): > Got mine last week! Hope to meet and learn cool stuff from y’all!

James Ban (21:32:54): > @James Ban has joined the channel

2019-05-23

Kathy Sivils (10:12:03): > @Kathy Sivils has joined the channel

Nitesh Turaga (13:13:30): > http://shortdoi.org/

Nitesh Turaga (13:13:35): > Pretty neat stuff.

Frederick Tan (13:20:10) (in thread): > @Nitesh Turaga Timely for this https://twitter.com/lpachter/status/1131546083249643530 - Attachment (twitter): Attachment > If you think articles should be valued for their content rather than the journals they were published in, use the citation form “Author(s), year, DOI” on slides and omit the journal name.

Nitesh Turaga (13:21:18) (in thread): > Haha! I got it on my feed from Michael Hoffman.

lara mcgrath (14:51:31): > @lara mcgrath has joined the channel

lara mcgrath (14:54:16): > @Martin Morgan just missed registration; is there a waitlist?

Martin Morgan (15:09:14): > hmm there should be! let me see about coordinating that…

lara mcgrath (16:06:41): > thanks!

SM (16:37:35): > @SM has joined the channel

Michael Sierant (17:24:34): > @Michael Sierant has joined the channel

Martin Morgan (19:16:45): > @lara mcgrath a waitlist has been established at https://forms.gle/y7v53HPbw5cfHt556 - Attachment (Google Docs): BioC 2019 Waitlist > The main Bioconductor conference is full. Add your name to this waitlist and we will be in touch with any openings that appear.

2019-05-24

lara mcgrath (08:10:01): > that’s great, thanks!

Pratima Chennuri (13:13:54): > @Pratima Chennuri has joined the channel

Simina Boca (15:36:18): > Would it be possible to add the FAQs https://bioconductor.org/developers/how-to/git/faq/ to the https://bioconductor.org/developers/ page?

Aaron Lun (15:40:02): > What’s wrong with the “git source control” link?

Nitesh Turaga (15:41:30): > is the location not clear? @Simina Boca

Simina Boca (15:41:48): > haha maybe it’s just me

Simina Boca (15:41:59): > I always check question 14 on it, as per Nitesh’s suggestion in a previous query

Simina Boca (15:42:10): > and I have to go through 2 pages to get to it

Simina Boca (15:42:15): > if it’s not an issue for others it’s fine

Simina Boca (15:43:35): > I always have a hard time with package updates :disappointed:

Nitesh Turaga (15:45:37): > Putting it on the https://bioconductor.org/developers/ page would distort the sequence of important links on the developers page. > > We do have a #bioc_git channel, if you want more immediate help with git rather than the bioc-devel mailing list.

Simina Boca (15:46:13): > thanks, will join that!

Martin Morgan (15:52:30): > what about changing the link Git Source Control to Git Source Control & FAQ, with two separate links?

2019-05-27

Justin (15:53:45): > @Justin has joined the channel

Ni Zhao (18:30:49): > @Ni Zhao has joined the channel

2019-05-28

Kirk Reardon (02:53:17): > @Kirk Reardon has joined the channel

Michelle Miron (13:51:19): > @Michelle Miron has joined the channel

Sridhar N (17:59:05): > is there a good paper/tutorial/talk that explains in detail how TMM (edgeR) and rlog (DESeq2) work, with real numbers used in the equations?

Aaron Lun (18:00:48): > What’s wrong with the 2010 GB paper for TMM?

Sridhar N (18:12:11): > nothing, trying to explain this to biologists and first time learners

Sridhar N (18:12:46): > i doubt i am good with presentations and explaining stuff

Dario Strbenac (20:00:14): > If you have experience with the Shiny framework, perhaps you could create a Shiny application that shows an M-A plot and colour the points according to the amount of trimming the user sets with a slider bar and the median line will change position as the amount of trimming changes? Regularised log isn’t easy to explain to people outside of the field. You could just say it’s a “special” log2.

Sridhar N (20:53:30): > haa

Sridhar N (20:53:34): > true i thought about that

Sridhar N (20:53:46): > shiny is a good idea

Shian Su (21:00:16): > gganimate would be a light weight way to show just TMM normalisation on default settings.

Shian Su (21:04:34): > I’m always struggling to explain normalisation and have been meaning to learn gganimate, so there’s a chance I make something this week for this purpose.

Sridhar N (21:04:49): > happy to contribute

Sridhar N (21:04:58): > in any way possible,

Sridhar N (21:05:12): > can create a PR etc on github

Sridhar N (21:05:40): > shiny is of course good but I think gganimate will be a better choice
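
For teaching purposes, the core of TMM can be shown with real numbers in a few lines. This is a deliberately simplified sketch in base R: `tmm_factor()` is a hypothetical helper that trims only on M-values and takes an unweighted mean, whereas edgeR's implementation also trims on A-values and weights genes by precision.

```r
# Simplified, unweighted trimmed mean of M-values for one sample vs a reference.
tmm_factor <- function(sample, ref, trim_m = 0.3) {
    keep <- sample > 0 & ref > 0              # log ratios need positive counts
    m <- log2((sample[keep] / sum(sample)) / (ref[keep] / sum(ref)))
    2^mean(m, trim = trim_m)                  # drop trim_m from each tail
}

ref    <- c(10, 20, 30, 40, 50, 60, 70, 80, 90, 100)
sample <- 2 * ref          # twice the sequencing depth, same composition
f_same <- tmm_factor(sample, ref)

sample2 <- sample
sample2[10] <- 5000        # one highly expressed gene dominates the library
f_skew <- tmm_factor(sample2, ref)
```

With identical composition every M-value is 0, so the factor is 1. In the skewed sample the dominant gene is trimmed away and the remaining genes agree on a factor of 1100/5900, which is the correction for the library size having been inflated by that single gene.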

2019-05-31

Oriol Pavón (13:03:57): > @Oriol Pavón has joined the channel

2019-06-04

Erica Feick (06:53:33): > @Erica Feick has joined the channel

Dario Righelli (08:49:28): > Hi, is there any function/method/approach to retrieve the chromosome lengths, given a genome code (such as mm9)?

Dario Righelli (08:49:55): > and maybe not just for human and mouse species…

Dario Righelli (08:52:52): > The aim is to produce a GRanges object covering the chromosomes of a genome

Dario Righelli (08:53:35): > if I use GenomeInfoDb::Seqinfo() it doesn’t return the lengths, but only the seqnames

Nitesh Turaga (08:55:46): > I think DNAStringSet will give you that information if you are able to cast your entire sequence in that way, > > library(Biostrings) > a <- DNAStringSet( list(chrI=DNAString("ATGCATGCATCG"), chrII=DNAString("ACGTACGT")) ) >

Nitesh Turaga (08:56:21): > If you print out the DNAStringSet object, the width column will give you the lengths.

Nitesh Turaga (08:57:05):

  A DNAStringSet instance of length 2
    width seq                                               names
[1]    12 ATGCATGCATCG                                      chrI
[2]     8 ACGTACGT                                          chrII

Nitesh Turaga (08:58:16): > https://bioconductor.org/packages/release/bioc/vignettes/Biostrings/inst/doc/BiostringsQuickOverview.pdf

Dario Righelli (08:58:33): > that’s cool, but I need the genome… I don’t have a reference string.

Dario Righelli (08:59:05): > I need to subtract a defined GRanges from its reference genome

Dario Righelli (08:59:28): > and store the subtracted regions

Dario Righelli (08:59:36): > and setdiff works properly!

Dario Righelli (08:59:53): > but I need to generalize by retrieving any kind of genome

Dario Righelli (09:00:38): > Maybe biomaRt is the only candidate

Malte Thodberg (09:00:43): > as(SeqinfoForUCSCGenome("mm9"), "GRanges")?

Dario Righelli (09:02:52): > thanks @Malte Thodberg it seems to work! :woman-lifting-weights:

Martin Morgan (09:33:02): > I went to GenomicFeatures::getChromInfoFromUCSC("mm9"), thinking that genomic feature-related annotations should mostly be available from there; there’s also a *FromBiomart() but requiring a bit of biomart knowledge to use…

Ludwig Geistlinger (10:08:08): > Note that you can also directly retrieve the chromosome lengths from a corresponding TxDb package: > > > GenomeInfoDb::seqlengths(TxDb.Mmusculus.UCSC.mm9.knownGene) > chr1 chr2 chr3 chr4 chr5 chr6 > 197195432 181748087 159599783 155630120 152537259 149517037 > chr7 chr8 chr9 chr10 chr11 chr12 > 152524553 131738871 124076172 129993255 121843856 121257530 > chr13 chr14 chr15 chr16 chr17 chr18 > 120284312 125194864 103494974 98319150 95272651 90772031 > chr19 chrX chrY chrM chr1_random chr3_random > 61342430 166650296 15902555 16299 1231697 41899 > chr4_random chr5_random chr7_random chr8_random chr9_random chr13_random > 160594 357350 362490 849593 449403 400311 > chr16_random chr17_random chrX_random chrY_random chrUn_random > 3994 628739 1785075 58682461 5900358 > > assuming that such a package is available for your species of interest: > > > BiocManager::available("^TxDb") > [1] "TxDb.Athaliana.BioMart.plantsmart22" > [2] "TxDb.Athaliana.BioMart.plantsmart25" > [3] "TxDb.Athaliana.BioMart.plantsmart28" > [4] "TxDb.Btaurus.UCSC.bosTau8.refGene" > [5] "TxDb.Celegans.UCSC.ce11.ensGene" > [6] "TxDb.Celegans.UCSC.ce11.refGene" > [7] "TxDb.Celegans.UCSC.ce6.ensGene" > [8] "TxDb.Cfamiliaris.UCSC.canFam3.refGene" > [9] "TxDb.Dmelanogaster.UCSC.dm3.ensGene" > [10] "TxDb.Dmelanogaster.UCSC.dm6.ensGene" > [11] "TxDb.Drerio.UCSC.danRer10.refGene" > [12] "TxDb.Drerio.UCSC.danRer11.refGene" > [13] "TxDb.Ggallus.UCSC.galGal4.refGene" > [14] "TxDb.Ggallus.UCSC.galGal5.refGene" > [15] "TxDb.Hsapiens.BioMart.igis" > [16] "TxDb.Hsapiens.UCSC.hg18.knownGene" > [17] "TxDb.Hsapiens.UCSC.hg19.knownGene" > [18] "TxDb.Hsapiens.UCSC.hg19.lincRNAsTranscripts" > [19] "TxDb.Hsapiens.UCSC.hg38.knownGene" > [20] "TxDb.Mmulatta.UCSC.rheMac3.refGene" > [21] "TxDb.Mmulatta.UCSC.rheMac8.refGene" > [22] "TxDb.Mmusculus.UCSC.mm10.ensGene" > [23] "TxDb.Mmusculus.UCSC.mm10.knownGene" > [24] "TxDb.Mmusculus.UCSC.mm9.knownGene" > [25] 
"TxDb.Ptroglodytes.UCSC.panTro4.refGene" > [26] "TxDb.Ptroglodytes.UCSC.panTro5.refGene" > [27] "TxDb.Rnorvegicus.BioMart.igis" > [28] "TxDb.Rnorvegicus.UCSC.rn4.ensGene" > [29] "TxDb.Rnorvegicus.UCSC.rn5.refGene" > [30] "TxDb.Rnorvegicus.UCSC.rn6.refGene" > [31] "TxDb.Scerevisiae.UCSC.sacCer2.sgdGene" > [32] "TxDb.Scerevisiae.UCSC.sacCer3.sgdGene" > [33] "TxDb.Sscrofa.UCSC.susScr11.refGene" > [34] "TxDb.Sscrofa.UCSC.susScr3.refGene" >

Federico Marini (10:10:02): > We should save this small flurry of proposed solutions on the support site - I got exposed to at least one way I did not know directly :slightly_smiling_face:
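
The subtraction that motivated the question (remove a region of interest from whole-chromosome ranges) can be sketched without any online lookup. This assumes the GenomicRanges package is installed; the two mm9 lengths are copied from the seqlengths output quoted above, and the region `roi` is a hypothetical example.

```r
library(GenomicRanges)

# mm9 lengths for two chromosomes, copied from the output quoted in this thread.
chrom_len <- c(chr1 = 197195432L, chr2 = 181748087L)

# Whole-chromosome ranges, similar in spirit to coercing a Seqinfo to GRanges.
genome_gr <- GRanges(names(chrom_len), IRanges(1L, unname(chrom_len)))
seqlengths(genome_gr) <- chrom_len

# Hypothetical region of interest to subtract.
roi <- GRanges("chr1", IRanges(1000L, 2000L))

# Everything in the genome that is not covered by the region of interest.
rest <- setdiff(genome_gr, roi)
```

The result holds the two flanks on chr1 plus all of chr2; swapping in lengths obtained by any of the approaches above generalizes this to other genomes.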

Mike Smith (10:16:11): > Using the Ensembl REST API: > > library(httr) > library(jsonlite) > library(xml2) > library(magrittr) # provides %>% and %$% > > r <- GET("https://rest.ensembl.org/info/assembly/human?", content_type("application/json")) > > r %>% content() %>% > toJSON() %>% > fromJSON() %$% > top_level_region %>% > dplyr::filter(coord_system == "chromosome") %>% > head() > > > > coord_system length name > 1 chromosome 57227415 Y > 2 chromosome 64444167 20 > 3 chromosome 156040895 X > 4 chromosome 114364328 13 > 5 chromosome 50818468 22 > 6 chromosome 133797422 10 >

Vince Carey (13:11:57): > wow, is this toJSON/fromJSON trick frequently used? It seems to deal nicely with records that have identical fields, but that do not deliver them in identical order.
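
The round trip can be reproduced offline on two hand-written records. This sketch assumes the jsonlite package is installed; the records reuse two rows from the output above with their fields deliberately shuffled, and `auto_unbox = TRUE` is an added choice here so that scalars serialize as plain values.

```r
library(jsonlite)

# Two records carrying the same fields in a different order.
records <- list(
    list(coord_system = "chromosome", length = 57227415, name = "Y"),
    list(name = "20", length = 64444167, coord_system = "chromosome")
)

# Serialising and re-parsing lets fromJSON() align fields by name.
df <- fromJSON(toJSON(records, auto_unbox = TRUE))
```

The result is a two-row data frame with one column per field name, regardless of the order in which each record listed its fields.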

Lauren Fitch (14:37:33): > how does mapFromAlignments handle S in the cigar string?

Lauren Fitch (14:50:53): > it looks like an S is not used in calculating the reference position, is that right?

Lauren Fitch (14:55:30): > it seems like they should be subtracted when calculating the reference position; S bases add to the length of the read but don’t map to the reference. It would make sense to me to handle them like an insertion but maybe I’m missing something

Sean Davis (16:19:32) (in thread): > Note that content can deliver results as text and then go into fromJSON.

Simone Bell (16:37:23): > @Simone Bell has joined the channel

2019-06-05

Mike Smith (04:57:26) (in thread): > Yes, you can equally do > > r %>% > content(type = "text/plain") %>% > fromJSON() >

Mike Smith (05:11:29) (in thread): > I think this can even be simplified to GRangesForUCSCGenome('mm9')

Mike Smith (09:09:45) (in thread): > I’ve written a short summary at https://msmith.de/2019/06/05/chrom-lengths-in-bioc.html - Attachment (R Musing): Obtaining chromosome lengths in Bioconductor > The following question was asked in the Bioconductor slack channel

Federico Marini (09:11:36) (in thread): > Couldn’t ask for more, thanks :thumbsup:

Paddy St. David (18:11:48): > @Paddy St. David has joined the channel

julen (20:30:50): > @julen has joined the channel

2019-06-06

Hervé Pagès (05:04:01) (in thread): > It does return the lengths for me: > > > library(GenomeInfoDb) > > si <- Seqinfo(genome="mm9") > > si > Seqinfo object with 35 sequences (1 circular) from mm9 genome: > seqnames seqlengths isCircular genome > chr1 197195432 FALSE mm9 > chr2 181748087 FALSE mm9 > chr3 159599783 FALSE mm9 > chr4 155630120 FALSE mm9 > chr5 152537259 FALSE mm9 > ... ... ... ... > chr16_random 3994 FALSE mm9 > chr17_random 628739 FALSE mm9 > chrX_random 1785075 FALSE mm9 > chrY_random 58682461 FALSE mm9 > chrUn_random 5900358 FALSE mm9 >

Dario Righelli (05:16:27) (in thread): > that’s strange! :exploding_head: I have to try it again! Thanks!

Hervé Pagès (05:51:04) (in thread): > The position of a read in BAM (POS field) is the position of the 1st of its nucleotides that is aligned to the reference. The nucleotides at the ends of the read that got clipped by the aligner are not considered to be aligned to the reference so the 1st aligned nucleotide is the one that immediately follows those that got clipped. In other words an S should not be used in calculating the reference position because the position reported in the POS field has already taken the clipping into account. Yes you could see the clipped nucleotides as insertions at the ends of the read but note that these insertions would not affect the reference position either because, like for the clipped ones, the inserted nucleotides are not considered to be aligned to the reference. FWIW the GenomicAlignments package provides some low-level utils for playing with CIGAR strings. For example with two 82-nucleotide reads at POS 2019: > > > library(GenomicAlignments) > > cigarRangesAlongReferenceSpace(c("3S75M4S", "3I75M4I"), pos=2019, with.ops=TRUE) > IRangesList object of length 2: > [[1]] > IRanges object with 3 ranges and 0 metadata columns: > start end width > <integer> <integer> <integer> > S 2019 2018 0 > M 2019 2093 75 > S 2094 2093 0 > > [[2]] > IRanges object with 3 ranges and 0 metadata columns: > start end width > <integer> <integer> <integer> > I 2019 2018 0 > M 2019 2093 75 > I 2094 2093 0 > > S and I are mapped to zero width ranges on the reference. The 1st aligned nucleotide is the 4th one in each read and it aligns with reference position 2019.
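
The rule described above (clipped and inserted bases occupy no reference positions) can be checked by hand with a few lines of base R. `cigar_ref_end()` is a hypothetical helper written for this illustration, not part of GenomicAlignments.

```r
# Reference-space end of an alignment from its POS and CIGAR (base R only).
# Only M, D, N, = and X consume reference positions; S, H, I and P do not.
cigar_ref_end <- function(cigar, pos) {
    ops  <- regmatches(cigar, gregexpr("[MIDNSHP=X]", cigar))[[1]]
    lens <- as.integer(regmatches(cigar, gregexpr("[0-9]+", cigar))[[1]])
    ref_width <- sum(lens[ops %in% c("M", "D", "N", "=", "X")])
    pos + ref_width - 1L
}

end_clipped  <- cigar_ref_end("3S75M4S", 2019L)   # soft clips at both ends
end_inserted <- cigar_ref_end("3I75M4I", 2019L)   # insertions at both ends
```

Both calls give 2093, matching the end of the M range in the cigarRangesAlongReferenceSpace() output above.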

Dario Righelli (10:03:03) (in thread): > @Hervé Pagès I found the problem… If you ask for the entire genome, it returns the seqlengths, but if you ask for a specific chromosome with seqnames="chr1", it doesn’t return its length. > Thanks for your reply!

Hervé Pagès (12:17:08) (in thread): > Right, you need to name the genome argument (Seqinfo(genome="mm9")). Otherwise, if you do Seqinfo("mm9"), "mm9" is passed to the 1st argument of the Seqinfo() function, which is seqnames, and in that case Seqinfo() constructs a Seqinfo object with whatever you supply to it. Seqinfo(genome="mm9") is a special way to call the Seqinfo constructor where it queries online resources to populate the Seqinfo object for you. There are examples of this in ?Seqinfo.

2019-06-07

Dario Righelli (04:52:35) (in thread): > Yeah that’s right, but maybe it would be better to implement a way to retrieve the lengths for individual chromosomes too > > > Seqinfo(seqnames=c("chr1", "chr2"), genome="mm9") > Seqinfo object with 2 sequences from mm9 genome; no seqlengths: > seqnames seqlengths isCircular genome > chr1 NA NA mm9 > chr2 NA NA mm9 >

2019-06-08

Malisa S (13:53:15): > @Malisa S has joined the channel

Hervé Pagès (15:04:35) (in thread): > Seqinfo(genome="mm9") (with only the genome argument supplied) is a special way to call the Seqinfo constructor. All the other ways just construct a Seqinfo object with whatever is supplied. If you want the seqlengths for chr1 and chr2 only, you can just subset the Seqinfo object returned by Seqinfo(genome="mm9"): > > > library(GenomeInfoDb) > > si <- Seqinfo(genome="mm9") > > si[c("chr1", "chr2")] > Seqinfo object with 2 sequences from mm9 genome: > seqnames seqlengths isCircular genome > chr1 197195432 FALSE mm9 > chr2 181748087 FALSE mm9 > > , or, if you want the seqlengths in an integer vector you can do: > > > seqlengths(si)[c("chr1", "chr2")] > chr1 chr2 > 197195432 181748087 > > Hope this helps.
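
The difference between the positional and the named call can be seen offline. This sketch assumes the GenomeInfoDb package is installed; it avoids the online `Seqinfo(genome="mm9")` lookup and instead supplies the two mm9 lengths printed above by hand.

```r
library(GenomeInfoDb)

# Positional call: "mm9" lands in `seqnames`, so this is a one-sequence
# Seqinfo named "mm9" with unknown length, not the mouse genome.
si_wrong <- Seqinfo("mm9")

# Supplying the fields explicitly builds the object exactly as given.
si <- Seqinfo(seqnames   = c("chr1", "chr2"),
              seqlengths = c(197195432L, 181748087L),
              genome     = "mm9")

chr2_len <- seqlengths(si)["chr2"]
```

Printing `si_wrong` makes the mistake easy to spot, because its only sequence name is the genome code itself.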

2019-06-10

Isaac Virshup (02:27:10): > @Isaac Virshup has joined the channel

Simina Boca (12:33:24): > My review for BiocPkgTools was just posted. That was fun to check out! - Attachment (f1000research.com): F1000Research Article: BiocPkgTools: Toolkit for mining the Bioconductor package ecosystem. > Read the latest article version by Shian Su, Vincent J. Carey, Lori Shepherd, Matthew Ritchie, Martin T. Morgan, Sean Davis, at F1000Research.

2019-06-11

FelixErnst (07:45:05): > @FelixErnst has joined the channel

Peter Hickey (18:35:15): > BioCAsia is coming to Sydney on December 5-6. > On December 5 we’ll bring together researchers with an interest in > Bioconductor and R to exchange ideas and expertise. > On December 6 we’re teaming up with BioinfoSummer (Dec 2-6) to run > hands-on bioinformatics training featuring Bioconductor software. > Following this, the joint International Conference on Genome > Informatics (formerly known as Genome Informatics Workshop; GIW) & > Australian Bioinformatics and Computational Biology Society (ABACBS) > Annual Conference will be held in Sydney Dec 9-11. @saskia, @Stevie Pederson, @Dario Strbenac and I are planning a > fun & welcoming meeting with talks & workshops for beginners through to > Gordon Smyth. > We’ll share more details soon, but for now please mark your calendars > and share with your colleagues. > We hope to see you there - Attachment (twitter): Attachment > BioCAsia is coming to Sydney on Dec 5-6! > Dec 5 we’ll bring together researchers with an interest in @Bioconductor & #rstats to exchange ideas & expertise. > Dec 6 we’re teaming up with @Bioinfosummer (Dec 2-6) to run hands on bioinformatics training featuring Bioconductor software. https://pbs.twimg.com/media/D80D6x2UcAAZxIQ.png

2019-06-12

Xuehan Zhang (12:46:45): > @Xuehan Zhang has joined the channel

Aaron Lun (20:42:47): > Does anyone know whether the support site supports mathjax? I thought it did, given its past propensity to latex-ify everything, but now I can’t get it to work: https://support.bioconductor.org/p/121786/#121864

Lori Shepherd (22:14:59): > I think when we did the updates at the beginning of the year it might have removed this capability. I think having it also interfered/made buggy other aspects of the site. I can look into it though. Please open a github issue so I don’t lose track?

Aaron Lun (22:35:03): > done.

Aaron Lun (22:54:27): > And does anyone know what this external_data_store.txt is at the root of the scRNAseq R package (https://github.com/LTLA/scRNAseq)?

Aaron Lun (22:55:06): > My guess is that it harked back to a time when Bioconductor stored the data objects in data packages away from the repository, and in some… external store.

2019-06-13

Vince Carey (00:21:20): > external_data_store.txt was an essential part of the subversion-based approach for experiment data packages. See https://www.mail-archive.com/bioc-devel@r-project.org/msg05625.html for some discussion. You do not need this file in scRNAseq AFAICT.

Stuart Lee (21:18:27): > Just an FYI Github has a waitlist for their developer sponsorship program https://github.com/sponsors - might be worth Bioconductor and all the volunteer developers looking into. There’s also a form https://docs.google.com/forms/d/e/1FAIpQLSdE8nL7U-d7CBTWp9X7XOoezQD06wCzCAS9VpoUW6lJ03KU7w/viewform where you can nominate developers you would like to sponsor.

Aaron Lun (22:33:53): > aw man, that would make my taxes even more complicated.

Stuart Lee (23:53:04) (in thread): > weirdly i thought of you when posting this. i would consider it a charitable donation ha

2019-06-14

Aaron Lun (01:25:09) (in thread): > I forfeited my IP rights when I got on the industry gravy train

Aaron Lun (01:25:40) (in thread): > thanks vince…. subversion - that’s a blast from the past…

2019-06-16

Keith Connolly (21:12:59): > @Keith Connolly has joined the channel

2019-06-17

Luka (08:32:14): > Hey all! > After reading in a BAM file with scanBam, is it possible to change the base quality scores in bam$qual from letters/characters into numbers? I know about the alphabetScore() function which returns the sum of the individual base quality scores within a read (which can then be divided by the read length to get the average base quality score in a given read), but I’m wondering whether I can somehow retrieve the actual base quality scores at each position for each read in a BAM file?

Mike Smith (10:06:31) (in thread): > as(quality(ShortReadQ), "matrix") should do it.

Rob Amezquita (10:10:12): > is anyone finding that on macOS (Mojave) XQuartz is being extremely nonresponsive? Even for small datasets or just plot(1,1) it is hanging for ages on my laptop.

Luka (10:15:08) (in thread): > amazing… thanks a lot!

FelixErnst (10:55:13) (in thread): > Two things out of curiosity: the ShortReadQ class is from the ShortRead package, isn’t it? The package documentation mentions that it is designed to work with FASTQ files, and refers to the Rsamtools package for working with BAM files. @Mike Smith Does it work nonetheless? The second thing: what happens if the BAM file contains reads of two or more lengths? A conversion to a matrix would fail, wouldn’t it?

FelixErnst (10:59:10) (in thread): > my take for this would be as(mcols(readGAlignments(file, param = ScanBamParam(what = "qual")))$qual, "IntegerList")

Quy Cao (13:04:24): > @Quy Cao has joined the channel

2019-06-18

Mike Smith (03:18:17) (in thread): > My assumption was that if alphabetScore() was in play, but not appropriate, then we were already looking at the ShortRead package. > > No idea what it would do with variable read lengths. I guess my old scripts are all based on Illumina data.

FelixErnst (03:43:46) (in thread): > Sure. But is the question about adapter-clipped mapped reads, or raw reads from a device that are just stored as a BAM file?

Luka (05:37:38) (in thread): > Adapter-clipped mapped reads

Mike Smith (08:26:24) (in thread): > In which case Felix’s answer is presumably more robust. Out of interest, what happens if you try my approach? Is it even possible or does it work on the wrong class?

Luka (08:53:51) (in thread): > I first need to convert bam$seq and bam$qual into a ShortReadQ object and then I do what you suggested: > > foo <- ShortReadQ(sread = bam$seq, quality = bam$qual) > as(quality(foo), "matrix") > > This outputs a matrix of dimensions (number of reads, number of cycles in the sequencing run), and for reads which are shorter than the number of cycles in the sequencing run you get NAs after the last base quality score
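
A base-R sketch of what such a matrix looks like, assuming the quality strings are Phred+33 encoded (ASCII code minus 33) and using made-up strings rather than real BAM content. The names qual_strings and decode_phred are hypothetical and not part of ShortRead or Rsamtools:

```r
# Hypothetical quality strings for three reads of different lengths,
# assumed to be Phred+33 encoded.
qual_strings <- c("IIII5", "!+5", "??")

# Decode one string: ASCII code of each character minus the offset.
decode_phred <- function(x, offset = 33L) utf8ToInt(x) - offset

scores <- lapply(qual_strings, decode_phred)

# Indexing past the end of a vector yields NA, so shorter reads are
# padded with NA up to the length of the longest read.
max_len <- max(lengths(scores))
qual_matrix <- t(vapply(scores, function(s) s[seq_len(max_len)], integer(max_len)))

print(qual_matrix)
```

Each row is one read and each column one position, so the NA cells mark positions beyond the end of a shorter read rather than missing quality values.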

FelixErnst (11:15:02) (in thread): > Just out of curiosity: what do you want to do with this? If you want to access sequences with certain quality scores, I would stick to the IntegerList. This can easily be turned into a LogicalList for further use. Coordinates can be retrieved like this and used to subset: > > ga <- readGAlignments(file, param = ScanBamParam(what = c("qual","seq"))) > seq <- mcols(ga)$seq > qual <- as(mcols(ga)$qual,"IntegerList") > rle <- RleList(qual < 30L) > irl <- as(rle,"IRangesList") > seq[irl] >

Luka (11:30:42) (in thread): > I wanted to get a feel for the distribution of base quality scores in my BAM file… but after importing the BAM file I realised they’re not encoded as integers so I needed to convert them somehow… I’m not familiar with the IntegerList class but I’ll make sure to check it out… thanks both of you for your input! much appreciated

FelixErnst (11:47:55) (in thread): > Continuing from the example above: hist(unlist(qual))
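
The same kind of summary can be sketched without any Bioconductor classes, assuming Phred+33 strings and made-up data. The threshold of 30 mirrors the one used earlier in the thread, and all variable names are illustrative:

```r
# Hypothetical Phred+33 quality strings, one per read.
qual_strings <- c("IIII5", "!+5", "??")

# A plain list of integer vectors stands in for the IntegerList here.
scores <- lapply(qual_strings, function(x) utf8ToInt(x) - 33L)

# Pooled distribution of per-base scores across all reads.
score_counts <- table(unlist(scores))

# Per-read fraction of bases below the quality threshold.
frac_low <- vapply(scores, function(s) mean(s < 30L), numeric(1))

print(score_counts)
print(frac_low)
```

The table gives the counts that hist() would bin, and the per-read fractions are one way to flag reads dominated by low-quality bases.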

Burak Kutlu (18:38:08): > @Burak Kutlu has joined the channel

2019-06-19

ZainabAlTaie (13:43:35): > @ZainabAlTaie has joined the channel

ZainabAlTaie (13:49:12): > Hi, I am wondering if there is a channel that includes ChAMP related questions.

Peter Hickey (17:36:08) (in thread): > Probably best posted to http://support.bioconductor.org

Shian Su (20:08:29): > Playing around with fast5 files if anyone wants to join in the fun: https://github.com/shians/porexplorer

Aaron Lun (20:13:12): > ah, my old nemesis.

Shian Su (20:15:47): > I was a bit surprised I couldn’t find any R tools that just read in a squiggle to play around with.

Shian Su (20:16:39): > The R landscape for squiggles is some bizarro world.

Shian Su (20:20:03): > poRe is a SourceForge project with documentation on GitHub. NanoR is on GitHub, is 90% organised to be an installable package, but instead includes a tarzipped file in the repo that you’re meant to install from.

Aaron Lun (20:25:20): > I’m just all albacore and give me FASTQs.

Aaron Lun (20:25:34): > Just CBFing so hard.

Peter Hickey (21:41:38): > don’t think that’s an option for Shian, unfortunately, as he’s looking at base modifications

2019-06-20

Charlotte Soneson (03:36:03) (in thread): > There’s also http://www.bioconductor.org/packages/IONiseR/ - Attachment (Bioconductor): IONiseR > IONiseR provides tools for the quality assessment of Oxford Nanopore MinION data. It extracts summary statistics from a set of fast5 files and can be used either before or after base calling. In addition to standard summaries of the read-types produced, it provides a number of plots for visualising metrics relative to experiment run time or spatially over the surface of a flowcell.

Harry Danwonno (07:24:18): > @Harry Danwonno has joined the channel

Luca Parmigiani (09:31:31): > @Luca Parmigiani has joined the channel

Nick Gomez (09:52:34): > @Nick Gomez has joined the channel

Robert Castelo (10:10:41): > @Robert Castelo has joined the channel

Rama Shankar (10:24:55): > @Rama Shankar has joined the channel

Hua Ling (10:41:15): > @Hua Ling has joined the channel

Tamas Kiss (10:57:26): > @Tamas Kiss has joined the channel

Marcello (11:39:28): > @Marcello has joined the channel

Robert Williams (11:52:01): > @Robert Williams has joined the channel

Yan Zhang (12:12:04): > @Yan Zhang has joined the channel

ZainabAlTaie (12:13:41) (in thread): > Thank you very much!

Sanjeev Sariya (12:24:18): > @Sanjeev Sariya has joined the channel

Tanya Grancharova (12:49:58): > @Tanya Grancharova has joined the channel

Sanjeev Sariya (13:10:50): > Was wondering how I could get this interface in an Android app?

Lori Shepherd (13:14:48) (in thread): > what interface are you referring to, Slack?

Lori Shepherd (13:17:48) (in thread): > https://slack.com/downloads/android - Attachment (Slack): Android | Downloads > Download Slack for free for mobile devices or desktop. Keep up with the conversation with our apps for iPhone, Android, Windows Phone and more.

Thomas Sandmann (13:50:21): > @Thomas Sandmann has joined the channel

Sanjeev Sariya (14:24:53) (in thread): > I’m using this via the desktop interface in a browser. :grimacing:

Sanjeev Sariya (14:28:00) (in thread): > Got it on Android. Thank you… :slightly_smiling_face:

Mike Smith (15:48:31) (in thread): > I’d be keen to know if this is compatible with the current files ONT are producing. It was such a hassle keeping up with the format changes they introduced. If it doesn’t work, I’m happy to implement updates

Ghada Soliman (16:59:34): > @Ghada Soliman has joined the channel

Sanjeev Sariya (17:40:43): > I was wondering if it would be good to have discussion of RNA-seq, say in a new channel? scRNA-seq is of course cutting-edge, but tough to get into due to the expense.

Aaron Lun (19:14:05): > @Sanjeev Sariya what do you want to talk about in that channel? Bulk RNA-seq analysis software is pretty well established, so there’s relatively little scope for further development compared to other things.

Aaron Lun (19:14:22): > I mean, it costs nothing to open a channel, but you just might not get any activity on it.

Aaron Lun (19:14:37): > If you’ve got questions on how to use the software, then the support site is the place to ask.

Shian Su (19:26:45) (in thread): > I haven’t implemented multi-fast5 yet, it’s not difficult but I’m trying to think about abstractions.

Shian Su (19:32:03) (in thread): > I want a Fast5Collection class that represents a collection of reads whether they are single or multi-reads.

Domenick Braccia (21:07:11): > @Sanjeev Sariya There is a really good channel on the r-bioinformatics Slack called #transcriptomics (an off-shoot of the reddit.com/r/bioinformatics forum). There is very frequent discussion on that channel about all things RNA-seq, and there are a dedicated few who take the time to answer almost every question posted there. I highly recommend it.

Aaron Lun (23:25:49): > AFAIK, Slack messages and threads aren’t indexed/searchable by Google, etc. - so unless they’re transcribing the message threads to a more persistent and accessible location, it seems like a return to the old Usenet days where the same questions would get asked and answered again, and again, and again…

Aaron Lun (23:31:13): > Not that I actually used Usenet. I wasn’t even alive when it was still a big thing.

2019-06-21

Sanjeev Sariya (10:31:58) (in thread): > @Domenick Braccia - Thanks for pointing me to the right resource. I’m waiting for an invite link! Appreciate your reply. :slightly_smiling_face:

Sanjeev Sariya (10:38:57) (in thread): > @Aaron Lun Makes sense. > Appreciate your reply! :slightly_smiling_face:

Burton Karger (11:04:54): > @Burton Karger has joined the channel

John Lawson (11:56:28): > @John Lawson has joined the channel

Domenick Braccia (12:00:13): > @Aaron Lun In that case I’d suggest that @Sanjeev Sariya look up some of the questions that have already been asked on the r/bioinformatics forum or the Bioconductor support site to see if there’s an answer out there for the question.

Komal Rathi (12:19:06): > @Komal Rathi has joined the channel

Hena Ramay (23:56:35): > @Hena Ramay has joined the channel

2019-06-22

Michael Love (13:54:33): > Note to S4 people: https://github.com/rstudio/rstudio/issues/4741

Michael Love (13:55:59): > I’ve gotten at least three reports this week of this. Users are confused by the description of the S4 objects in the Environment pane. There’s nothing wrong, but it’s confusing.

Michael Love (13:56:59): > E.g. here’s an example post: https://support.bioconductor.org/p/122184/#122188

Martin Morgan (15:10:12): > Thanks @Michael Love, I added a little commentary to the rstudio issue; perhaps it’ll encourage a different response from them…

Hervé Pagès (22:00:21): > Starting with SummarizedExperiment 1.15.4, SE objects don’t use ref classes internally (https://github.com/Bioconductor/SummarizedExperiment/commit/e8a159a81e8d5805c7781301bcd5c68d066531d7) so they’ll no longer be reported as “