-
Notifications
You must be signed in to change notification settings - Fork 10
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Nanostring update 20181010 #50
base: master
Are you sure you want to change the base?
Nanostring update 20181010 #50
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Congrats on the first pull request! 🎉 🎉 looks great - just a couple of comments/questions
dplyr::arrange(desc(rfFreq)) %>% | ||
dplyr::top_n(n = top_n_genes) %>% | ||
dplyr::mutate(genes = toupper(genes)) | ||
dplyr::filter(genes == "BOP1" |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
lets do it this way instead - I think it is cleaner:
capture_genes <- c("BOP1", "DNAI1", "HSF1", "LRRC50", "MS4A3",
"NTN2L", "SHARPIN", "SLC12A3", "SOX10", "TSNAXIP1")
classifier_df <- readr::read_csv(file) %>%
dplyr::filter(genes %in% capture_genes)
@@ -14,12 +14,40 @@ library(dplyr) | |||
|
|||
file <- file.path("7.Nanostring", "data", "overallFreqs.csv") | |||
|
|||
# OPTION 1: Classifier genes |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
What is the plan currently with this? Are we going to go with the different options and run independently?
Yay! Glad everything made sense. I like the filtering change.
For your second question, my "quick fix" plan was to have the different
options available to comment in/out as needed. I made this choice because
I was unsure of what steps in parts B-F I would need to change if I had all
of the gene sets lead to separate outputs (e.g., would I need to have 4
separate pipelines or somehow further streamline the output file naming
process?). With this "quick fix" all I have to do is rename the "figures"
and "results" folders at the completion of each full-pipeline run and run
again with the next gene set. I'm open to more elegant solutions.
…On Wed, Oct 10, 2018 at 1:32 PM Greg Way ***@***.***> wrote:
***@***.**** commented on this pull request.
Congrats on the first pull request! 🎉 🎉 looks great - just a couple of
comments/questions
------------------------------
In 7.Nanostring/scripts/A.get_correlation_output.R
<#50 (comment)>
:
> classifier_df <- readr::read_csv(file) %>%
- dplyr::arrange(desc(rfFreq)) %>%
- dplyr::top_n(n = top_n_genes) %>%
- dplyr::mutate(genes = toupper(genes))
+ dplyr::filter(genes == "BOP1"
lets do it this way instead - I think it is cleaner:
capture_genes <- c("BOP1", "DNAI1", "HSF1", "LRRC50", "MS4A3",
"NTN2L", "SHARPIN", "SLC12A3", "SOX10", "TSNAXIP1")
classifier_df <- readr::read_csv(file) %>%
dplyr::filter(genes %in% capture_genes)
------------------------------
In 7.Nanostring/scripts/A.get_correlation_output.R
<#50 (comment)>
:
> @@ -14,12 +14,40 @@ library(dplyr)
file <- file.path("7.Nanostring", "data", "overallFreqs.csv")
+# OPTION 1: Classifier genes
What is the plan currently with this? Are we going to go with the
different options and run independently?
—
You are receiving this because you authored the thread.
Reply to this email directly, view it on GitHub
<#50 (review)>,
or mute the thread
<https://github.com/notifications/unsubscribe-auth/ApRdfMSFuUPrrPsOZaA0Y0g9NaL3x0BWks5ujktqgaJpZM4XWJGH>
.
--
Mollie Barnard, ScD
Postdoctoral Fellow
Huntsman Cancer Institute
Department of Population Health Sciences
University of Utah
2000 Circle of Hope, Rm 4748
Salt Lake City, UT 84112
Phone: (801) 213-6006
[email protected]
|
This sounds good if you're looking to get results and iterate quickly. If this is the case, I am on board. However, it is not a good long term solution from a maintainability perspective. I think we should run the pipeline through with an alternative set (like the extra 10 genes) and see what other issues we run into and in which scripts. When I wrote this I originally didn't intend for it to be run using different genes, and we're paying the price now! We should probably address any issues we run into. For this particular solution, we may want to consider switching to an |
The following pull request adds, fixes, and/or modifies:
I performed the following prior to filing this pull request:
Lastly, I will assign individuals to review my code and be proud of making the pull request!