Pipeline #148

klapaukh · 2017-11-21T03:16:02Z

This basically allows you to create ccd objects from the database, rather than from the XML or the RData, using the following:
con <- connect(username="something", database="something")
ccd <- table.to.ccdata(exportData(con) %>% collect, metadata(connection = con))

The only real changes are in R/dbConnection.R [and the README but those are trivial]
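
A minimal sketch of what the `%>% collect` step in the snippet above does. It assumes an in-memory SQLite database as a stand-in for the real one. The `events` table, its columns, and the `EXAMPLE_CODE` value are hypothetical and only there for illustration. None of the functions from this PR (`connect`, `exportData`, `metadata`, `table.to.ccdata`) are called here, so this says nothing about their internals.

```r
library(dplyr)  # tbl(), collect(), %>%; database tables also need dbplyr installed

# Stand-in database (assumption): in-memory SQLite via DBI and RSQLite.
con <- DBI::dbConnect(RSQLite::SQLite(), ":memory:")

# Hypothetical table and columns, for illustration only.
DBI::dbWriteTable(con, "events", data.frame(
  episode_id = c(1L, 1L, 2L),
  code_name  = c("NIHR_HIC_ICU_0005", "EXAMPLE_CODE", "NIHR_HIC_ICU_0005"),
  value      = c("a", "b", "c"),
  stringsAsFactors = FALSE
))

# tbl() only builds a lazy reference to the table; no rows are fetched yet.
lazy_events <- tbl(con, "events")

# collect() runs the query and returns the rows as an in-memory tibble,
# which is the kind of object an in-memory conversion step can work on.
local_events <- lazy_events %>% collect()

stopifnot(is.data.frame(local_events), nrow(local_events) == 3)

DBI::dbDisconnect(con)
```

Collecting before converting means the whole table is pulled into memory, which is simple but may be slow for a large database.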

codecov-io · 2017-11-21T03:16:04Z

Codecov Report

Merging #148 into master will decrease coverage by 3.08%.
The diff coverage is 0%.

@@            Coverage Diff             @@
##           master     #148      +/-   ##
==========================================
- Coverage   75.65%   72.56%   -3.09%     
==========================================
  Files          14       15       +1     
  Lines        1269     1323      +54     
==========================================
  Hits          960      960              
- Misses        309      363      +54

Impacted Files	Coverage Δ
R/dbConnection.R	`0% <0%> (ø)`

Legend: Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update a21af68...a5995eb.

jonc125 · 2017-11-21T08:58:20Z

Is the AppVeyor failure specific to this PR, or a long-standing problem?

jonc125 · 2017-11-21T09:00:20Z

Seems to have been failing for a while, including all of #145. From #135 it may be caused by importing dplyr?

jonc125

Generally looks good (though I don't really know R!). My main comment would be that it needs a little more user-oriented documentation & commenting if others are to use it. But I agree the focus should be getting it to work first :)

jonc125 · 2017-11-21T09:01:54Z

R/dbConnection.R

+            names
+
+          if(length(types) == 1) {
+              if(cname == 'NIHR_HIC_ICU_0005'){


Worth adding a comment explaining this special case.

I added a comment about this

jonc125 · 2017-11-21T09:05:15Z

README.md

 * data.table,
-* yaml,
+* dbplyr,


This isn't mentioned earlier? Typo?

dbplyr must be installed on the system, because dplyr needs it to talk to a database, but you never call it explicitly. It is a bit strange that way.
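
One way to make this implicit dependency visible is an explicit check before any database work. This is only a sketch: the helper name `has_db_backend` is hypothetical and is not part of this PR or of dplyr.

```r
# Hypothetical helper: reports whether dbplyr is installed, without attaching it.
# dplyr loads dbplyr on demand when tbl() is given a database connection,
# so user code never has to call library(dbplyr) itself.
has_db_backend <- function() {
  requireNamespace("dbplyr", quietly = TRUE)
}

if (!has_db_backend()) {
  message("dbplyr is not installed; dplyr cannot query database tables without it.")
}
```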

jonc125 · 2017-11-21T09:05:55Z

README.md

 * pander,
 * Rcpp,
-* methods
+* RPostgreSQL,
+* XML,


Presumably once the DB connector is working we could strip out the old XML handling code?

But not in this PR!

I don't know about that. Based on Sinan's messages about the package being general purpose, keeping the XML handling seems like it could be sensible since it may not be used just for the database we have?

I'll ask this morning :)

jonc125 · 2017-11-21T09:07:22Z

R/dbConnection.R

+#' codes that are available with their meanings, and
+#' what the different columns mean for them. 
+#'
+#' This functions returns a loaded into memory R table,


s/functions/function/

jonc125 · 2017-11-21T09:08:10Z

R/dbConnection.R

+
+#' The metadata table. This is all of the different 
+#' codes that are available with their meanings, and
+#' what the different columns mean for them. 


Is it worth adding to this what the structure of the table is, so callers know how to use it?

I have given that a go

sinanshi · 2017-11-21T10:18:23Z

It is really great to see you guys starting to implement the database approach. Although I don't know exactly what you are planning to do, cleanEHR has now been branded as a more generic data cleaning package; at least this is what we said in the JOSS paper, which is still under review. Moving from ccdata to a database fits that purpose well. As you named the branch "pipeline", I wonder whether this is a data processing pipeline only for CCHIC, e.g. importing XML files from hospitals into the database. If so, you might want to consider moving it to a separate package, at least until the JOSS paper is published.

dpshelio · 2017-11-21T10:25:15Z

@sinanshi - the database importer is done in a different package, this is only to read from a database and generate a ccdata object.

sinanshi · 2017-11-21T13:48:00Z

@dpshelio that's great.

jonc125

Looks good to me!

jonc125 · 2017-11-22T08:40:19Z

R/dbConnection.R

 #'
-#' This functions returns a loaded into memory R table,
+#' * Code_name is the XML code of this type of data


Is that actually a capital C? Would be nice to be consistent with the other column names if it is.

jonc125 · 2017-11-22T08:40:33Z

R/dbConnection.R

-#' This functions returns a loaded into memory R table,
+#' * Code_name is the XML code of this type of data
+#' * long_name is an english description of what it is
+#' * primary_column is what columns of the events table 


s/columns/column/

(I know, I'm nitpicking!)

klapaukh requested review from dpshelio and jonc125 November 21, 2017 03:16

docsteveharris added the Waiting For Review label Nov 21, 2017

jonc125 suggested changes Nov 21, 2017

jonc125 approved these changes Nov 22, 2017

klapaukh and others added 9 commits January 16, 2018 14:28

Added some code to try connect ccdata with database

f95c63d

Redocument following rebase

8f086e9

Fix typing and object creation

ea9ce88

Clean up code to remove stuff that was a misunderstanding.

16cc765

Fix bugs in colum names that break anonymisation. Touch up documentation

b61a11a

Add some extra documentation

baa41af

typo

c615ba5

more typos

542445c

Rebased and updated library

f0d5d25

dpshelio force-pushed the pipeline branch from d923649 to f0d5d25 Compare January 16, 2018 14:42

Fix incorrect variable name

a5995eb

dpshelio mentioned this pull request Feb 1, 2018

Issue with table.to.ccdata #154

Closed

maelle closed this Aug 23, 2022
