Export labels to statistical software

Hi guys,

our research team is planning a panel household survey (1500
households), for which we want to use CAPI software on mobile devices.
ODK looks very interesting. One question I couldn't find an answer to
is, is it possible to export the data to a statistical analysis
software (Stata, SAS...) with variable and value labels?

Thanks for any help

Olivier

From ODK Aggregate you can export all your data in .csv files which
can be imported into most statistical packages.

··· On Wed, Dec 7, 2011 at 3:28 AM, Olivier Kalmus wrote: > Hi guys, > > our research team is planning a panel household survey (1500 > households), for which we want to use CAPI software on mobile devices. > ODK looks very interesting. One question I couldn't find an answer to > is, is it possible to export the data to a statistical analysis > software (Stata, SAS...) with variable and value labels? > > Thanks for any help > > Olivier > > -- > Post: opendatakit@googlegroups.com > Unsubscribe: opendatakit+unsubscribe@googlegroups.com > Options: http://groups.google.com/group/opendatakit?hl=en

Dear ODK developers,

I am also new to ODK and I would like to thank you for the great
programme(s). It is a big step forward in digital data collection.
However, like the original poster described it, I am also severely
missing a possibility to export data to statistical programmes such as
Stata or R including variable labels and value labels, i.e.
essentially the underlying logic of the questionnaire. Before, we were
using pencil paper questionnaires and the free (but not open source)
CSPro as data entry tool. It was possible to directly export to Stata,
which was extremely handy. Any change in the questionnaire was
immediately and correctly reflected in the Stata export. As any
digital questionnaire includes the information of (a) the question,
(b) variable name, (c) answer categories in words and (d) associated
values in numbers. As far as I understood, (a)-(c) would be lost in
ODK .csv export and only (d) is kept. Is that correct? IMHO, this
would mean much (error-prone) duplication of work.

How difficult is it to include (a)-(c) and to develop an export to
statistical programmes? Has already anyone started to write an export
filter to one of these statistical programmes? Thank you very much!

Cheers and Happy New Year,
Gerry

For those using STATA there is a plugin (ODKMETA) for STATA that is used
to include value labels in output. You will need the original XLS
questionnaire with the survey and choices sheet and the CSV output from ODK
Server. You may download at this link:

··· On Wednesday, December 7, 2011 at 1:28:48 PM UTC+2, Olivier Kalmus wrote: > > Hi guys, > > our research team is planning a panel household survey (1500 > households), for which we want to use CAPI software on mobile devices. > ODK looks very interesting. One question I couldn't find an answer to > is, is it possible to export the data to a statistical analysis > software (Stata, SAS...) with variable and value labels? > > Thanks for any help > > Olivier > >

also, if you are considering using odk for capi, the worldbank did an
evaluation of a bunch of mobile data collection tools (including odk)
for capi. might be good to take a look through it to understand the
tradeoffs...

http://web.worldbank.org/WBSITE/EXTERNAL/EXTDEC/EXTRESEARCH/EXTLSMS/EXTSURAGRI/0,,contentMDK:22939078~pagePK:64168445~piPK:64168309~theSitePK:7420261,00.html

··· On Wed, Dec 7, 2011 at 08:58, Gaetano Borriello wrote: > From ODK Aggregate you can export all your data in .csv files which > can be imported into most statistical packages. > > > On Wed, Dec 7, 2011 at 3:28 AM, Olivier Kalmus wrote: >> Hi guys, >> >> our research team is planning a panel household survey (1500 >> households), for which we want to use CAPI software on mobile devices. >> ODK looks very interesting. One question I couldn't find an answer to >> is, is it possible to export the data to a statistical analysis >> software (Stata, SAS...) with variable and value labels? >> >> Thanks for any help >> >> Olivier >> >> -- >> Post: opendatakit@googlegroups.com >> Unsubscribe: opendatakit+unsubscribe@googlegroups.com >> Options: http://groups.google.com/group/opendatakit?hl=en > > -- > Post: opendatakit@googlegroups.com > Unsubscribe: opendatakit+unsubscribe@googlegroups.com > Options: http://groups.google.com/group/opendatakit?hl=en

That information is stored in the xform which should be uniquely
correlated to the answers that are output in the csv. The csv only
represents the answers for that unique identifier. It sounds like what
you want is a file that contains everything in one file (form
information + csv of answers). I do not know of a tool that currently
does that.

If somebody wants to write some code to do this that would be great.
If someone does write some code send the ODK team a copy of the freely
licensed code and we will try to work it into the tools.

A basic starting point is to file an issue on our website that
requests this feature and shows how the information should appear. For
us to implement this we would have to go research the proper formats.
You could help us with this by filing an issue with links to the
format specification and a couple of examples (can contain fake data)
that show the original xform used to gather data, the original csv
export along with a file that shows what the desired output format
that combined them. The issue tracker allows you to attach files.

Cheers,
Waylon

··· On Wed, Jan 4, 2012 at 9:35 AM, Gerry wrote: > Dear ODK developers, > > I am also new to ODK and I would like to thank you for the great > programme(s). It is a big step forward in digital data collection. > However, like the original poster described it, I am also severely > missing a possibility to export data to statistical programmes such as > Stata or R including variable labels and value labels, i.e. > essentially the underlying logic of the questionnaire. Before, we were > using pencil paper questionnaires and the free (but not open source) > CSPro as data entry tool. It was possible to directly export to Stata, > which was extremely handy. Any change in the questionnaire was > immediately and correctly reflected in the Stata export. As any > digital questionnaire includes the information of (a) the question, > (b) variable name, (c) answer categories in words and (d) associated > values in numbers. As far as I understood, (a)-(c) would be lost in > ODK .csv export and only (d) is kept. Is that correct? IMHO, this > would mean much (error-prone) duplication of work. > > How difficult is it to include (a)-(c) and to develop an export to > statistical programmes? Has already anyone started to write an export > filter to one of these statistical programmes? Thank you very much! > > Cheers and Happy New Year, > Gerry > > -- > Post: opendatakit@googlegroups.com > Unsubscribe: opendatakit+unsubscribe@googlegroups.com > Options: http://groups.google.com/group/opendatakit?hl=en

https://github.com/PovertyAction/odkmeta is the authoritative link.

··· On Wed, Feb 1, 2017 at 7:25 PM, Vincent Mkandawire wrote: > For those using STATA there is a plugin (ODKMETA) for STATA that is used to > include value labels in output. You will need the original XLS questionnaire > with the survey and choices sheet and the CSV output from ODK Server. You > may download at this link: https://ideas.repec.org/c/boc/bocode/s457767.html > > On Wednesday, December 7, 2011 at 1:28:48 PM UTC+2, Olivier Kalmus wrote: >> >> Hi guys, >> >> our research team is planning a panel household survey (1500 >> households), for which we want to use CAPI software on mobile devices. >> ODK looks very interesting. One question I couldn't find an answer to >> is, is it possible to export the data to a statistical analysis >> software (Stata, SAS...) with variable and value labels? >> >> Thanks for any help >> >> Olivier > > -- > -- > Post: opendatakit@googlegroups.com > Unsubscribe: opendatakit+unsubscribe@googlegroups.com > Options: http://groups.google.com/group/opendatakit?hl=en > > --- > You received this message because you are subscribed to the Google Groups > "ODK Community" group. > To unsubscribe from this group and stop receiving emails from it, send an > email to opendatakit+unsubscribe@googlegroups.com. > For more options, visit https://groups.google.com/d/optout.

Thanks for the replies so far. I do have that CAPI report from the
World Bank.
In the report they state that labels cannot be transferred to a
statistical software from ODK, whereas in the appendix it says
difficult but possible.
While this is not a major problem, having the possibility of exporting
the labels would be great to have.

··· On Dec 7, 3:02 pm, Yaw Anokwa wrote: > also, if you are considering using odk for capi, the worldbank did an > evaluation of a bunch of mobile data collection tools (including odk) > for capi. might be good to take a look through it to understand the > tradeoffs... > > http://web.worldbank.org/WBSITE/EXTERNAL/EXTDEC/EXTRESEARCH/EXTLSMS/E... > > On Wed, Dec 7, 2011 at 08:58, Gaetano Borriello wrote: > > From ODK Aggregate you can export all your data in .csv files which > > can be imported into most statistical packages. > > > On Wed, Dec 7, 2011 at 3:28 AM, Olivier Kalmus wrote: > >> Hi guys, > > >> our research team is planning a panel household survey (1500 > >> households), for which we want to use CAPI software on mobile devices. > >> ODK looks very interesting. One question I couldn't find an answer to > >> is, is it possible to export the data to a statistical analysis > >> software (Stata, SAS...) with variable and value labels? > > >> Thanks for any help > > >> Olivier > > >> -- > >> Post: opendatakit@googlegroups.com > >> Unsubscribe: opendatakit+unsubscribe@googlegroups.com > >> Options:http://groups.google.com/group/opendatakit?hl=en > > > -- > > Post: opendatakit@googlegroups.com > > Unsubscribe: opendatakit+unsubscribe@googlegroups.com > > Options:http://groups.google.com/group/opendatakit?hl=en

oliver,

can you give me an example of what you call a "label"? i'm assuming in
a select prompt, it's what is shown to the user, but not the
underlying value that is saved in the form? if that is the case, why
can't you use the value?

yaw

··· On Wed, Dec 7, 2011 at 07:52, Olivier Kalmus wrote: > Thanks for the replies so far. I do have that CAPI report from the > World Bank. > In the report they state that labels cannot be transferred to a > statistical software from ODK, whereas in the appendix it says > difficult but possible. > While this is not a major problem, having the possibility of exporting > the labels would be great to have. > > On Dec 7, 3:02 pm, Yaw Anokwa wrote: >> also, if you are considering using odk for capi, the worldbank did an >> evaluation of a bunch of mobile data collection tools (including odk) >> for capi. might be good to take a look through it to understand the >> tradeoffs... >> >> http://web.worldbank.org/WBSITE/EXTERNAL/EXTDEC/EXTRESEARCH/EXTLSMS/E... >> >> On Wed, Dec 7, 2011 at 08:58, Gaetano Borriello wrote: >> > From ODK Aggregate you can export all your data in .csv files which >> > can be imported into most statistical packages. >> >> > On Wed, Dec 7, 2011 at 3:28 AM, Olivier Kalmus wrote: >> >> Hi guys, >> >> >> our research team is planning a panel household survey (1500 >> >> households), for which we want to use CAPI software on mobile devices. >> >> ODK looks very interesting. One question I couldn't find an answer to >> >> is, is it possible to export the data to a statistical analysis >> >> software (Stata, SAS...) with variable and value labels? >> >> >> Thanks for any help >> >> >> Olivier >> >> >> -- >> >> Post: opendatakit@googlegroups.com >> >> Unsubscribe: opendatakit+unsubscribe@googlegroups.com >> >> Options:http://groups.google.com/group/opendatakit?hl=en >> >> > -- >> > Post: opendatakit@googlegroups.com >> > Unsubscribe: opendatakit+unsubscribe@googlegroups.com >> > Options:http://groups.google.com/group/opendatakit?hl=en > > -- > Post: opendatakit@googlegroups.com > Unsubscribe: opendatakit+unsubscribe@googlegroups.com > Options: http://groups.google.com/group/opendatakit?hl=en

Gerry,

I've filed a feature request at
http://code.google.com/p/opendatakit/issues/detail?id=495 if you want
to help us specify the feature and track progress.

Yaw

··· On Wed, Jan 4, 2012 at 11:45, W. Brunette wrote: > That information is stored in the xform which should be uniquely > correlated to the answers that are output in the csv. The csv only > represents the answers for that unique identifier. It sounds like what > you want is a file that contains everything in one file (form > information + csv of answers). I do not know of a tool that currently > does that. > > If somebody wants to write some code to do this that would be great. > If someone does write some code send the ODK team a copy of the freely > licensed code and we will try to work it into the tools. > > A basic starting point is to file an issue on our website that > requests this feature and shows how the information should appear. For > us to implement this we would have to go research the proper formats. > You could help us with this by filing an issue with links to the > format specification and a couple of examples (can contain fake data) > that show the original xform used to gather data, the original csv > export along with a file that shows what the desired output format > that combined them. The issue tracker allows you to attach files. > > Cheers, > Waylon > > On Wed, Jan 4, 2012 at 9:35 AM, Gerry wrote: >> Dear ODK developers, >> >> I am also new to ODK and I would like to thank you for the great >> programme(s). It is a big step forward in digital data collection. >> However, like the original poster described it, I am also severely >> missing a possibility to export data to statistical programmes such as >> Stata or R including variable labels and value labels, i.e. >> essentially the underlying logic of the questionnaire. Before, we were >> using pencil paper questionnaires and the free (but not open source) >> CSPro as data entry tool. It was possible to directly export to Stata, >> which was extremely handy. Any change in the questionnaire was >> immediately and correctly reflected in the Stata export. As any >> digital questionnaire includes the information of (a) the question, >> (b) variable name, (c) answer categories in words and (d) associated >> values in numbers. As far as I understood, (a)-(c) would be lost in >> ODK .csv export and only (d) is kept. Is that correct? IMHO, this >> would mean much (error-prone) duplication of work. >> >> How difficult is it to include (a)-(c) and to develop an export to >> statistical programmes? Has already anyone started to write an export >> filter to one of these statistical programmes? Thank you very much! >> >> Cheers and Happy New Year, >> Gerry >> >> -- >> Post: opendatakit@googlegroups.com >> Unsubscribe: opendatakit+unsubscribe@googlegroups.com >> Options: http://groups.google.com/group/opendatakit?hl=en > > -- > Post: opendatakit@googlegroups.com > Unsubscribe: opendatakit+unsubscribe@googlegroups.com > Options: http://groups.google.com/group/opendatakit?hl=en

Hi Oliver,

Its true what YAW have told you, the data can be exported as CSV which is
compatible with some statistical software, currently am i'm a statistician
and uses R for my analysis. R is able to get data from Google Docs.This is
how it's done, download R using this link
http://cran.r-project.org/mirrors.html. The software is open source and
quite strong.

then install R studio which has a better user interface. press on import
data set, select from web URL, write the appspot url and the data will be
Viewed on the R script. Then change the name from app id to the name of the
data sheet. select Yes for heading, select comma for separator, decimal put
period, quote write None. ten press import. then from there you are able to
analyse the data.

Let me know if you manage this, also let me know the Statistical software
you use i try, you have indicated (SAS, Stata,.......)

Regards,
Peter Kaiga

Cell: +254 723295150.

··· On Fri, Dec 9, 2011 at 8:10 AM, Yaw Anokwa wrote:

oliver,

can you give me an example of what you call a "label"? i'm assuming in
a select prompt, it's what is shown to the user, but not the
underlying value that is saved in the form? if that is the case, why
can't you use the value?

yaw

On Wed, Dec 7, 2011 at 07:52, Olivier Kalmus o.kalmus@googlemail.com wrote:

Thanks for the replies so far. I do have that CAPI report from the
World Bank.
In the report they state that labels cannot be transferred to a
statistical software from ODK, whereas in the appendix it says
difficult but possible.
While this is not a major problem, having the possibility of exporting
the labels would be great to have.

On Dec 7, 3:02 pm, Yaw Anokwa yano...@gmail.com wrote:

also, if you are considering using odk for capi, the worldbank did an
evaluation of a bunch of mobile data collection tools (including odk)
for capi. might be good to take a look through it to understand the
tradeoffs...

http://web.worldbank.org/WBSITE/EXTERNAL/EXTDEC/EXTRESEARCH/EXTLSMS/E.
..

On Wed, Dec 7, 2011 at 08:58, Gaetano Borriello gaet...@cs.washington.edu wrote:

From ODK Aggregate you can export all your data in .csv files which
can be imported into most statistical packages.

On Wed, Dec 7, 2011 at 3:28 AM, Olivier Kalmus < o.kal...@googlemail.com> wrote:

Hi guys,

our research team is planning a panel household survey (1500
households), for which we want to use CAPI software on mobile
devices.
ODK looks very interesting. One question I couldn't find an answer to
is, is it possible to export the data to a statistical analysis
software (Stata, SAS...) with variable and value labels?

Thanks for any help

Olivier

--
Post: opendatakit@googlegroups.com
Unsubscribe: opendatakit+unsubscribe@googlegroups.com
Options:http://groups.google.com/group/opendatakit?hl=en

--
Post: opendatakit@googlegroups.com
Unsubscribe: opendatakit+unsubscribe@googlegroups.com
Options:http://groups.google.com/group/opendatakit?hl=en

--
Post: opendatakit@googlegroups.com
Unsubscribe: opendatakit+unsubscribe@googlegroups.com
Options: http://groups.google.com/group/opendatakit?hl=en

--
Post: opendatakit@googlegroups.com
Unsubscribe: opendatakit+unsubscribe@googlegroups.com
Options: http://groups.google.com/group/opendatakit?hl=en

Hey, I finally found some time to upload a sample file of what I was
looking for on the issue tracker.

Olivier

··· On Jan 6, 1:03 am, Yaw Anokwa wrote: > Gerry, > > I've filed a feature request athttp://code.google.com/p/opendatakit/issues/detail?id=495if you want > to help us specify the feature and track progress. > > Yaw > > > > > > > > On Wed, Jan 4, 2012 at 11:45, W. Brunette wrote: > > That information is stored in the xform which should be uniquely > > correlated to the answers that are output in the csv. The csv only > > represents the answers for that unique identifier. It sounds like what > > you want is a file that contains everything in one file (form > > information + csv of answers). I do not know of a tool that currently > > does that. > > > If somebody wants to write some code to do this that would be great. > > If someone does write some code send the ODK team a copy of the freely > > licensed code and we will try to work it into the tools. > > > A basic starting point is to file an issue on our website that > > requests this feature and shows how the information should appear. For > > us to implement this we would have to go research the proper formats. > > You could help us with this by filing an issue with links to the > > format specification and a couple of examples (can contain fake data) > > that show the original xform used to gather data, the original csv > >exportalong with a file that shows what the desired output format > > that combined them. The issue tracker allows you to attach files. > > > Cheers, > > Waylon > > > On Wed, Jan 4, 2012 at 9:35 AM, Gerry wrote: > >> Dear ODK developers, > > >> I am also new to ODK and I would like to thank you for the great > >> programme(s). It is a big step forward in digital data collection. > >> However, like the original poster described it, I am also severely > >> missing a possibility toexportdata to statistical programmes such as > >> Stata or R including variable labels and value labels, i.e. > >> essentially the underlying logic of the questionnaire. Before, we were > >> using pencil paper questionnaires and the free (but not open source) > >> CSPro as data entry tool. It was possible to directlyexportto Stata, > >> which was extremely handy. Any change in the questionnaire was > >> immediately and correctly reflected in the Stataexport. As any > >> digital questionnaire includes the information of (a) the question, > >> (b) variable name, (c) answer categories in words and (d) associated > >> values in numbers. As far as I understood, (a)-(c) would be lost in > >> ODK .csvexportand only (d) is kept. Is that correct? IMHO, this > >> would mean much (error-prone) duplication of work. > > >> How difficult is it to include (a)-(c) and to develop anexportto > >> statistical programmes? Has already anyone started to write anexport > >> filter to one of these statistical programmes? Thank you very much! > > >> Cheers and Happy New Year, > >> Gerry > > >> -- > >> Post: opendatakit@googlegroups.com > >> Unsubscribe: opendatakit+unsubscribe@googlegroups.com > >> Options:http://groups.google.com/group/opendatakit?hl=en > > > -- > > Post: opendatakit@googlegroups.com > > Unsubscribe: opendatakit+unsubscribe@googlegroups.com > > Options:http://groups.google.com/group/opendatakit?hl=en

Hi guys,

sorry for my late answer. I know that you can export your data to
statistical software (I am using Stata, have used SAS before) using
csv files.

What I meant with labels is what Yaw wrote: For example, the variable
label for the question whether the respondent is male or female could
be "Sex of respondent" and the value labels would be "female", "male",
while the values might be q001 and 1 and 2.

In order to prevent errors and extra work, especially with larger
questionnaires, it would be nice to have them in your statistical
software from the start. It's not an absolute necessity more for
convenience reasons. In any case one works on the values, but not
having to look up any external databases or coding sheets or having to
add labels in the statistical software (which I would do, in case the
data comes without labels) is imo a nice advantage.

Hope I could clear up a little bit, what I was looking for.

Regards

Olivier

··· On Dec 10, 9:30 am, Peter Kaiga wrote: > Hi Oliver, > > Its true what YAW have told you, the data can be exported as CSV which is > compatible with some statistical software, currently am i'm a statistician > and uses R for my analysis. R is able to get data from Google Docs.This is > how it's done, download R using this linkhttp://cran.r-project.org/mirrors.html. The software is open source and > quite strong. > > then install R studio which has a better user interface. press on import > data set, select from web URL, write the appspot url and the data will be > Viewed on the R script. Then change the name from app id to the name of the > data sheet. select Yes for heading, select comma for separator, decimal put > period, quote write None. ten press import. then from there you are able to > analyse the data. > > Let me know if you manage this, also let me know the Statistical software > you use i try, you have indicated (SAS, Stata,.......) > > Regards, > Peter Kaiga > > Cell: +254 723295150. > > > > > > > > On Fri, Dec 9, 2011 at 8:10 AM, Yaw Anokwa wrote: > > oliver, > > > can you give me an example of what you call a "label"? i'm assuming in > > a select prompt, it's what is shown to the user, but not the > > underlying value that is saved in the form? if that is the case, why > > can't you use the value? > > > yaw > > > On Wed, Dec 7, 2011 at 07:52, Olivier Kalmus wrote: > > > Thanks for the replies so far. I do have that CAPI report from the > > > World Bank. > > > In the report they state that labels cannot be transferred to a > > > statistical software from ODK, whereas in the appendix it says > > > difficult but possible. > > > While this is not a major problem, having the possibility of exporting > > > the labels would be great to have. > > > > On Dec 7, 3:02 pm, Yaw Anokwa wrote: > > >> also, if you are considering using odk for capi, the worldbank did an > > >> evaluation of a bunch of mobile data collection tools (including odk) > > >> for capi. might be good to take a look through it to understand the > > >> tradeoffs... > > > >>http://web.worldbank.org/WBSITE/EXTERNAL/EXTDEC/EXTRESEARCH/EXTLSMS/E. > > .. > > > >> On Wed, Dec 7, 2011 at 08:58, Gaetano Borriello wrote: > > >> > From ODK Aggregate you can export all your data in .csv files which > > >> > can be imported into most statistical packages. > > > >> > On Wed, Dec 7, 2011 at 3:28 AM, Olivier Kalmus < o.kal...@googlemail.com> wrote: > > >> >> Hi guys, > > > >> >> our research team is planning a panel household survey (1500 > > >> >> households), for which we want to use CAPI software on mobile > > devices. > > >> >> ODK looks very interesting. One question I couldn't find an answer to > > >> >> is, is it possible to export the data to a statistical analysis > > >> >> software (Stata, SAS...) with variable and value labels? > > > >> >> Thanks for any help > > > >> >> Olivier > > > >> >> -- > > >> >> Post: opendatakit@googlegroups.com > > >> >> Unsubscribe: opendatakit+unsubscribe@googlegroups.com > > >> >> Options:http://groups.google.com/group/opendatakit?hl=en > > > >> > -- > > >> > Post: opendatakit@googlegroups.com > > >> > Unsubscribe: opendatakit+unsubscribe@googlegroups.com > > >> > Options:http://groups.google.com/group/opendatakit?hl=en > > > > -- > > > Post: opendatakit@googlegroups.com > > > Unsubscribe: opendatakit+unsubscribe@googlegroups.com > > > Options:http://groups.google.com/group/opendatakit?hl=en > > > -- > > Post: opendatakit@googlegroups.com > > Unsubscribe: opendatakit+unsubscribe@googlegroups.com > > Options:http://groups.google.com/group/opendatakit?hl=en