StatcanR, an R package to get all data from the new Statistics Canada Web Data Service

An Introduction.

Romain Le Duc https://sites.socialdatasciencelab.org/team/cvRomainLeDuc.html (IBM (Montréal, CA))https://www.ibm.com/ca-fr , Thierry Warin https://www.nuance-r.com/principalInvestigator.html (SKEMA Business School (Raleigh, NC))https://www.skemagloballab.io
2019-09-08

Introduction

The incredible acceleration of connected people across the world as well as the low-price accessibility to an amazing calculation power are created an explosive growth of data available on the World Wide Web. From now on, the World Wide Web and web data services, such as API, become essential for technological firms and government statistic department. These conditions place the World Wide Web as the largest publicly accessible data-bank in the world. Therefore, it presents an unprecedented opportunity to get a very large amount of data, apply data mining and econometrics in order to accelerate knowledge discovery.

In May 2018, according to this incredible acceleration, Statistics Canada has launched its Web Data Service. This Web Data Service provides an access to data and metadata that Statistics Canada release every day from different sources. This is a good option for users who want to consume a discrete amount of data points updates to Statistics Canada data.

Regarding to the Statistics Canada Web Data Service launch in May 2018, an existing R package used by a large number of people has failed. Therefore, face to this problem, the Social Data Science Lab team created a replacement function of CANSIM2R package, named statcanR, in order to still provide a efficient way to get statistics data table with the help of advanced analytical skills and thus still provide support in terms of data collection allowing to produce social and economics analysis of Canada at 3 different geographical granularity level, such as country of Canada, Canadian provinces as well as Canadian metropolitan areas.

Citation

For attribution, please cite this work as

Duc & Warin (2019, Sept. 8). SKEMA Global Lab in AI: StatcanR, an R package to get all data from the new Statistics Canada Web Data Service. Retrieved from https://skemagloballab.io/research.html/posts/2019-08-21-statcanr-an-r-package/

BibTeX citation

@misc{duc2019statcanr,,
  author = {Duc, Romain Le and Warin, Thierry},
  title = {SKEMA Global Lab in AI: StatcanR, an R package to get all data from the new Statistics Canada Web Data Service},
  url = {https://skemagloballab.io/research.html/posts/2019-08-21-statcanr-an-r-package/},
  year = {2019}
}