This post is devoted with the quick (and free) solution of a problem that maybe some statisticians face in their jobs. Maps are becoming a valuable tool of information and you will find that R could be a comprehensive solution for this type of task.You will need to distinguish two types of data: the cartography … Sigue leyendo Making maps in R with ggplot
Etiqueta: R
Writing Books with R and knitr
I am writing this post on behalf people who, like me, do not find any valuable stuff in the web, when trying to compile big documents in R with the knitr library. knitr is a basic tool for the statistician. It combines LaTeX and R in an single environment. It is useful to create elegant reports, … Sigue leyendo Writing Books with R and knitr
Removing data from Workspace in RStudio (for MAC users)
In order to remove the data saved (maybe by mistake) in your workspace from RStudio, you gotta find the route in the first message that you can see when opening RStudio. For example, in my case I can read the following sentence when opening the software: Workspace loaded from ~/.RData.Then, open the terminal and write the … Sigue leyendo Removing data from Workspace in RStudio (for MAC users)
Selecting a Real Sample for Electoral Studies
What is the sample size required to achieve a particular margin of error in electoral studies? Professor Leonardo Bautista found that at least 15.000 people should be interviewed, distributed in 6.200 blocks, 80 municipalities and 4 strata. That is a lot of people! — In average, 188 persons per municipality. — Now, do the math. If a single … Sigue leyendo Selecting a Real Sample for Electoral Studies
Regression discontinuity plots with R (using ggplot2)
In public policy evaluation sometimes we use a regression discontinuity analysis in order to estimate the impact of an intervention.Long story short, we suppose that there is a threshold (real value of a design variable) for which we can divide the population of interest into two groups. So, for those individuals that have values higher … Sigue leyendo Regression discontinuity plots with R (using ggplot2)
LaTeX for Statisticians (Our course in USTA)
What do successful statisticians do in real life? I think that success relies in effective communication. All of the time, we are in broadcast mode: our ideas, results and analysis yield to decision making and it is up to us to find compelling ways to seed our beliefs in those statistical analyses into the mind … Sigue leyendo LaTeX for Statisticians (Our course in USTA)
Common support graphics in propensity score matching (using ggplot2)
Common support is a must check when doing impact evaluation of public policies (or - more widely - when verifying the causation of some factors over a population). The seminal paper of Rosenbaum and Rubin (1983) defines the propensity score as a measure of balance. Moreover, when doing matching, we must assure that propensity scores … Sigue leyendo Common support graphics in propensity score matching (using ggplot2)
Proper Inference in Public Policy Evaluation
I know, I know... Did you miss me alot? I did miss you.This time I am going to write about a the role of the statistician in Public Policy evaluation. This have captured my attention during the last four years. This is what I do in my consulting at DNP. It is not easy but it … Sigue leyendo Proper Inference in Public Policy Evaluation
Propensity Score Matching in R (My first post in blogger)
My first post in blogger is about Propensity Score Matching (PSM) in R. In this post I will introduce the R packages MatchIt and Zelig. The first one is devoted to perform different PSM algorithms and the other one is used in order to estimate the average treatment effect (ATE) and the effect of the … Sigue leyendo Propensity Score Matching in R (My first post in blogger)
Our Talk in Lima
In this talk, we want to introduce the study of Small Area Estimation (SAE) by means of demographic and some simple statistical methodologies. The example of interest is the estimation of the size of Chocó (one of the poorest departments in Colombia). Enjoy it!
My talk in Cartagena
In these slides you will find my talk about design-unbiased estimation of gross flows that take into account the complex sample along with a two-stage Markov Chain model to describe the nonresponse process.
Our poster in CLAPEM
In this poster we introduce the bayesian modeling of the Gini coefficient per locality in Bogotá by using a Beta regressión. R-codes are available after requesting to the first author. Enjoy it!