
Using the Malawi household data, this exploratory analysis found a positive correlation between wealth and education level (b6). The education level was recoded on the assumption that “Std” levels correspond to primary education. Diversity of diet was also positively correlated with education level. However, there was an unexpected negative (unfavorable) relationship between education level and micronutrient status: households with higher education levels were deficient in more micronutrients. This relationship was more pronounced when the analysis was restricted to households that were not deficient in calories and protein.

# Data loading code removed (user-specific file path)
library(ggplot2) # required for the plots below
t.m.nutr <- read.csv(fp.mwi.nutr)

# Columns flagging a household-level nutrient deficiency relative to the required level
req.names <- names(t.m.nutr)[grep("req_hh",names(t.m.nutr))]
# Micronutrient columns only; assumes the first two entries of req.names are calories and protein
req.names.micro <- req.names[3:14]
##  [1] "def_iron_req_hh"     "def_zinc_req_hh"     "def_vita_A_req_hh"  
##  [4] "def_vita_E_req_hh"   "def_vita_C_req_hh"   "def_ribof_req_hh"   
##  [7] "def_thia_req_hh"     "def_nia_req_hh"      "def_vita_B6_req_hh" 
## [10] "def_fol_req_hh"      "def_calcium_req_hh"  "def_vita_B12_req_hh"
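# Count, per household, how many nutrients are flagged as deficient
for (i in 1:nrow(t.m.nutr)){
  t.m.nutr[i,"defCountReq"] <- sum(t.m.nutr[i,req.names] == "Yes" )
}
# Same count, restricted to the micronutrient columns
for (i in 1:nrow(t.m.nutr)){
  t.m.nutr[i,"defCountReqMicro"] <- sum(t.m.nutr[i,req.names.micro] == "Yes")
}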
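The row-by-row counting can also be written without a loop. The following is a minimal sketch on an invented toy data frame, not the original analysis code: the names `toy`, `req_cols` and `micro_cols` are hypothetical, and the two calorie/protein column names are assumed from the naming pattern of the other columns.

```r
# Toy data; column names mimic the naming pattern above, values are invented
toy <- data.frame(
  def_kcal_req_hh = c("No", "Yes", "No"),   # assumed column name
  def_prot_req_hh = c("No", "Yes", "No"),   # assumed column name
  def_iron_req_hh = c("Yes", "Yes", "No"),
  def_zinc_req_hh = c("No", "Yes", "No"),
  stringsAsFactors = FALSE
)

# All deficiency flags, then the subset that excludes calories and protein
req_cols <- grep("req_hh", names(toy), value = TRUE)
micro_cols <- setdiff(req_cols, c("def_kcal_req_hh", "def_prot_req_hh"))

# Comparing the data frame to "Yes" gives a logical matrix;
# rowSums() then counts the TRUE values per household
toy$defCountReq <- rowSums(toy[, req_cols, drop = FALSE] == "Yes")
toy$defCountReqMicro <- rowSums(toy[, micro_cols, drop = FALSE] == "Yes")

print(toy[, c("defCountReq", "defCountReqMicro")])
```

Selecting the micronutrient columns by name rather than by position (`3:14`) avoids depending on the column order of the input file.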

A positive relationship between household wealth and education level (b6)

#Education -> wealth
options("scipen"=100, "digits"=4) # Avoid scientific notation; print up to 4 significant digits
plot <- ggplot() + theme(panel.background = element_rect(fill=0,colour="black"), axis.ticks=element_blank(), panel.grid = element_blank()) 
plot <- plot + theme(axis.text.x = element_text(angle=45, hjust =1))
plot <- plot + geom_boxplot(data=t.m.nutr,aes(b6,wealth))
plot <- plot + labs(title=paste("Wealth and Education Level",sep="")) #+ theme(plot.title = element_text(size = rel(2), colour = "blue"))
plot <- plot + xlab("Education Level (b6)") # x-axis title
plot <- plot + ylab("Wealth") # y-axis title

Recoding education level

The education level was recoded into three groups to show a clearer trend:
1=primary (no school and all “Std” levels)
2=secondary (“Form” levels)
3=tertiary (“College” and “Diploma” levels)

#Recode education level
t.m.nutr$edLvl <- NA # initialize first so that unmatched rows stay NA
rowi.2 <- grep("Form",t.m.nutr$b6)
rowi.1 <- grep("Std|No",t.m.nutr$b6)
rowi.3 <- grep("College|Diploma",t.m.nutr$b6)
t.m.nutr$edLvl[rowi.1] <- 1 # primary
t.m.nutr$edLvl[rowi.2] <- 2 # secondary
t.m.nutr$edLvl[rowi.3] <- 3 # tertiary
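The recoding rule can be checked on a handful of labels before applying it to the full data. This is a minimal sketch only: `recode_education` is a hypothetical helper, and the labels in `toy_b6` are invented, since the actual spellings in `b6` are not shown here.

```r
# Hypothetical helper (not part of the original analysis):
# map raw b6 labels to 1 = primary, 2 = secondary, 3 = tertiary
recode_education <- function(b6) {
  b6 <- as.character(b6)
  lvl <- rep(NA_integer_, length(b6))       # unmatched or missing labels stay NA
  lvl[grepl("Std|No", b6)] <- 1L            # primary, including no schooling
  lvl[grepl("Form", b6)] <- 2L              # secondary
  lvl[grepl("College|Diploma", b6)] <- 3L   # tertiary
  lvl
}

# Invented example labels, for illustration only
toy_b6 <- c("Std 3", "Form 2", "No school", "Diploma", "College", NA)
toy_lvl <- recode_education(toy_b6)

print(table(toy_lvl, useNA = "ifany"))
```

Tabulating with `useNA = "ifany"` makes any label that none of the patterns matched visible.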

#Using the new variable against wealth
plot <- ggplot() + theme(panel.background = element_rect(fill=0,colour="black"), axis.ticks=element_blank(), panel.grid = element_blank()) 
plot <- plot + theme(axis.text.x = element_text(angle=45, hjust =1))
plot <- plot + geom_boxplot(data=t.m.nutr,aes(factor(edLvl),wealth))
plot <- plot + xlab("Education Level (recoded)") # x-axis title
plot <- plot + ylab("Wealth") # y-axis title

##      No Yes
##   1 771 204
##   2 105  54
##   3   7   1
##      No Yes
##   1 613 362
##   2  73  86
##   3   4   4
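Raw counts like those in the two tables above are hard to compare across rows because the education groups differ greatly in size. As a sketch, the counts can be re-entered by hand and converted to row proportions. The object names `tab1` and `tab2` and the dimension labels are assumptions, since the output does not show which variables were cross-tabulated.

```r
# Counts copied from the two tables above; dimension names are assumed labels
tab1 <- matrix(c(771, 204, 105, 54, 7, 1), nrow = 3, byrow = TRUE,
               dimnames = list(edLvl = c("1", "2", "3"), flag = c("No", "Yes")))
tab2 <- matrix(c(613, 362, 73, 86, 4, 4), nrow = 3, byrow = TRUE,
               dimnames = list(edLvl = c("1", "2", "3"), flag = c("No", "Yes")))

# Share of "No" and "Yes" within each education level (each row sums to 1)
share1 <- prop.table(tab1, margin = 1)
share2 <- prop.table(tab2, margin = 1)

print(round(share1, 3))
print(round(share2, 3))
```

With only 8 households in level 3, the proportions in that row are very sensitive to a single household.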

A positive relationship between education level and diversity of diet (Shannon index and Simpson index)

# Diversity of food
plot <- ggplot() + theme(panel.background = element_rect(fill=0,colour="black"), axis.ticks=element_blank(), panel.grid = element_blank()) 
plot <- plot + theme(axis.text.x = element_text(angle=45, hjust =1))
plot <- plot + xlab("Education Level (recoded)") # x-axis title
plot <- plot + ylab("Diversity of diet (Shannon index)") # y-axis title
plot <- plot + geom_boxplot(data=t.m.nutr,aes(factor(edLvl),shannon_exp))

plot <- ggplot() + theme(panel.background = element_rect(fill=0,colour="black"), axis.ticks=element_blank(), panel.grid = element_blank()) 
plot <- plot + theme(axis.text.x = element_text(angle=45, hjust =1))
plot <- plot + geom_boxplot(data=t.m.nutr,aes(factor(edLvl),simpson_exp))
plot <- plot + xlab("Education Level (recoded)") # x-axis title
plot <- plot + ylab("Diversity of diet (Simpson index)") # y-axis title

An unexpected negative (unfavorable) relationship was noticed between education level and the number of deficient micronutrients

The higher the education level of a household, the more micronutrients it is deficient in.

t.m.nutr.lite <- t.m.nutr[which(t.m.nutr$def_kcal_rec_hh == "No" & t.m.nutr$def_prot_rec_hh == "No"),]

# Deficiency
plot <- ggplot() + theme(panel.background = element_rect(fill=0,colour="black"), axis.ticks=element_blank(), panel.grid = element_blank()) 
plot <- plot + theme(axis.text.x = element_text(angle=45, hjust =1))
plot <- plot + geom_boxplot(data=t.m.nutr,aes(factor(edLvl),defCountReqMicro))
plot <- plot + xlab("Education Level") # x-axis title
plot <- plot + ylab("Number of deficient micronutrients") # y-axis title
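A boxplot shows the pattern visually; a rank-based statistic can put a number on its direction and strength. The sketch below is not part of the original analysis. It uses an invented toy data frame with the same column names, and Spearman correlation and the Kruskal-Wallis test are swapped in here as one reasonable choice for an ordinal predictor.

```r
# Invented toy data with the same column names as the analysis data
toy <- data.frame(
  edLvl = c(1, 1, 1, 2, 2, 3),
  defCountReqMicro = c(2, 3, 1, 4, 3, 5)
)

# Spearman rank correlation: the sign gives the direction of the association
rho <- cor(toy$edLvl, toy$defCountReqMicro, method = "spearman")
print(rho)

# Kruskal-Wallis test: do the deficiency counts differ across education groups?
toy$edLvl_f <- factor(toy$edLvl)
kw <- kruskal.test(defCountReqMicro ~ edLvl_f, data = toy)
print(kw$p.value)
```

On the real data the same two calls would use `t.m.nutr` in place of `toy`. With very few households in the tertiary group, any test result for that group should be read with caution.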

The negative relationship becomes more prominent at the tertiary education level after restricting the data to households that were not deficient in calories and protein

# Only households that are not deficient in calories and protein
plot <- ggplot() + theme(panel.background = element_rect(fill=0,colour="black"), axis.ticks=element_blank(), panel.grid = element_blank()) 
plot <- plot + theme(axis.text.x = element_text(angle=45, hjust =1))
plot <- plot + geom_boxplot(data=t.m.nutr.lite,aes(factor(edLvl),defCountReqMicro))
plot <- plot + labs(title=paste("N=723/1142 (Households not deficient in kcal and protein)",sep="")) #+ theme(plot.title = element_text(size = rel(2), colour = "blue"))
plot <- plot + xlab("Education Level") # x-axis title
plot <- plot + ylab("Number of deficient micronutrients") # y-axis title

Total number of households:

## [1] 1142

Number of households after restricting to those that were not deficient in calories and protein:

## [1] 723

Note that the number of households with a tertiary level of education is small:

##   1   2   3 
## 638  81   4
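The three counts above were printed without the code that produced them. One plausible way to obtain such numbers is sketched below on an invented toy data frame. The object names `toy`, `keep` and `toy_lite` are hypothetical, and the filter mirrors the `def_kcal_rec_hh` / `def_prot_rec_hh` condition used earlier.

```r
# Invented toy data; column names follow the filter used earlier in the analysis
toy <- data.frame(
  def_kcal_rec_hh = c("No", "Yes", "No", "No", "Yes"),
  def_prot_rec_hh = c("No", "No", "Yes", "No", "Yes"),
  edLvl = c(1, 1, 2, 2, 3)
)

# Keep only households deficient in neither calories nor protein
keep <- toy$def_kcal_rec_hh == "No" & toy$def_prot_rec_hh == "No"
toy_lite <- toy[which(keep), ]

n_total <- nrow(toy)       # all households
n_lite <- nrow(toy_lite)   # households kept by the filter

# Households per education level; fixing the levels keeps empty groups visible
lvl_counts <- table(factor(toy_lite$edLvl, levels = 1:3))

print(n_total)
print(n_lite)
print(lvl_counts)
```

Fixing the factor levels to `1:3` means a group with zero households still appears in the table, which makes a small or empty tertiary group easy to spot.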