Lately in my research, I have been focusing on correlations between behavioral data from a large dataset. As is known, correlation is an expression of how well the linear relationship between two sets of data, that is to say, how they are related under the simple linear model. To give an example, for researchers, number of papers and their salary are well correlated. Thus, you can use a researcher’ number of papers to predict his/her salary. It is important to note that ‘correlation does NOT imply causation‘.