Association

Association

What is Association?

Association is a relationship between two random variables which makes them statistically dependent. It refers to rather a general relationship without specifics of the relationship being mentioned, and it is not necessary to be a causal relationship.


Measure of association is called correlation. When someone says that causation was found, this means that the researchers found that changes in one variable they measured directly caused changes in the other.



Above Figure illustrate the idea behind the Correlation and Causation.
Let consider the following example.
It shows that a Vocabulary test score out of 10 and the participator’s Age, Nationality (American or Not), Gender, Years of and Education. So that need to find the relationship among the quantitative (Vocab-Score, Age and Years of Education). 





 .corr() method in pandas package helps to measure the correlation between two variables. 




According to the above Figure Vocab score and Years of education has around 48% of relationship. 




More over that how to find relationship for a two text....?





Idea of distance between two words is based on the likeness of their meaning or semantic content as opposed to similarity which can be estimated regarding their syntactical representation (e.g. their string format). These are mathematical tools used to estimate the strength of the semantic relationship between units of language, concepts or instances, through a numerical description obtained according to the comparison of information supporting their meaning or describing their nature.
In python there is method call .similarity () in SpaCy package is used to identify the similarity among two texts.


Let’s look at the following example.

What is the relationship between following texts?
Dog, Cat, Rabbit, Oliver, Take Care and Environment friendly

1)    Importing packages.

2)    Texts and Similarity matrix

3)    Visualization of Similarity using Heat map. 


Comments