**IS4IS 2019 Berkeley**

**Semantic Information G Theory with Formulas for Falsification and
Confirmation**

Abstract: The semantic information G theory is a natural generalization of Shannon's information theory. Replacing *y*_{j} in the log(.) of Shannon's mutual information (MI) formula with *θ*_{j}, a fuzzy set or a predictive model, we obtain the predictive MI formula. Using truth functions to produce likelihood functions, we obtain the semantic MI formula.
We can also obtain this formula by improving Carnap and Bar-Hillel's semantic information formula *I*_{j} = log[1/*T*(*y*_{j})], where *T*(*y*_{j}) is the logical probability of hypothesis *y*_{j}. The improved formula is *I*_{ij} = log[*T*(*y*_{j}|*x*_{i})/*T*(*y*_{j})] = log[*P*(*x*_{i}|*θ*_{j})/*P*(*x*_{i})], where *x*_{i} is an instance, *T*(*y*_{j}|*x*_{i}) = *T*(*θ*_{j}|*x*_{i}) is the fuzzy truth value of the proposition *y*_{j}(*x*_{i}), and *T*(*y*_{j}) is the average of *T*(*y*_{j}|*x*).
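As an illustration (not from the original abstract; it assumes a hypothetical discrete one-dimensional instance space and an unnormalized Gaussian truth function with made-up center and width parameters), the improved formula can be sketched as:

```python
import math

def truth(x, c, d):
    """Unnormalized Gaussian truth function T(theta_j|x) with center c and width d."""
    return math.exp(-((x - c) ** 2) / (2 * d ** 2))

def semantic_info(x_i, c, d, xs, px):
    """I_ij = log[T(y_j|x_i) / T(y_j)], where the logical probability T(y_j)
    is the average of T(y_j|x) over the source distribution P(x)."""
    T_logical = sum(p * truth(x, c, d) for x, p in zip(xs, px))
    return math.log(truth(x_i, c, d) / T_logical)

# Illustrative instance space with uniform P(x)
xs = [0, 1, 2, 3, 4]
px = [0.2] * 5

# Small deviation from the hypothesis's center -> positive information
print(semantic_info(2, 2.0, 1.0, xs, px) > 0)   # True
# Large deviation -> negative information (a wrong hypothesis)
print(semantic_info(0, 4.0, 0.5, xs, px) < 0)   # True
```

Because the Gaussian has no normalizing coefficient, the truth value at the center is 1, and the logical probability is the P(x)-weighted average of the truth values rather than a density integral.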
Using a Gaussian function without the normalizing coefficient as the truth function, we find that log *T*(*y*_{j}|*x*_{i}) reflects deviation and testability. According to this formula, the larger the deviation, the less the information; the smaller the logical probability, the larger the absolute value of the information; a wrong hypothesis conveys negative information, and a tautology or a contradiction conveys zero information. Hence, this formula accords with Popper's thought on hypothesis testing and falsification. Averaging *I*_{ij} yields the Generalized Kullback-Leibler (GKL) formula and the semantic MI formula. We can use the GKL formula and sampling distributions to optimize likelihood functions and truth functions for machine learning and induction. A hypothesis *y*_{j} held with degree of belief *b* can be treated as the mixture of *y*_{j} and a tautology, with truth function *bT*(*y*_{j}|*x*) + 1 - *b*.
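A minimal sketch of this mixture (the function and variable names are illustrative, not from the source; `truth` stands for any truth function T(y_j|x) taking values in [0, 1]):

```python
def mixed_truth(x, b, truth):
    """Truth function of y_j held with degree of belief b:
    b*T(y_j|x) + (1 - b), i.e. a mixture of y_j and a tautology."""
    return b * truth(x) + (1 - b)

# With b = 1 the original truth function is recovered;
# with b = 0 the hypothesis degenerates into a tautology (always fully true).
t = lambda x: 0.25
print(mixed_truth(0, 1.0, t))  # 0.25
print(mixed_truth(0, 0.0, t))  # 1.0
```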
Using a sampling distribution to optimize *b*, we obtain the confirmation measure *b** = [*P*(*H*|*E*) - *P*(*H*|*E*′)]/max[*P*(*H*|*E*), *P*(*H*|*E*′)] = (*CL* - *CL*′)/max(*CL*, *CL*′), where *H* = *y*_{j}, *E* and *E*′ are positive and negative instances respectively, *CL* is the confidence level, and *CL*′ = 1 - *CL*. The measure *b** has the HS symmetry suggested by Eells and Fitelson. It ensures that decreasing the number of negative examples is more important than increasing the number of positive examples, and hence it is compatible with Popper's falsification thought.
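Under the stated definitions, the measure *b** can be sketched directly from the two conditional probabilities (an illustrative helper, assuming the probabilities are given; the function name is not from the source):

```python
def b_star(p_pos, p_neg):
    """b* = [P(H|E) - P(H|E')] / max[P(H|E), P(H|E')],
    where E and E' are positive and negative instances of H = y_j."""
    return (p_pos - p_neg) / max(p_pos, p_neg)

# HS symmetry: swapping the roles of E and E' flips the sign of b*.
print(b_star(0.9, 0.3))   # ≈ 0.667
print(b_star(0.3, 0.9))   # ≈ -0.667
```

Note that *b** is bounded in [-1, 1], and dividing by the larger of the two probabilities makes the measure most sensitive to the smaller one, matching the emphasis on reducing negative examples.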

References:
https://arxiv.org/abs/1809.01577
and
https://arxiv.org/abs/1609.07827