Object Representation for Lattice-Based Machine Learning

This is the fourth article in a series (links to the first, second, and third articles) devoted to "VKF-system", a machine learning system based on lattice theory. The program uses Markov chain algorithms to generate the causes of the target property by computing a random subset of similarities between groups of training objects. This article describes how objects are represented as bit strings, so that similarities can be computed by bitwise multiplication of the corresponding representations. Objects with discrete features require a technique from Formal Concept Analysis. For objects with continuous features, the approach uses logistic regression, a division of the range of values into subintervals by means of information theory, and a representation that corresponds to the convex hull of the compared intervals.






1 Discrete features



, , - . , ""/"". 'null' ( '_' ), () .



. . , .



( , ), () .





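For every finite lattice ⟨L,∧,∨⟩, let G be a (super)set of all ∧-irreducible elements and M a (super)set of all ∨-irreducible elements. For gIm⇔g≥m, the context (G,M,I) generates a lattice L(G,M,I) that is isomorphic to the original lattice ⟨L,∧,∨⟩.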



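An element x∈L of a lattice ⟨L,∧,∨⟩ is called ∨-irreducible if x≠∅ and, for all y,z∈L, y<x and z<x imply y∨z<x.

An element x∈L of a lattice ⟨L,∧,∨⟩ is called ∧-irreducible if x≠T and, for all y,z∈L, x<y and x<z imply x<y∧z.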
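
The two conditions for ∨- and ∧-irreducible elements can be checked mechanically on a small lattice. The sketch below is only an illustration: the lattice (divisors of 12 ordered by divisibility, with gcd as ∧ and lcm as ∨) and all function names are assumptions chosen for the example, not something taken from the VKF-system.

```python
from math import gcd

# Hypothetical example lattice: divisors of 12 ordered by divisibility.
# Meet is gcd, join is lcm, the bottom element is 1 and the top is 12.
ELEMENTS = [1, 2, 3, 4, 6, 12]
BOTTOM, TOP = 1, 12


def meet(y, z):
    return gcd(y, z)


def join(y, z):
    return y * z // gcd(y, z)


def less(y, x):
    """Strict lattice order: y < x iff y divides x and y != x."""
    return y != x and x % y == 0


def join_irreducible(x):
    """x is not the bottom, and y < x, z < x always give (y join z) < x."""
    below = [y for y in ELEMENTS if less(y, x)]
    return x != BOTTOM and all(less(join(y, z), x) for y in below for z in below)


def meet_irreducible(x):
    """x is not the top, and x < y, x < z always give x < (y meet z)."""
    above = [y for y in ELEMENTS if less(x, y)]
    return x != TOP and all(less(x, meet(y, z)) for y in above for z in above)


print([x for x in ELEMENTS if join_irreducible(x)])
print([x for x in ELEMENTS if meet_irreducible(x)])
```

In this toy lattice the element 6 is not ∨-irreducible because 2∨3=6, and 2 is not ∧-irreducible because 4∧6=2.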



∧- , , ∨- , .



irreducible elements



( . (L,L,≥))



G\M h i j k
a 1 1 1 0
b 0 1 1 1
c 1 1 0 0
d 1 0 1 0
f 0 1 0 1
g 0 0 1 1
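
Reading each row of the table as a bit string, the similarity of several objects is the bitwise product of their rows. The following minimal sketch assumes the attribute order h, i, j, k from the table header; the names `CONTEXT`, `bitwise_similarity` and `attributes_of` are hypothetical helpers introduced for the illustration.

```python
# Rows of the table above as bit strings, attribute order h, i, j, k.
CONTEXT = {
    "a": "1110",
    "b": "0111",
    "c": "1100",
    "d": "1010",
    "f": "0101",
    "g": "0011",
}
ATTRIBUTES = "hijk"


def bitwise_similarity(*names):
    """Bitwise product (AND) of the bit strings of the named objects."""
    bits = int("1" * len(ATTRIBUTES), 2)  # start from the all-ones string
    for name in names:
        bits &= int(CONTEXT[name], 2)
    return format(bits, "0{}b".format(len(ATTRIBUTES)))


def attributes_of(bit_string):
    """Names of the attributes whose bit is set."""
    return [m for m, bit in zip(ATTRIBUTES, bit_string) if bit == "1"]


common = bitwise_similarity("a", "b")
print(common, attributes_of(common))
```

For the rows a and b the product is 0110, that is, the two objects share the attributes i and j.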


, .



, 121 , 24 !



, :



  1. .
  2. ≥ , ( ∨- ).
  3. (∨-) .
  4. .


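These steps are implemented in the CPython libraries 'vkfencoder' (vkfencoder.XMLImport) and 'vkf' (vkf.FCA). The two differ in the data source: vkf.FCA works with MariaDB, while vkfencoder.XMLImport works with an XML file.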



2 Continuous features



. C4.5 .

, .

, , , . .



2.1



, . .



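Let E=G∪O be the union of the training examples G and the counter-examples O. For an interval [a,b)⊆R of values of an attribute V:G→R, define G[a,b)={g∈G: a≤V(g)<b}, O[a,b)={g∈O: a≤V(g)<b}, and E[a,b)={g∈E: a≤V(g)<b}.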



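The entropy of an interval [a,b)⊆R of values of V:G→R is

$$\mathrm{ent}[a,b) = -\frac{|G[a,b)|}{|E[a,b)|}\cdot\log_2\left(\frac{|G[a,b)|}{|E[a,b)|}\right) - \frac{|O[a,b)|}{|E[a,b)|}\cdot\log_2\left(\frac{|O[a,b)|}{|E[a,b)|}\right)$$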



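For a point a<r<b of an interval [a,b)⊆R of values of V:G→R, define

$$\mathrm{inf}[a,r,b) = \frac{|E[a,r)|}{|E[a,b)|}\cdot\mathrm{ent}[a,r) + \frac{|E[r,b)|}{|E[a,b)|}\cdot\mathrm{ent}[r,b).$$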
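
The quantities ent[a,b) and inf[a,r,b) are easy to compute directly from the attribute values of the two classes G and O. The sketch below is an illustration under assumptions: the function names and the sample values are invented, 0·log2(0) is taken as 0, and the rule "pick the candidate r with the smallest inf" is the usual entropy-based choice rather than a rule confirmed by the text.

```python
from math import log2


def count(values, a, b):
    """Number of values v with a <= v < b."""
    return sum(1 for v in values if a <= v < b)


def ent(g_values, o_values, a, b):
    """Entropy of the interval [a, b); 0 * log2(0) is taken as 0."""
    n_g, n_o = count(g_values, a, b), count(o_values, a, b)
    n_e = n_g + n_o
    result = 0.0
    for n in (n_g, n_o):
        if n > 0:
            result -= n / n_e * log2(n / n_e)
    return result


def inf(g_values, o_values, a, r, b):
    """Weighted entropy of the split of [a, b) at r (E[a, b) must be non-empty)."""
    n_e = count(g_values, a, b) + count(o_values, a, b)
    left = count(g_values, a, r) + count(o_values, a, r)
    right = n_e - left
    return (left / n_e * ent(g_values, o_values, a, r)
            + right / n_e * ent(g_values, o_values, r, b))


def best_split(g_values, o_values, a, b):
    """Candidate r with the smallest inf[a, r, b) (assumed selection rule)."""
    candidates = sorted(set(g_values + o_values))[1:]
    return min(candidates, key=lambda r: inf(g_values, o_values, a, r, b))


# Invented values of V on the two classes; b is chosen above max{V}.
G_VALUES = [1.0, 2.0, 3.0]
O_VALUES = [4.0, 5.0, 6.0]
print(ent(G_VALUES, O_VALUES, 1.0, 7.0))
print(best_split(G_VALUES, O_VALUES, 1.0, 7.0))
```

With these values the whole interval has entropy 1, and the split at r=4 separates the two classes completely, so inf[1,4,7)=0.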



β€” V=r .



V:G→R a=min{V} v_0, v_{l+1} , b=max{V}. {v_1<…<v_l} .



2.2



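The bit string has length 2l, where l is the number of thresholds. Its bits are defined by

$$\delta^V_i(g) = 1 \Leftrightarrow V(g) \ge v_i, \qquad \sigma^V_i(g) = 1 \Leftrightarrow V(g) < v_i,$$

where $1 \le i \le l$.

The string $\delta^V_1(g)\dots\delta^V_l(g)\,\sigma^V_1(g)\dots\sigma^V_l(g)$ represents the value of the attribute $V$ on the object $g \in E$.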



, β€” .



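If the string $\delta^{(1)}_1\dots\delta^{(1)}_l\,\sigma^{(1)}_1\dots\sigma^{(1)}_l$ represents $v_i \le V(A_1) < v_j$ and the string $\delta^{(2)}_1\dots\delta^{(2)}_l\,\sigma^{(2)}_1\dots\sigma^{(2)}_l$ represents $v_n \le V(A_2) < v_m$, then the string

$$(\delta^{(1)}_1\cdot\delta^{(2)}_1)\dots(\delta^{(1)}_l\cdot\delta^{(2)}_l)\,(\sigma^{(1)}_1\cdot\sigma^{(2)}_1)\dots(\sigma^{(1)}_l\cdot\sigma^{(2)}_l)$$

represents

$$\min\{v_i, v_n\} \le V((A_1\cup A_2)'') < \max\{v_j, v_m\}.$$

Note that the string $0\dots00\dots0$ corresponds to $\min\{V\} \le V((A_1\cup A_2)'') \le \max\{V\}$.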
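
The encoding by threshold indicators and its behaviour under bitwise multiplication can be tried out on a few numbers. This is a minimal sketch: the thresholds, the range bounds and the function names are assumptions made for the example.

```python
def encode(value, thresholds):
    """Bit string delta_1..delta_l sigma_1..sigma_l of a single value."""
    delta = [int(value >= v) for v in thresholds]
    sigma = [int(value < v) for v in thresholds]
    return delta + sigma


def bitwise_product(x, y):
    """Componentwise product of two bit strings of equal length."""
    return [p * q for p, q in zip(x, y)]


def decode(bits, thresholds, v_min, v_max):
    """Bounds of the interval that a bit string stands for.

    The lower bound is the largest threshold whose delta bit is set,
    the upper bound is the smallest threshold whose sigma bit is set;
    v_min and v_max are used when no such bit is set.
    """
    l = len(thresholds)
    delta, sigma = bits[:l], bits[l:]
    lower = max([v for v, d in zip(thresholds, delta) if d], default=v_min)
    upper = min([v for v, s in zip(thresholds, sigma) if s], default=v_max)
    return lower, upper


THRESHOLDS = [2.0, 4.0, 6.0]     # hypothetical v_1 < v_2 < v_3
V_MIN, V_MAX = 0.0, 8.0          # hypothetical min{V} and max{V}

x = encode(3.0, THRESHOLDS)      # value in [2, 4)
y = encode(5.0, THRESHOLDS)      # value in [4, 6)
print(decode(bitwise_product(x, y), THRESHOLDS, V_MIN, V_MAX))
```

The product of the codes of a value in [2,4) and a value in [4,6) decodes to the bounds (2.0, 6.0), the convex hull of the two intervals; the product of the codes of 1.0 and 7.0 is the all-zero string and decodes to the full range.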



2.3



. ( 1). . , .





$p_{i_1}\vee\dots\vee p_{i_k}$ $p_{i_1}+\dots+p_{i_k}>\sigma$ $0<\sigma<1$.



,



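A classifier is a map $c:\mathbb{R}^d\to\{0,1\}$, where $\mathbb{R}^d$ is the feature space (of dimension $d$) and $\{0,1\}$ is the set of class labels.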



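Consider a random pair $\langle\vec X, K\rangle\in\mathbb{R}^d\times\{0,1\}$ with the joint distribution

$$p_{\vec X,K}(\vec x,k) = p_{\vec X}(\vec x)\cdot p_{K\mid\vec X}(k\mid\vec x),$$

where $p_{\vec X}(\vec x)$ is the marginal density of the features and $p_{K\mid\vec X}(k\mid\vec x)$ is the conditional distribution of the label, i.e. for every $\vec x\in\mathbb{R}^d$

$$p_{K\mid\vec X}(k\mid\vec x) = P\{K=k\mid\vec X=\vec x\}.$$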



The error probability of a classifier $c:\mathbb{R}^d\to\{0,1\}$ is

$$R(c) = P\{c(\vec X)\ne K\}.$$



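The Bayes classifier $b:\mathbb{R}^d\to\{0,1\}$ with respect to $p_{K\mid\vec X}(k\mid\vec x)$ is defined by

$$b(\vec x) = 1 \Leftrightarrow p_{K\mid\vec X}(1\mid\vec x) > \frac{1}{2} > p_{K\mid\vec X}(0\mid\vec x)$$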





The classifier $b$ has the minimal error probability:

$$\forall c:\mathbb{R}^d\to\{0,1\}\ \left[R(b) = P\{b(\vec X)\ne K\}\le R(c)\right]$$





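$$\begin{aligned}
p_{K\mid\vec X}(1\mid\vec x) &= \frac{p_{\vec X\mid K}(\vec x\mid 1)\cdot P\{K=1\}}{p_{\vec X\mid K}(\vec x\mid 1)\cdot P\{K=1\} + p_{\vec X\mid K}(\vec x\mid 0)\cdot P\{K=0\}} \\
&= \frac{1}{1+\dfrac{p_{\vec X\mid K}(\vec x\mid 0)\cdot P\{K=0\}}{p_{\vec X\mid K}(\vec x\mid 1)\cdot P\{K=1\}}} = \frac{1}{1+\exp\{-a(\vec x)\}} = \sigma(a(\vec x)),
\end{aligned}$$

where $a(\vec x) = \log\dfrac{p_{\vec X\mid K}(\vec x\mid 1)\cdot P\{K=1\}}{p_{\vec X\mid K}(\vec x\mid 0)\cdot P\{K=0\}}$ and $\sigma(y) = \dfrac{1}{1+\exp\{-y\}}$ is the logistic function.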
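
The identity between the posterior probability and the logistic function of the log-odds a(x) can be verified numerically. The sketch below assumes a made-up one-dimensional model (Gaussian class-conditional densities with unit variance and a prior P{K=1}=0.3); none of these numbers come from the article.

```python
from math import exp, log, pi, sqrt

# Hypothetical 1-D model: X | K=1 ~ N(2, 1), X | K=0 ~ N(0, 1), P{K=1} = 0.3.
PRIOR = {1: 0.3, 0: 0.7}
MEAN = {1: 2.0, 0: 0.0}


def density(x, k):
    """Class-conditional density p(x | k) of the assumed Gaussian model."""
    return exp(-0.5 * (x - MEAN[k]) ** 2) / sqrt(2.0 * pi)


def posterior(x):
    """P{K=1 | X=x} computed directly from Bayes' formula."""
    numerator = density(x, 1) * PRIOR[1]
    return numerator / (numerator + density(x, 0) * PRIOR[0])


def log_odds(x):
    """a(x) = log of the ratio of the two joint densities."""
    return log(density(x, 1) * PRIOR[1] / (density(x, 0) * PRIOR[0]))


def sigmoid(y):
    return 1.0 / (1.0 + exp(-y))


def bayes_classifier(x):
    """b(x) = 1 iff the posterior of class 1 exceeds 1/2."""
    return 1 if posterior(x) > 0.5 else 0


for x in (-1.0, 0.0, 1.0, 2.0, 3.0):
    print(x, posterior(x), sigmoid(log_odds(x)), bayes_classifier(x))
```

Both columns of probabilities coincide up to rounding, and the classifier switches from 0 to 1 where a(x) changes sign.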



2.4



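Approximate $a(\vec x) = \log\dfrac{p_{\vec X\mid K}(\vec x\mid 1)\cdot P\{K=1\}}{p_{\vec X\mid K}(\vec x\mid 0)\cdot P\{K=0\}}$ by a linear combination $\vec w^T\cdot\varphi(\vec x)$ of basis functions $\varphi_i:\mathbb{R}^d\to\mathbb{R}$ ($i=1,\dots,m$) with weights $\vec w\in\mathbb{R}^m$.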



For a training sample $\langle\vec x_1,k_1\rangle,\dots,\langle\vec x_n,k_n\rangle$, set $t_j = 2k_j-1$.



$$\log\{p(t_1,\dots,t_n\mid\vec x_1,\dots,\vec x_n,\vec w)\} = -\sum_{j=1}^{n}\log\left[1+\exp\left\{-t_j\sum_{i=1}^{m}w_i\varphi_i(\vec x_j)\right\}\right].$$



,



$$L(w_1,\dots,w_m) = -\sum_{j=1}^{n}\log\left[1+\exp\left\{-t_j\sum_{i=1}^{m}w_i\varphi_i(\vec x_j)\right\}\right]\to\max$$



.



-



$$\vec w_{t+1} = \vec w_t - \left(\nabla_{\vec w^T}\nabla_{\vec w}L(\vec w_t)\right)^{-1}\cdot\nabla_{\vec w}L(\vec w_t).$$



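With

$$s_j = \frac{1}{1+\exp\{t_j\cdot(\vec w^T\cdot\Phi(\vec x_j))\}}$$

the gradient and the Hessian of $L$ are

$$\nabla L(\vec w) = \Phi^T\mathrm{diag}(t_1,\dots,t_n)\vec s, \qquad \nabla\nabla L(\vec w) = -\Phi^TR\Phi,$$

where $R = \mathrm{diag}(s_1(1-s_1),s_2(1-s_2),\dots,s_n(1-s_n))$ is the diagonal matrix with the entries $s_1(1-s_1),s_2(1-s_2),\dots,s_n(1-s_n)$, and $\mathrm{diag}(t_1,\dots,t_n)\vec s$ is the vector with the coordinates $t_1s_1,t_2s_2,\dots,t_ns_n$.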



$$\vec w_{t+1} = \vec w_t + (\Phi^TR\Phi)^{-1}\Phi^T\mathrm{diag}(t)\vec s = (\Phi^TR\Phi)^{-1}\Phi^TR\vec z,$$

where $\vec z = \Phi\vec w_t + R^{-1}\mathrm{diag}(t_1,\dots,t_n)\vec s$.



, - -



$$\vec w_{t+1} = (\Phi^TR\Phi+\lambda\cdot I)^{-1}\cdot(\Phi^TR\vec z).$$
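
The regularised update can be sketched in a few lines of plain Python. This is an illustration under stated assumptions, not the VKF-system code: the linear solver, the basis functions (a constant and x), the toy data and the value of λ are all invented. The product R·z is computed entrywise as r_j·(Φw)_j + t_j·s_j, which avoids dividing by r_j.

```python
from math import exp, log


def solve(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        tail = sum(m[i][c] * x[c] for c in range(i + 1, n))
        x[i] = (m[i][n] - tail) / m[i][i]
    return x


def irls_step(phi, t, w, lam):
    """One update w_new = (Phi^T R Phi + lam I)^{-1} Phi^T R z."""
    m = len(w)
    lhs = [[lam if i == k else 0.0 for k in range(m)] for i in range(m)]
    rhs = [0.0] * m
    for row, tj in zip(phi, t):
        act = sum(wi * xi for wi, xi in zip(w, row))   # (Phi w)_j
        s = 1.0 / (1.0 + exp(tj * act))                # s_j
        r = s * (1.0 - s)                              # j-th diagonal entry of R
        rz = r * act + tj * s                          # j-th entry of R z
        for i in range(m):
            rhs[i] += row[i] * rz
            for k in range(m):
                lhs[i][k] += row[i] * r * row[k]
    return solve(lhs, rhs)


def log_likelihood(phi, t, w):
    """L(w) = -sum_j log(1 + exp(-t_j * (Phi w)_j))."""
    return -sum(log(1.0 + exp(-tj * sum(wi * xi for wi, xi in zip(w, row))))
                for row, tj in zip(phi, t))


# Invented one-dimensional sample with labels k_j and targets t_j = 2 k_j - 1.
XS = [0.0, 1.0, 2.0, 3.0, 4.0, 5.0]
KS = [0, 0, 1, 0, 1, 1]
PHI = [[1.0, x] for x in XS]      # basis functions: constant and x
T = [2 * k - 1 for k in KS]

w = [0.0, 0.0]
for _ in range(20):
    w = irls_step(PHI, T, w, lam=1e-3)
print(w, log_likelihood(PHI, T, w))
```

Starting from zero weights, the first step with λ=0 is an ordinary least-squares fit scaled by two; further steps increase the log-likelihood of this small sample.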



"-" : 1 .



, . :



- Vk ,



$$R^2 = 1-\exp\{2(L(w_0,\dots,w_{k-1})-L(w_0,\dots,w_{k-1},w_k))/n\}\ge\sigma$$



Vk ,



$$1-\frac{L(w_0,\dots,w_{k-1},w_k)}{L(w_0,\dots,w_{k-1})}\ge\sigma$$
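
Both criteria compare the log-likelihood of the model without the weight w_k with that of the model including it. In the statistical literature, expressions of this form are commonly known as the Cox-Snell R² and the McFadden likelihood ratio index; the article's own names for them are not legible here. The sketch below uses invented log-likelihood values, and the function names and the threshold value are hypothetical.

```python
from math import exp


def exp_criterion(loglik_reduced, loglik_full, n):
    """1 - exp{2 (L_reduced - L_full) / n} for a sample of size n."""
    return 1.0 - exp(2.0 * (loglik_reduced - loglik_full) / n)


def ratio_criterion(loglik_reduced, loglik_full):
    """1 - L_full / L_reduced (both log-likelihoods are negative)."""
    return 1.0 - loglik_full / loglik_reduced


def passes(value, threshold):
    """True if the criterion reaches the chosen threshold sigma."""
    return value >= threshold


# Invented numbers: log-likelihoods without and with the extra weight.
L_REDUCED, L_FULL, N = -60.0, -45.0, 100
SIGMA = 0.2                       # hypothetical threshold

print(exp_criterion(L_REDUCED, L_FULL, N), ratio_criterion(L_REDUCED, L_FULL))
print(passes(ratio_criterion(L_REDUCED, L_FULL), SIGMA))
```

Since adding a weight cannot decrease the maximised log-likelihood, both values lie between 0 and 1, and a larger value indicates a larger gain from the added feature.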





"-" Wine Quality ( . ). . ( >7), .



( 2.3) "" "". ( ) , 0 1. " " "" .



The situation with the pair ("pH", "alcohol") was radically different: the "alcohol" weight was positive, while the "pH" weight was negative. With the help of an obvious logical transformation, however, we obtained the implication ("pH" ⇒ "alcohol").



The author would like to thank his colleagues and students for their support and incentives.



