%0 Journal Article %A TAN Peng-xu %A ZHANG Lai-shun %T Information extraction using tree automata inference technique %D 2010 %R 10.3778/j.issn.1002-8331.2010.16.045 %J Computer Engineering and Applications %P 153-156 %V 46 %N 16 %X This paper proposes an information extraction method based on an improved k-contextual tree automata inference algorithm.The key idea is to transform (semi-) structured documents into tree,creating unranked tree automata which can accept the tree and extract data according to the unranked tree automata state of acceptance and rejection,using an advanced k-contextual tree language,which is called KLH tree language.The method makes full use of the tree structure of the web document and combines the method based on web structure with grammar inference.Experimental results show that the approach with tree automata inference is favorable against some other approach in the learning time and extraction time. %U http://cea.ceaj.org/EN/10.3778/j.issn.1002-8331.2010.16.045