ZHAI Dong-sheng, YANG Yang. USPTO Patent Information Extraction System Based on XML[J]. Journal of Beijing University of Technology, 2011, 37(4): 628-633.
    Citation: ZHAI Dong-sheng, YANG Yang. USPTO Patent Information Extraction System Based on XML[J]. Journal of Beijing University of Technology, 2011, 37(4): 628-633.

    USPTO Patent Information Extraction System Based on XML

    • In order to provide basic data for improving the intellectual property early warming capacity and the competitiveness of high-tech industries of Beijing,by searching the database of the United States Patent and Trademark Office,patent information in the form of dynamic pages can be gotten.Based on XML related technology,a method to extract and store patent information in local relational database is put forward in this paper.The web pages are filtered by regular expression matching,and then the document object models of the pages are cleaned.Finally the patent information is extracted by XSLT matching and stored to relation database by object mapping.The prototype of the patent extraction system is designed and implemented,which has a high recall rate and precision rate.
    • loading

    Catalog

      Turn off MathJax
      Article Contents

      /

      DownLoad:  Full-Size Img  PowerPoint
      Return
      Return