<?xml version="1.0" encoding="utf-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.3 20210610//EN" "https://jats.nlm.nih.gov/publishing/1.3/JATS-journalpublishing1-3.dtd">
<article article-type="research-article" dtd-version="1.3" xml:lang="en">
  <front xmlns:xlink="http://www.w3.org/1999/xlink">
    <journal-meta>
      <journal-id journal-id-type="elibrary">9004</journal-id>
      <journal-title-group>
        <journal-title>Problems of information security. Computer systems</journal-title>
        <trans-title-group xml:lang="ru">
          <trans-title>Проблемы информационной безопасности. Компьютерные системы</trans-title>
        </trans-title-group>
      </journal-title-group>
      <issn pub-type="epub">2071-8217</issn>
    </journal-meta>
    <article-meta xmlns:xlink="http://www.w3.org/1999/xlink">
      <article-id pub-id-type="publisher-id">5</article-id>
      <article-id pub-id-type="doi">10.48612/jisp/ruar-u6he-kmd4</article-id>
      <title-group>
        <article-title>Analysis of decompiled program code using abstract syntax trees</article-title>
        <trans-title-group xml:lang="ru">
          <trans-title>Анализ восстановленного программного кода с использованием абстрактных синтаксических деревьев</trans-title>
        </trans-title-group>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <name>
            <surname>Gribkov</surname>
            <given-names>Nikita</given-names>
          </name>
          <xref ref-type="aff" rid="aff1"/>
          <email>gribkov.na@edu.spbstu.ru</email>
        </contrib>
        <contrib contrib-type="author">
          <contrib-id contrib-id-type="orcid">0000-0002-2009-5460</contrib-id>
          <name>
            <surname>Ovasapyan</surname>
            <given-names>Tigran</given-names>
          </name>
          <xref ref-type="aff" rid="aff1"/>
          <email>otd@ibks.spbstu.ru</email>
        </contrib>
        <contrib contrib-type="author">
          <name>
            <surname>Moskvin</surname>
            <given-names>Dmitry</given-names>
          </name>
          <xref ref-type="aff" rid="aff1"/>
          <email>moskvin_da@spbstu.ru</email>
        </contrib>
      </contrib-group>
      <aff id="aff1">Peter the Great St. Petersburg Polytechnic University</aff>
      <pub-date publication-format="electronic" date-type="pub" iso-8601-date="2023-06-08">
        <day>08</day>
        <month>06</month>
        <year>2023</year>
      </pub-date>
      <issue>2</issue>
      <fpage>47</fpage>
      <lpage>60</lpage>
      <self-uri xmlns:xlink="http://www.w3.org/1999/xlink" content-type="pdf" xlink:href="https://jisp.spbstu.ru/userfiles/files/2023_2.pdf"/>
      <abstract xml:lang="en">
        <p>The paper proposes a method of preprocessing fragments of binary code for the task of detection their similarity using machine learning algorithms. The method is based on analysis of pseudocode, which is retrieved from decompilation process. The pseudocode is preprocessed with usage of attributed abstract syntax trees. Evaluation of the method indicates its efficiency in binary code similarity detection task due to semantic vectors used for abstract syntax tree modification.</p>
      </abstract>
      <kwd-group xml:lang="en">
        <kwd>code clones</kwd>
        <kwd>syntactic similarity</kwd>
        <kwd>semantic similarity</kwd>
        <kwd>binary code similarity</kwd>
        <kwd>abstract syntax tree</kwd>
        <kwd>pseudocode</kwd>
      </kwd-group>
    </article-meta>
  </front>
</article>
