<?xml version="1.0" encoding="utf-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.3 20210610//EN" "https://jats.nlm.nih.gov/publishing/1.3/JATS-journalpublishing1-3.dtd">
<article article-type="research-article" dtd-version="1.3" xml:lang="ru">
  <front xmlns:xlink="http://www.w3.org/1999/xlink">
    <journal-meta>
      <journal-id journal-id-type="elibrary">9004</journal-id>
      <journal-title-group>
        <journal-title>Problems of information security. Computer systems</journal-title>
        <trans-title-group xml:lang="ru">
          <trans-title>Проблемы информационной безопасности. Компьютерные системы</trans-title>
        </trans-title-group>
      </journal-title-group>
      <issn pub-type="epub">2071-8217</issn>
    </journal-meta>
    <article-meta xmlns:xlink="http://www.w3.org/1999/xlink">
      <article-id pub-id-type="publisher-id">13</article-id>
      <article-id pub-id-type="doi">10.48612/jisp/693e-m24n-96zh</article-id>
      <title-group>
        <article-title>Optimization of data obfuscation in big data processing and storage systems</article-title>
        <trans-title-group xml:lang="ru">
          <trans-title>Оптимизация обфускации данных в системах обработки и хранения больших данных</trans-title>
        </trans-title-group>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <contrib-id contrib-id-type="orcid">0000-0001-9659-1244</contrib-id>
          <name>
            <surname>Poltavtseva</surname>
            <given-names>Maria</given-names>
          </name>
          <xref ref-type="aff" rid="aff1"/>
          <email>potavtseva@ibks.spbstu.ru</email>
        </contrib>
      </contrib-group>
      <aff id="aff1">Peter the Great St. Petersburg Polytechnic University</aff>
      <pub-date publication-format="electronic" date-type="pub" iso-8601-date="2025-09-30">
        <day>30</day>
        <month>09</month>
        <year>2025</year>
      </pub-date>
      <issue>3</issue>
      <fpage>165</fpage>
      <lpage>179</lpage>
      <self-uri xmlns:xlink="http://www.w3.org/1999/xlink" content-type="pdf" xlink:href="https://jisp.spbstu.ru/userfiles/files/soderzhaniya/pib_3_5-6.pdf"/>
      <abstract xml:lang="en">
        <p>The paper is devoted to the task of reducing the attack surface from an internal attacker in heterogeneous big data processing and storage systems by choosing the optimal method of data obfuscation based on anonymization (depersonalization) technologies. The paper analyzes terminology and systematizes data hiding methods to reduce the attack surface in big data processing and storage systems. A formal formulation of the problem of finding the optimal method of data obfuscation and an algorithm for solving it over various types of datasets are proposed, taking into account evaluation criteria specific to each class of methods. The implementation of a software prototype to support decision-making and the choice of the optimal method for solving practical problems is described, experimental approbation and analysis of its results are carried out.</p>
      </abstract>
      <kwd-group xml:lang="en">
        <kwd>Big data security</kwd>
        <kwd>big data management systems</kwd>
        <kwd>data privacy</kwd>
        <kwd>data obfuscation</kwd>
        <kwd>data anonymization</kwd>
      </kwd-group>
    </article-meta>
  </front>
</article>
