<?xml version="1.0" encoding="utf-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.3 20210610//EN" "https://jats.nlm.nih.gov/publishing/1.3/JATS-journalpublishing1-3.dtd">
<article article-type="research-article" dtd-version="1.3" xml:lang="ru">
  <front xmlns:xlink="http://www.w3.org/1999/xlink">
    <journal-meta>
      <journal-id journal-id-type="elibrary">9004</journal-id>
      <journal-title-group>
        <journal-title>Problems of information security. Computer systems</journal-title>
        <trans-title-group xml:lang="ru">
          <trans-title>Проблемы информационной безопасности. Компьютерные системы</trans-title>
        </trans-title-group>
      </journal-title-group>
      <issn pub-type="epub">2071-8217</issn>
    </journal-meta>
    <article-meta xmlns:xlink="http://www.w3.org/1999/xlink">
      <article-id pub-id-type="publisher-id">10</article-id>
      <article-id pub-id-type="doi">10.48612/jisp/2nkk-unee-uuzv</article-id>
      <title-group>
        <article-title>From exploitation to protection: analysis of methods for defending against attacks on LLMS</article-title>
        <trans-title-group xml:lang="ru">
          <trans-title>От эксплуатации к защите: анализ методов защиты от атак на языковые модели</trans-title>
        </trans-title-group>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <contrib-id contrib-id-type="orcid">0009-0005-6662-5606</contrib-id>
          <name>
            <surname>Velichko</surname>
            <given-names>Ivan</given-names>
          </name>
          <xref ref-type="aff" rid="aff1"/>
          <email>wwr0ngn4m3@gmail.com</email>
        </contrib>
        <contrib contrib-type="author">
          <contrib-id contrib-id-type="orcid">0000-0002-0924-6221</contrib-id>
          <name>
            <surname>Bezzateev</surname>
            <given-names>Sergey</given-names>
          </name>
          <xref ref-type="aff" rid="aff1"/>
          <email>sergey.bezzateev@gmail.com</email>
        </contrib>
      </contrib-group>
      <aff id="aff1">Saint Petersburg State University of Aerospace Instrumentation</aff>
      <pub-date publication-format="electronic" date-type="pub" iso-8601-date="2025-09-30">
        <day>30</day>
        <month>09</month>
        <year>2025</year>
      </pub-date>
      <issue>3</issue>
      <fpage>110</fpage>
      <lpage>120</lpage>
      <self-uri xmlns:xlink="http://www.w3.org/1999/xlink" content-type="pdf" xlink:href="https://jisp.spbstu.ru/userfiles/files/soderzhaniya/pib_3_5-6.pdf"/>
      <abstract xml:lang="en">
        <p>Modern large language models demonstrate impressive capabilities but remain vulnerable to attacks that can manipulate their behavior, extract confidential data, or bypass built-in restrictions. This paper focuses on methods for protecting language models from prompt injection attacks, which allow adversaries to exploit the system for malicious purposes. Various defense strategies are examined and analyzed, including query filtering, context isolation, training on perturbed data, and other approaches. A comparative analysis of the effectiveness of defense mechanisms is conducted, highlighting their limitations and identifying future directions for enhancing the security of language models.</p>
      </abstract>
      <kwd-group xml:lang="en">
        <kwd>Large language models</kwd>
        <kwd>artificial intelligence</kwd>
        <kwd>adversarial attacks</kwd>
        <kwd>defense methods</kwd>
        <kwd>model output manipulation</kwd>
      </kwd-group>
    </article-meta>
  </front>
</article>
