<?xml version="1.0" encoding="utf-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.3 20210610//EN" "https://jats.nlm.nih.gov/publishing/1.3/JATS-journalpublishing1-3.dtd">
<article article-type="research-article" dtd-version="1.3" xml:lang="en">
  <front xmlns:xlink="http://www.w3.org/1999/xlink">
    <journal-meta>
      <journal-id journal-id-type="elibrary">9004</journal-id>
      <journal-title-group>
        <journal-title>Problems of information security. Computer systems</journal-title>
        <trans-title-group xml:lang="ru">
          <trans-title>Проблемы информационной безопасности. Компьютерные системы</trans-title>
        </trans-title-group>
      </journal-title-group>
      <issn pub-type="epub">2071-8217</issn>
    </journal-meta>
    <article-meta xmlns:xlink="http://www.w3.org/1999/xlink">
      <article-id pub-id-type="publisher-id">4</article-id>
      <article-id pub-id-type="doi">10.48612/jisp/mbvv-n1u7-z7be</article-id>
      <title-group>
        <article-title>From exploitation to protection: a deep dive into adversarial attacks on LLMS</article-title>
        <trans-title-group xml:lang="ru">
          <trans-title>От эксплуатации к защите: анализ атак на большие языковые модели</trans-title>
        </trans-title-group>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author">
          <contrib-id contrib-id-type="orcid">0009-0005-6662-5606</contrib-id>
          <name>
            <surname>Velichko</surname>
            <given-names>Ivan</given-names>
          </name>
          <xref ref-type="aff" rid="aff1"/>
          <email>wwr0ngn4m3@gmail.com</email>
        </contrib>
        <contrib contrib-type="author">
          <contrib-id contrib-id-type="orcid">0000-0002-0924-6221</contrib-id>
          <name>
            <surname>Bezzateev</surname>
            <given-names>Sergey</given-names>
          </name>
          <xref ref-type="aff" rid="aff1"/>
          <email>sergey.bezzateev@gmail.com</email>
        </contrib>
      </contrib-group>
      <aff id="aff1">Saint Petersburg State University of Aerospace Instrumentation</aff>
      <pub-date publication-format="electronic" date-type="pub" iso-8601-date="2025-03-25">
        <day>25</day>
        <month>03</month>
        <year>2025</year>
      </pub-date>
      <issue>1</issue>
      <fpage>43</fpage>
      <lpage>58</lpage>
      <self-uri xmlns:xlink="http://www.w3.org/1999/xlink" content-type="pdf" xlink:href="https://jisp.spbstu.ru/userfiles/files/2025_1-7-8.pdf"/>
      <abstract xml:lang="en">
        <p>Modern large language models possess impressive capabilities but remain vulnerable to various attacks that can manipulate their responses, lead to leakage of confidential data, or bypass restrictions. This paper focuses on the analysis of prompt injection attacks, which allow bypassing model constraints, extracting hidden data, or forcing the model to follow malicious instructions.</p>
      </abstract>
      <kwd-group xml:lang="en">
        <kwd>Large language models</kwd>
        <kwd>artificial intelligence</kwd>
        <kwd>adversarial attacks</kwd>
        <kwd>defense methods</kwd>
        <kwd>model output manipulation</kwd>
      </kwd-group>
    </article-meta>
  </front>
</article>
