{"id":35590,"date":"2025-03-05T11:46:10","date_gmt":"2025-03-05T10:46:10","guid":{"rendered":"https:\/\/risc.web-email.at\/fachbeitraege\/my-data-my-language-model\/"},"modified":"2026-03-10T14:24:35","modified_gmt":"2026-03-10T13:24:35","slug":"my-data-my-language-model","status":"publish","type":"publication","link":"https:\/\/risc.web-email.at\/en\/technicalarticles\/my-data-my-language-model\/","title":{"rendered":"My data My language model"},"content":{"rendered":"\n<h2 class=\"wp-block-heading\">How secure chatting with sensitive company data becomes possible<\/h2>\n\n<p class=\"has-medium-font-size\">by Lukas Fallmann and Sandra Wartner, MSc<\/p>\n\n<p><em>Artificial intelligence (AI) is not only revolutionizing everyday life, but is also unfolding its enormous potential in the corporate environment. Workflows can be improved through the (partial) automation of tasks such as the creation of documents. Customer service can also be optimized through AI-supported personalization. The provision of data insights by AI systems also helps to optimize processes. Large language models (LLMs) such as ChatGPT offer a wide range of promising applications for this. At the same time, however, concerns about the handling of sensitive information (e.g. company secrets or personal data) often remain an obstacle to the widespread acceptance of these technologies. The issue of data security is usually opaque or non-existent with large providers.      <\/em><\/p>\n\n<div style=\"height:100px\" aria-hidden=\"true\" class=\"wp-block-spacer\"><\/div>\n\n<div class=\"wp-block-media-text has-media-on-the-right is-stacked-on-mobile is-image-fill-element\"><div class=\"wp-block-media-text__content\">\n<div style=\"height:100px\" aria-hidden=\"true\" class=\"wp-block-spacer\"><\/div>\n\n\n\n<h3 class=\"wp-block-heading\">Table of contents<\/h3>\n\n\n\n<ul class=\"wp-block-list\">\n<li>My data, my rules: Data protection and transparency in the use of LLMs<\/li>\n\n\n\n<li>Specific information for specific questions &#8211; Retrieval Augmentation Generation (RAG)\n<ul class=\"wp-block-list\">\n<li>1) Fine tuning<\/li>\n\n\n\n<li>2) Retrieval Augmented Generation<\/li>\n<\/ul>\n<\/li>\n\n\n\n<li>Evaluation of RAG systems<\/li>\n\n\n\n<li>Outlook &amp; conclusion<\/li>\n\n\n\n<li>References<\/li>\n\n\n\n<li>Read more<\/li>\n\n\n\n<li>Authors<\/li>\n<\/ul>\n<\/div><figure class=\"wp-block-media-text__media\"><img decoding=\"async\" width=\"1024\" height=\"1024\" src=\"https:\/\/risc.web-email.at\/app\/uploads\/2024\/09\/cstaub_Visualization_of_a_language_model_security_internal_co_d438bb2f-f2de-40a1-8cff-031f52e1b5a2_2-1-1.png\" alt=\"\" class=\"wp-image-31881 size-full\" style=\"object-position:50% 50%\" srcset=\"https:\/\/risc.web-email.at\/app\/uploads\/2024\/09\/cstaub_Visualization_of_a_language_model_security_internal_co_d438bb2f-f2de-40a1-8cff-031f52e1b5a2_2-1-1.png 1024w, https:\/\/risc.web-email.at\/app\/uploads\/2024\/09\/cstaub_Visualization_of_a_language_model_security_internal_co_d438bb2f-f2de-40a1-8cff-031f52e1b5a2_2-1-1-300x300.png 300w, https:\/\/risc.web-email.at\/app\/uploads\/2024\/09\/cstaub_Visualization_of_a_language_model_security_internal_co_d438bb2f-f2de-40a1-8cff-031f52e1b5a2_2-1-1-150x150.png 150w, https:\/\/risc.web-email.at\/app\/uploads\/2024\/09\/cstaub_Visualization_of_a_language_model_security_internal_co_d438bb2f-f2de-40a1-8cff-031f52e1b5a2_2-1-1-768x768.png 768w\" sizes=\"(max-width: 1024px) 100vw, 1024px\" \/><\/figure><\/div>\n\n<div style=\"height:100px\" aria-hidden=\"true\" class=\"wp-block-spacer\"><\/div>\n\n<p>The rapid advances in AI have now also significantly improved the quality of publicly available model alternatives. Companies can now also use powerful models securely and specifically with company-owned data in a specially selected infrastructure. These new possibilities also ensure transparent and trustworthy use that prioritizes the protection of sensitive information. In this article, you can find out how this is possible at a technical level and what is important.   <\/p>\n<div class=\"wp-block-group-container alignfull \">\n<div class=\"wp-block-group is-layout-constrained wp-block-group-is-layout-constrained\">\n<h3 class=\"wp-block-heading\">My data, my rules: Data protection and transparency in the use of LLMs<\/h3>\n\n\n\n<p>In general, a distinction can be made between open-source and closed-source models. As the name suggests, closed-source LLMs are models in which the implementation details are not publicly accessible. This has the direct consequence that it is not clearly traceable whether and how the queries and model responses entered are further used by LLM providers. However, the advantage of closed-source applications is that business processes can be improved quickly with little to no development effort.   <\/p>\n\n\n\n<p>In contrast, open source models such as those from <a href=\"https:\/\/mistral.ai\/\" target=\"_blank\" rel=\"noreferrer noopener\">Mistral AI<\/a> offer far more transparency. Essential model information can be viewed and is therefore easier to understand. With information on the architecture and model weights (in which the learned knowledge is encoded), the model can also be further developed (trained) with company-specific data. This allows its performance to be improved for a specific use case, language or domain. The use of open source models offers many advantages in this respect, but this goes hand in hand with increased effort and the necessary technical expertise. Such models often have to be hosted in a special infrastructure with sufficient computing resources and the support effort is significantly higher.     <\/p>\n\n\n\n<p>The following table shows a comparison of selected functions:<\/p>\n\n\n\n<figure class=\"wp-block-table is-style-stripes\"><table class=\"has-fixed-layout\"><thead><tr><th>Function<\/th><th>Closed Source<\/th><th>Open Source<\/th><\/tr><\/thead><tbody><tr><td>Transparency of model information (e.g. architecture, model weights, training data)<\/td><td>Not available<\/td><td>Publicly accessible<\/td><\/tr><tr><td>Data protection<\/td><td>Depending on the security precautions and logging mechanisms of the providers<\/td><td>Full transparency and control over data security thanks to specially implemented security measures<\/td><\/tr><tr><td>Personalization<\/td><td>Limited customization options (often restricted by prompt engineering)<\/td><td>Can be adapted as required (e.g. through further model training)<\/td><\/tr><tr><td>Development effort<\/td><td>Usually low setup and configuration effort<\/td><td>Higher initial outlay, may have to be hosted yourself<\/td><\/tr><tr><td>Costs<\/td><td>Ongoing license costs (mostly pay-per-use)<\/td><td>No ongoing license costs, but (acquisition) costs in the form of infrastructure or computing resources (on premise, cloud GPUs,&#8230;)<\/td><\/tr><tr><td>Examples<\/td><td>ChatGPT (OpenAI), Gemini (Google)<\/td><td>Mistral or Mixtral (Mistral-AI), Phi3 (Microsoft), Llama3 (Meta-AI)<\/td><\/tr><\/tbody><\/table><\/figure>\n<\/div>\n<\/div><div class=\"wp-block-group-container alignfull \">\n<div class=\"wp-block-group is-layout-constrained wp-block-group-is-layout-constrained\">\n<h3 class=\"wp-block-heading\">Specific information for specific questions &#8211; Retrieval Augmentation Generation (RAG)<\/h3>\n\n\n\n<p>LLMs are excellent at processing questions and tasks. However, native models such as <a href=\"https:\/\/chat.openai.com\/\" target=\"_blank\" rel=\"noreferrer noopener\">ChatGPT<\/a> or <a href=\"https:\/\/gemini.google.com\/?hl=de\" target=\"_blank\" rel=\"noreferrer noopener\">Gemini<\/a> usually reach their limits when it comes to answering company-specific questions. LLMs are limited to the information with which they have been trained. There are two different approaches to enable the retrieval of company-specific information.   <\/p>\n\n\n\n<h3 class=\"wp-block-heading\">1) Fine tuning<\/h3>\n\n\n\n<p>The computationally intensive option is to further train an existing open source model with internal company information (finetuning). Finetuning allows the model to incorporate domain-specific knowledge, such as medical terms, into its vocabulary. However, this is costly and inflexible, as new information can only be added by retraining and is particularly problematic if information changes frequently.  <\/p>\n\n\n\n<h3 class=\"wp-block-heading\">2) Retrieval Augmented Generation<\/h3>\n\n\n\n<p>More flexibility can be achieved using &#8220;Retrieval Augmented Generation&#8221; (RAG for short) [1], whereby this approach stores the knowledge to be integrated in an external data store that can be adapted at any time. Based on a query to the LLM, the relevant information is extracted from the knowledge base in a preliminary step (&#8220;retrieval&#8221;) and made available to the LLM as context (&#8220;augmented&#8221;) to answer the question (&#8220;generation&#8221;). For this process, it is not necessary for the LLM to have been trained with the data in advance.  <\/p>\n\n\n\n<p>The knowledge base is built up via the <em>indexing <\/em>process (see also Fig. 1). During indexing, each individual document (e.g. text, HTML, PDF files) is loaded (&#8220;load&#8221;) and prepared in such a way that smaller, semantically coherent sub-documents are created (&#8220;split&#8221;). Next, a machine-readable numerical representation (a vector or so-called &#8220;embedding&#8221;) of the partial documents is created (&#8220;embed&#8221;) using a separate machine learning model. This &#8220;embedding&#8221; contains relevant, machine-readable information and is stored in a special vector database (&#8220;store&#8221;).   <\/p>\n\n\n\n<figure data-wp-context=\"{&quot;imageId&quot;:&quot;69de5e433fbf2&quot;}\" data-wp-interactive=\"core\/image\" class=\"wp-block-image aligncenter size-large is-resized wp-lightbox-container\"><img decoding=\"async\" width=\"1024\" height=\"724\" data-wp-class--hide=\"state.isContentHidden\" data-wp-class--show=\"state.isContentVisible\" data-wp-init=\"callbacks.setButtonStyles\" data-wp-on-async--click=\"actions.showLightbox\" data-wp-on-async--load=\"callbacks.setButtonStyles\" data-wp-on-async-window--resize=\"callbacks.setButtonStyles\" sizes=\"(max-width: 1024px) 100vw, 1024px\" src=\"https:\/\/risc.web-email.at\/app\/uploads\/2024\/09\/Indexing-1024x724.jpg\" alt=\"\" class=\"wp-image-31883\" style=\"width:704px;height:auto\" srcset=\"https:\/\/risc.web-email.at\/app\/uploads\/2024\/09\/Indexing-1024x724.jpg 1024w, https:\/\/risc.web-email.at\/app\/uploads\/2024\/09\/Indexing-300x212.jpg 300w, https:\/\/risc.web-email.at\/app\/uploads\/2024\/09\/Indexing-768x543.jpg 768w, https:\/\/risc.web-email.at\/app\/uploads\/2024\/09\/Indexing-1536x1086.jpg 1536w, https:\/\/risc.web-email.at\/app\/uploads\/2024\/09\/Indexing.jpg 1920w\" \/><button\n\t\t\tclass=\"lightbox-trigger\"\n\t\t\ttype=\"button\"\n\t\t\taria-haspopup=\"dialog\"\n\t\t\taria-label=\"Enlarge\"\n\t\t\tdata-wp-init=\"callbacks.initTriggerButton\"\n\t\t\tdata-wp-on-async--click=\"actions.showLightbox\"\n\t\t\tdata-wp-style--right=\"state.imageButtonRight\"\n\t\t\tdata-wp-style--top=\"state.imageButtonTop\"\n\t\t>\n\t\t\t<svg xmlns=\"http:\/\/www.w3.org\/2000\/svg\" width=\"12\" height=\"12\" fill=\"none\" viewBox=\"0 0 12 12\">\n\t\t\t\t<path fill=\"#fff\" d=\"M2 0a2 2 0 0 0-2 2v2h1.5V2a.5.5 0 0 1 .5-.5h2V0H2Zm2 10.5H2a.5.5 0 0 1-.5-.5V8H0v2a2 2 0 0 0 2 2h2v-1.5ZM8 12v-1.5h2a.5.5 0 0 0 .5-.5V8H12v2a2 2 0 0 1-2 2H8Zm2-12a2 2 0 0 1 2 2v2h-1.5V2a.5.5 0 0 0-.5-.5H8V0h2Z\" \/>\n\t\t\t<\/svg>\n\t\t<\/button><\/figure>\n\n\n\n<p><strong>Figure 1:<\/strong> When a question is submitted to the RAG system, it must also be converted into a machine-readable embedding. With the help of algorithms for calculating similarities between the question embedding and the document embeddings, the most similar documents can be quickly searched for in the knowledge database with little computational effort, the corresponding original texts can be automatically inserted into the prompt and the LLM can be called for context-based answer generation (see also Figure 2). <\/p>\n\n\n\n<figure data-wp-context=\"{&quot;imageId&quot;:&quot;69de5e4340333&quot;}\" data-wp-interactive=\"core\/image\" class=\"wp-block-image aligncenter size-large is-resized wp-lightbox-container\"><img decoding=\"async\" width=\"1024\" height=\"724\" data-wp-class--hide=\"state.isContentHidden\" data-wp-class--show=\"state.isContentVisible\" data-wp-init=\"callbacks.setButtonStyles\" data-wp-on-async--click=\"actions.showLightbox\" data-wp-on-async--load=\"callbacks.setButtonStyles\" data-wp-on-async-window--resize=\"callbacks.setButtonStyles\" sizes=\"(max-width: 1024px) 100vw, 1024px\" src=\"https:\/\/risc.web-email.at\/app\/uploads\/2024\/09\/Retrieval-and-generation-1024x724.jpg\" alt=\"\" class=\"wp-image-31885\" style=\"width:588px;height:auto\" srcset=\"https:\/\/risc.web-email.at\/app\/uploads\/2024\/09\/Retrieval-and-generation-1024x724.jpg 1024w, https:\/\/risc.web-email.at\/app\/uploads\/2024\/09\/Retrieval-and-generation-300x212.jpg 300w, https:\/\/risc.web-email.at\/app\/uploads\/2024\/09\/Retrieval-and-generation-768x543.jpg 768w, https:\/\/risc.web-email.at\/app\/uploads\/2024\/09\/Retrieval-and-generation-1536x1086.jpg 1536w, https:\/\/risc.web-email.at\/app\/uploads\/2024\/09\/Retrieval-and-generation.jpg 1920w\" \/><button\n\t\t\tclass=\"lightbox-trigger\"\n\t\t\ttype=\"button\"\n\t\t\taria-haspopup=\"dialog\"\n\t\t\taria-label=\"Enlarge\"\n\t\t\tdata-wp-init=\"callbacks.initTriggerButton\"\n\t\t\tdata-wp-on-async--click=\"actions.showLightbox\"\n\t\t\tdata-wp-style--right=\"state.imageButtonRight\"\n\t\t\tdata-wp-style--top=\"state.imageButtonTop\"\n\t\t>\n\t\t\t<svg xmlns=\"http:\/\/www.w3.org\/2000\/svg\" width=\"12\" height=\"12\" fill=\"none\" viewBox=\"0 0 12 12\">\n\t\t\t\t<path fill=\"#fff\" d=\"M2 0a2 2 0 0 0-2 2v2h1.5V2a.5.5 0 0 1 .5-.5h2V0H2Zm2 10.5H2a.5.5 0 0 1-.5-.5V8H0v2a2 2 0 0 0 2 2h2v-1.5ZM8 12v-1.5h2a.5.5 0 0 0 .5-.5V8H12v2a2 2 0 0 1-2 2H8Zm2-12a2 2 0 0 1 2 2v2h-1.5V2a.5.5 0 0 0-.5-.5H8V0h2Z\" \/>\n\t\t\t<\/svg>\n\t\t<\/button><\/figure>\n\n\n\n<p><strong>Figure 2: <\/strong>The implementation of such a system requires a great deal of initial effort, as there are numerous adjustments to be made in order to find the optimal functionality, such as the model itself, different data sources, languages or the integration of tables or images. However, the effort often pays off very quickly and offers optimal starting conditions for further applications, such as a company-internal Q&amp;A bot. The knowledge base is formed by company-relevant documents, which can be kept synchronized with the vector database so that the Q&amp;A bot is always up to date with the latest knowledge.    <\/p>\n<\/div>\n<\/div><div class=\"wp-block-group-container alignfull \">\n<div class=\"wp-block-group is-layout-constrained wp-block-group-is-layout-constrained\">\n<h3 class=\"wp-block-heading\">Evaluation of RAG systems<\/h3>\n\n\n\n<p>It is in the nature of language that different formulations can have the same core statement, which also makes the automatic comparison of texts very difficult. Another challenge lies in being able to quantitatively compare individual RAG configurations with each other. Also, since the answers generated by the LLM are rarely clearly wrong or correct, a standardized evaluation method with clearly defined metrics is required.  <\/p>\n\n\n\n<figure class=\"wp-block-table is-style-stripes\"><table class=\"has-fixed-layout\"><thead><tr><th><strong>Question<\/strong><\/th><th><strong>LLM answer<\/strong><\/th><th><strong>Actual answer<\/strong><\/th><th class=\"has-text-align-center\" data-align=\"center\"><strong>Answer Correctness<\/strong><\/th><\/tr><\/thead><tbody><tr><td>Who is the CEO of RISC Software GmbH?<\/td><td>Wolfgang Freiseisen is CEO of RISC Software GmbH.<\/td><td>Wolfgang Freiseisen is CEO of RISC Software GmbH.<\/td><td class=\"has-text-align-center\" data-align=\"center\">1.0<\/td><\/tr><tr><td>Who is the CEO of RISC Software GmbH?<\/td><td>Max Mustermann is CEO of RISC Software.<\/td><td>Wolfgang Freiseisen is CEO of RISC Software GmbH.<\/td><td class=\"has-text-align-center\" data-align=\"center\">0.16<\/td><\/tr><tr><td>Who is the CEO of RISC Software GmbH?<\/td><td>Wolfgang Freiseisen is Managing Director of RISC Software GmbH.<\/td><td>Wolfgang Freiseisen is CEO of RISC Software GmbH.<\/td><td class=\"has-text-align-center\" data-align=\"center\">0.99<\/td><\/tr><\/tbody><\/table><\/figure>\n<\/div>\n<\/div><div class=\"wp-block-group-container alignfull \">\n<div class=\"wp-block-group is-layout-constrained wp-block-group-is-layout-constrained\">\n<h3 class=\"wp-block-heading\">Outlook &amp; conclusion<\/h3>\n\n\n\n<p>RAG can be used to implement a number of company-specific use cases. As with all AI-based systems, the quality of the answers generated in RAG systems is heavily dependent on the configuration and the quality of the data provided. Clean data quality management pays off, as incorrect information or contradictions in the knowledge database often not only lead to poorer quality model responses, but also often creep unnoticed into other processes (see also [3] and [4]).  <\/p>\n\n\n\n<p>Knowledge can also be extended to other sources of information with the help of so-called &#8220;agents&#8221;. Agents are advanced AI systems that have been developed for the creation of complex texts that require sequential reasoning. Agents can intelligently forward specific tasks to other expert systems and interfaces that are most suitable for a specific query or task. For example, the query &#8220;Search the web for RISC software&#8221; could be forwarded to a web search component or a query to solve a complex mathematical problem could be forwarded directly to a specially trained model. Agents can also be used in combination with evaluation tools such as Ragas to regenerate low-quality answers. Such tools will make it even easier to optimize business processes in the future.  <\/p>\n<\/div>\n<\/div><div class=\"wp-block-group-container alignfull \">\n<div class=\"wp-block-group is-layout-constrained wp-block-group-is-layout-constrained\">\n<h3 class=\"wp-block-heading\">References<\/h3>\n\n\n\n<p>[1] <a href=\"https:\/\/arxiv.org\/abs\/2312.10997\" target=\"_blank\" rel=\"noopener\">https:\/\/arxiv.org\/abs\/2312.10997<\/a> Gao, Yunfan, et al. &#8220;Retrieval-augmented generation for large language models: A survey.&#8221; arXiv preprint arXiv:2312.10997 (2023).<\/p>\n\n\n\n<p>[2] <a href=\"https:\/\/arxiv.org\/abs\/2309.15217\" target=\"_blank\" rel=\"noopener\">https:\/\/arxiv.org\/abs\/2309.15217<\/a> Es, Shahul, et al. &#8220;Ragas: Automated evaluation of retrieval augmented generation.&#8221; arXiv preprint arXiv:2309.15217 (2023).<\/p>\n\n\n\n<p>[3] <a href=\"https:\/\/risc.web-email.at\/en\/technicalarticles\/technical-article-data-quality\/\">https:\/\/risc.web-email.at\/fachbeitraege\/fachbeitrag-datenqualitaet\/<\/a><\/p>\n\n\n\n<p>[4] <a href=\"https:\/\/risc.web-email.at\/en\/technicalarticles\/technical-article-data-quality-in-practice\/\">https:\/\/risc.web-email.at\/fachbeitraege\/fachbeitrag-datenqualitaet-in-der-praxis\/<\/a><\/p>\n<\/div>\n<\/div>\n<div class=\"wp-block-columns is-layout-flex wp-container-core-columns-is-layout-28f84493 wp-block-columns-is-layout-flex\">\n<div class=\"wp-block-column is-layout-flow wp-block-column-is-layout-flow\">\n<h2 class=\"wp-block-heading\">Ansprechperson<\/h2>\n\n\n\n<div class=\"wp-block-contact-form-7-contact-form-selector\">\n<div class=\"wpcf7 no-js\" id=\"wpcf7-f663-o1\" lang=\"en-US\" dir=\"ltr\" data-wpcf7-id=\"663\">\n<div class=\"screen-reader-response\"><p role=\"status\" aria-live=\"polite\" aria-atomic=\"true\"><\/p> <ul><\/ul><\/div>\n<form action=\"\/en\/wp-json\/wp\/v2\/publication\/35590#wpcf7-f663-o1\" method=\"post\" class=\"wpcf7-form init\" aria-label=\"Contact form\" novalidate=\"novalidate\" data-status=\"init\">\n<fieldset class=\"hidden-fields-container\"><input type=\"hidden\" name=\"_wpcf7\" value=\"663\" \/><input type=\"hidden\" name=\"_wpcf7_version\" value=\"6.1.5\" \/><input type=\"hidden\" name=\"_wpcf7_locale\" value=\"en_US\" \/><input type=\"hidden\" name=\"_wpcf7_unit_tag\" value=\"wpcf7-f663-o1\" \/><input type=\"hidden\" name=\"_wpcf7_container_post\" value=\"0\" \/><input type=\"hidden\" name=\"_wpcf7_posted_data_hash\" value=\"\" \/>\n<\/fieldset>\n<div class=\"form-row\">\n\t<div class=\"form-input\">\n\t\t<p><label class=\"sr-only\" for=\"your-name\">Your name <\/label><br \/>\n<span class=\"wpcf7-form-control-wrap\" data-name=\"your-name\"><input size=\"40\" maxlength=\"400\" class=\"wpcf7-form-control wpcf7-text wpcf7-validates-as-required\" id=\"your-name\" aria-required=\"true\" aria-invalid=\"false\" placeholder=\"Name\" value=\"\" type=\"text\" name=\"your-name\" \/><\/span>\n\t\t<\/p>\n\t<\/div>\n\t<div class=\"form-input\">\n\t\t<p><label class=\"sr-only\" for=\"your-email\">Your email<\/label><br \/>\n<span class=\"wpcf7-form-control-wrap\" data-name=\"your-email\"><input size=\"40\" maxlength=\"400\" class=\"wpcf7-form-control wpcf7-email wpcf7-validates-as-required wpcf7-text wpcf7-validates-as-email\" id=\"your-email\" aria-required=\"true\" aria-invalid=\"false\" placeholder=\"E-Mail\" value=\"\" type=\"email\" name=\"your-email\" \/><\/span>\n\t\t<\/p>\n\t<\/div>\n<\/div>\n<div class=\"form-row\">\n\t<div class=\"form-input\">\n\t\t<p><label class=\"sr-only\" for=\"your-company\">Company <\/label><br \/>\n<span class=\"wpcf7-form-control-wrap\" data-name=\"your-company\"><input size=\"40\" maxlength=\"400\" class=\"wpcf7-form-control wpcf7-text\" id=\"your-company\" aria-invalid=\"false\" placeholder=\"Unternehmen\" value=\"\" type=\"text\" name=\"your-company\" \/><\/span>\n\t\t<\/p>\n\t<\/div>\n\t<div class=\"form-input\">\n\t\t<p><label class=\"sr-only\" for=\"your-position\">Position<\/label><br \/>\n<span class=\"wpcf7-form-control-wrap\" data-name=\"your-position\"><input size=\"40\" maxlength=\"400\" class=\"wpcf7-form-control wpcf7-text\" aria-invalid=\"false\" placeholder=\"Position\" value=\"\" type=\"text\" name=\"your-position\" \/><\/span>\n\t\t<\/p>\n\t<\/div>\n<\/div>\n<div class=\"form-row\">\n\t<div class=\"form-input\">\n\t\t<p><label class=\"sr-only\" for=\"your-subject\"> Subject <\/label><br \/>\n<span class=\"wpcf7-form-control-wrap\" data-name=\"your-subject\"><input size=\"40\" maxlength=\"400\" class=\"wpcf7-form-control wpcf7-text wpcf7-validates-as-required\" id=\"your-subject\" aria-required=\"true\" aria-invalid=\"false\" placeholder=\"Thema\" value=\"\" type=\"text\" name=\"your-subject\" \/><\/span>\n\t\t<\/p>\n\t<\/div>\n<\/div>\n<p><span id=\"wpcf7-69de5e4348a5d-wrapper\" class=\"wpcf7-form-control-wrap phone-95-wrap\" style=\"display:none !important; visibility:hidden !important;\"><label for=\"wpcf7-69de5e4348a5d-field\" class=\"hp-message\">Please leave this field empty.<\/label><input id=\"wpcf7-69de5e4348a5d-field\"  class=\"wpcf7-form-control wpcf7-text\" type=\"text\" name=\"phone-95\" value=\"\" size=\"40\" tabindex=\"-1\" autocomplete=\"new-password\" \/><\/span><br \/>\n<label class=\"sr-only\" for=\"your-message\"> Your message (optional)<\/label><br \/>\n<span class=\"wpcf7-form-control-wrap\" data-name=\"your-message\"><textarea cols=\"40\" rows=\"10\" maxlength=\"2000\" class=\"wpcf7-form-control wpcf7-textarea\" id=\"your-message\" aria-invalid=\"false\" placeholder=\"Ihre Nachricht an uns\" name=\"your-message\"><\/textarea><\/span><br \/>\n<span class=\"wpcf7-form-control-wrap\" data-name=\"hcap-cf7\">\t\t<input\n\t\t\t\ttype=\"hidden\"\n\t\t\t\tclass=\"hcaptcha-widget-id\"\n\t\t\t\tname=\"hcaptcha-widget-id\"\n\t\t\t\tvalue=\"eyJzb3VyY2UiOlsiY29udGFjdC1mb3JtLTdcL3dwLWNvbnRhY3QtZm9ybS03LnBocCJdLCJmb3JtX2lkIjo4fQ==-87b48f281889db60dae4720b9dc28d55\">\n\t\t\t\t<span id=\"hcap_cf7-69de5e4348f579.53120690\" class=\"wpcf7-form-control h-captcha \"\n\t\t\tdata-sitekey=\"3a6a81c1-2b2e-4b2a-b1eb-d9446bc09afb\"\n\t\t\tdata-theme=\"light\"\n\t\t\tdata-size=\"normal\"\n\t\t\tdata-auto=\"false\"\n\t\t\tdata-ajax=\"false\"\n\t\t\tdata-force=\"false\">\n\t\t<\/span>\n\t\t<input type=\"hidden\" id=\"_wpnonce\" name=\"_wpnonce\" value=\"c96e028190\" \/><input type=\"hidden\" name=\"_wp_http_referer\" value=\"\/en\/wp-json\/wp\/v2\/publication\/35590\" \/><\/span><input class=\"wpcf7-form-control wpcf7-submit has-spinner btn\" type=\"submit\" value=\"Senden\" \/>\n<\/p><div class=\"wpcf7-response-output\" aria-hidden=\"true\"><\/div>\n<\/form>\n<\/div>\n<\/div>\n<\/div>\n\n\n\n<div class=\"wp-block-column is-layout-flow wp-block-column-is-layout-flow\">\n<h2 class=\"wp-block-heading\">Authors<\/h2>\n\n\n<div class=\"contact-person\">\n      <picture>\n      \n      \n      \n      \n      <img decoding=\"async\" data-aos=\"fade-zoom-in\"\n           data-aos-offset=\"0\" class=\"w-full\" width=\"212\" height=\"293\"\n           src=\"https:\/\/risc.web-email.at\/app\/uploads\/2024\/09\/Portraitfoto_lfallman_freigestellt.jpg\"\n           alt=\"\">\n    <\/picture>\n    \n\n<h5 class=\"wp-block-heading\">Lukas Fallmann<\/h5>\n\n\n\n<p>Data Scientist<\/p>\n\n  <\/div>\n<\/div>\n\n\n\n<div class=\"wp-block-column is-layout-flow wp-block-column-is-layout-flow\">\n<div style=\"height:100px\" aria-hidden=\"true\" class=\"wp-block-spacer\"><\/div>\n\n\n<div class=\"contact-person\">\n      <picture>\n      \n      \n      \n      \n      <img decoding=\"async\" data-aos=\"fade-zoom-in\"\n           data-aos-offset=\"0\" class=\"w-full\" width=\"212\" height=\"293\"\n           src=\"https:\/\/risc.web-email.at\/app\/uploads\/2023\/06\/swartner1-removebg-preview.png\"\n           alt=\"\">\n    <\/picture>\n    \n\n<h5 class=\"wp-block-heading\">Sandra Wartner, MSc<\/h5>\n\n\n\n<p>Data Scientist<\/p>\n\n  <\/div>\n<\/div>\n<\/div>\n<div class=\"wp-block-group-container alignfull \">\n<div class=\"wp-block-group alignwide is-layout-constrained wp-block-group-is-layout-constrained\">\n<h2 class=\"wp-block-heading\">Read more<\/h2>\n\n\n<div class=\"posts-slider-block\" data-aos=\"fade-up\" data-aos-offset=\"0\" data-aos-anchor-placement=\"top-bottom\">\n        <section class=\"splide posts-slider\" aria-label=\"Gallery Slides\">\n            <div class=\"splide__arrows\">\n                <button class=\"splide__arrow splide__arrow--prev\">\n                    <span class=\"sr-only\">Previous<\/span>\n                    <img decoding=\"async\" loading=\"lazy\" width=\"25\" height=\"21\" src=\"https:\/\/risc.web-email.at\/app\/themes\/risc-theme\/public\/images\/icon-arrow.35d2ec.svg\"\n                         alt=\"Previous\">\n                <\/button>\n                <button class=\"splide__arrow splide__arrow--next\">\n                    <span class=\"sr-only\">Next<\/span>\n                    <img decoding=\"async\" loading=\"lazy\" width=\"25\" height=\"21\" src=\"https:\/\/risc.web-email.at\/app\/themes\/risc-theme\/public\/images\/icon-arrow.35d2ec.svg\"\n                         alt=\"Next\">\n                <\/button>\n            <\/div>\n            <div class=\"inner\">\n                <div class=\"splide__track\">\n                    <div class=\"splide__list\">\n\n                                                    <a href=\"https:\/\/risc.web-email.at\/en\/technicalarticles\/technical-article-natural-language-processing-1\/\" class=\"splide__slide blog-post-teaser mb-1 lg:mb-3\">\n                                <div class=\"blog-image\">\n                                                                                                                                <picture>\n                                                                                        <img decoding=\"async\" src=\"https:\/\/risc.web-email.at\/app\/uploads\/2023\/06\/shutterstock_1348453082-360x214.jpg\"\n                                                 alt=\"\u201cOK Google: What is Natural Language Processing?\u201d\">\n                                        <\/picture>\n                                                                    <\/div>\n                                <div class=\"blog-content px-2 py-3 xl:px-4 xl:py-5\">\n                                    <h3>\u201cOK Google: What is Natural Language Processing?\u201d<\/h3>\n                                    <div class=\"blog-post-excerpt mt-2\">\n                                        Natural Language Processing makes it possible to read, decode and understand human language by machine. Speech assistants, spelling correctors, email spam filters &#8211; NLP as a technology is omnipresent and already hides behind many processes and software applications deeply embedded in our everyday lives.\n                                    <\/div>\n                                    <span class=\"inline-block mt-2 more\">mehr erfahren <span class=\"ml-1 icon-more\"><\/span><\/span>\n\n                                <\/div>\n                            <\/a>\n                                                    <a href=\"https:\/\/risc.web-email.at\/en\/technicalarticles\/technical-article-transformer-models-conquer-natural-language-processing\/\" class=\"splide__slide blog-post-teaser mb-1 lg:mb-3\">\n                                <div class=\"blog-image\">\n                                                                                                                                <picture>\n                                                                                        <img decoding=\"async\" src=\"https:\/\/risc.web-email.at\/app\/uploads\/2023\/06\/shutterstock_2066538857-1-360x214.jpg\"\n                                                 alt=\"Transformer models conquer Natural Language Processing\">\n                                        <\/picture>\n                                                                    <\/div>\n                                <div class=\"blog-content px-2 py-3 xl:px-4 xl:py-5\">\n                                    <h3>Transformer models conquer Natural Language Processing<\/h3>\n                                    <div class=\"blog-post-excerpt mt-2\">\n                                        Progress in the field of Natural Language Processing is rapid and seems unstoppable. Between terms like Transformer, Transfer-Learning, BERT and T5 it is not always easy to keep up with the times.\n                                    <\/div>\n                                    <span class=\"inline-block mt-2 more\">mehr erfahren <span class=\"ml-1 icon-more\"><\/span><\/span>\n\n                                <\/div>\n                            <\/a>\n                                                    <a href=\"https:\/\/risc.web-email.at\/en\/technicalarticles\/technical-article-explainable-artificial-intelligence\/\" class=\"splide__slide blog-post-teaser mb-1 lg:mb-3\">\n                                <div class=\"blog-image\">\n                                                                                                                                <picture>\n                                                                                        <img decoding=\"async\" src=\"https:\/\/risc.web-email.at\/app\/uploads\/2023\/07\/iStock-1227226348-1-360x214.jpg\"\n                                                 alt=\"Explainable Artificial Intelligence (XAI): How Machine Learning Predictions Become Interpretable\">\n                                        <\/picture>\n                                                                    <\/div>\n                                <div class=\"blog-content px-2 py-3 xl:px-4 xl:py-5\">\n                                    <h3>Explainable Artificial Intelligence (XAI): How Machine Learning Predictions Become Interpretable<\/h3>\n                                    <div class=\"blog-post-excerpt mt-2\">\n                                        Artificial intelligence is due to artificial neural networks that are modeled on the human brain. In certain situations, it is necessary to be able to explain the decision-making basis of the models from the field of deep learning for predictions.\n                                    <\/div>\n                                    <span class=\"inline-block mt-2 more\">mehr erfahren <span class=\"ml-1 icon-more\"><\/span><\/span>\n\n                                <\/div>\n                            <\/a>\n                                                    <a href=\"https:\/\/risc.web-email.at\/en\/technicalarticles\/technical-article-natural-language-processing-3\/\" class=\"splide__slide blog-post-teaser mb-1 lg:mb-3\">\n                                <div class=\"blog-image\">\n                                                                                                                                <picture>\n                                                                                        <img decoding=\"async\" src=\"https:\/\/risc.web-email.at\/app\/uploads\/2023\/06\/2022-11-30-NLU-360x214.jpg\"\n                                                 alt=\"With Natural Language Understanding (NLU) from text chaos to knowledge gain\">\n                                        <\/picture>\n                                                                    <\/div>\n                                <div class=\"blog-content px-2 py-3 xl:px-4 xl:py-5\">\n                                    <h3>With Natural Language Understanding (NLU) from text chaos to knowledge gain<\/h3>\n                                    <div class=\"blog-post-excerpt mt-2\">\n                                        Regardless of whether fault messages in manufacturing processes are to be analyzed, doctor&#039;s letters are to be filed in a structured manner, or products are to be suggested automatically, Natural Language Understanding (NLU) offers a broad spectrum of industry-specific and cross-industry applications for gaining knowledge from unstructured text data.\n                                    <\/div>\n                                    <span class=\"inline-block mt-2 more\">mehr erfahren <span class=\"ml-1 icon-more\"><\/span><\/span>\n\n                                <\/div>\n                            <\/a>\n                                                    <a href=\"https:\/\/risc.web-email.at\/en\/technicalarticles\/technical-article-revolution-of-language-models-chatgpt\/\" class=\"splide__slide blog-post-teaser mb-1 lg:mb-3\">\n                                <div class=\"blog-image\">\n                                                                                                                                <picture>\n                                                                                        <img decoding=\"async\" src=\"https:\/\/risc.web-email.at\/app\/uploads\/2023\/06\/AdobeStock_566449319-360x214.jpg\"\n                                                 alt=\"(R)Evolution of Language Models - ChatGPT\">\n                                        <\/picture>\n                                                                    <\/div>\n                                <div class=\"blog-content px-2 py-3 xl:px-4 xl:py-5\">\n                                    <h3>(R)Evolution of Language Models &#8211; ChatGPT<\/h3>\n                                    <div class=\"blog-post-excerpt mt-2\">\n                                        Chat with an Artificial Intelligence &#8211; what the revolutionary language model ChatGPT is all about and what opportunities and risks its use entails.\n                                    <\/div>\n                                    <span class=\"inline-block mt-2 more\">mehr erfahren <span class=\"ml-1 icon-more\"><\/span><\/span>\n\n                                <\/div>\n                            <\/a>\n                                            <\/div>\n                <\/div>\n            <\/div>\n        <\/section>\n    <\/div>\n<\/div>\n<\/div>","protected":false},"excerpt":{"rendered":"<p>Workflows can be improved through the (partial) automation of tasks such as the creation of documents. Customer service can also be optimized through AI-supported personalization. The provision of data insights by AI systems also helps to optimize processes. Large language models (LLMs) such as ChatGPT offer a wide range of promising applications for this.     <\/p>\n","protected":false},"featured_media":31882,"template":"","publication-category":[50,74],"class_list":["post-35590","publication","type-publication","status-publish","has-post-thumbnail","hentry","publication-category-data-science-and-a-i","publication-category-industry-4-0"],"acf":[],"portrait_thumb_url":"https:\/\/risc.web-email.at\/app\/uploads\/2024\/09\/cstaub_Visualization_of_a_language_model_security_internal_co_d438bb2f-f2de-40a1-8cff-031f52e1b5a2_2-1-1-360x214.png","_links":{"self":[{"href":"https:\/\/risc.web-email.at\/en\/wp-json\/wp\/v2\/publication\/35590","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/risc.web-email.at\/en\/wp-json\/wp\/v2\/publication"}],"about":[{"href":"https:\/\/risc.web-email.at\/en\/wp-json\/wp\/v2\/types\/publication"}],"version-history":[{"count":1,"href":"https:\/\/risc.web-email.at\/en\/wp-json\/wp\/v2\/publication\/35590\/revisions"}],"predecessor-version":[{"id":35592,"href":"https:\/\/risc.web-email.at\/en\/wp-json\/wp\/v2\/publication\/35590\/revisions\/35592"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/risc.web-email.at\/en\/wp-json\/wp\/v2\/media\/31882"}],"wp:attachment":[{"href":"https:\/\/risc.web-email.at\/en\/wp-json\/wp\/v2\/media?parent=35590"}],"wp:term":[{"taxonomy":"publication-category","embeddable":true,"href":"https:\/\/risc.web-email.at\/en\/wp-json\/wp\/v2\/publication-category?post=35590"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}