Guardrails for trust, safety, and ethical development and deployment of Large Language Models (LLM)

Anjanava Biswas; Wrick Talukdar

doi:10.55662/JST.2023.4605

Guardrails for trust, safety, and ethical development and deployment of Large Language Models (LLM)

Authors

Anjanava Biswas AWS AI&ML, IEEE CIS, California, USA Author https://orcid.org/0009-0008-3225-3316
Wrick Talukdar AWS AI & ML, IEEE CIS, California, USA Author https://orcid.org/0009-0008-2028-4342

DOI:

https://doi.org/10.55662/JST.2023.4605

Keywords:

trust and safety, large language models, language model safety, guardrails, ethical AI

Abstract

The AI era has ushered in Large Language Models (LLM) to the technological forefront, which has been much of the talk in 2023, and is likely to remain as such for many years to come. LLMs are the AI models that are the power house behind generative AI applications such as ChatGPT. These AI models, fueled by vast amounts of data and computational prowess, have unlocked remarkable capabilities, from human-like text generation to assisting with natural language understanding (NLU) tasks. They have quickly become the foundation upon which countless applications and software services are being built, or at least being augmented with. However, as with any groundbreaking innovations, the rise of LLMs brings forth critical safety, privacy, and ethical concerns. These models are found to have a propensity to leak private information, produce false information, and can be coerced into generating content that can be used for nefarious purposes by bad actors, or even by regular users unknowingly. Implementing safeguards and guardrailing techniques is imperative for applications to ensure that the content generated by LLMs are safe, secure, and ethical. Thus, frameworks to deploy mechanisms that prevent misuse of these models via application implementations is imperative. In this study, we propose a Flexible Adaptive Sequencing mechanism with trust and safety modules, that can be used to implement safety guardrails for the development and deployment of LLMs.

Downloads

Download data is not yet available.

Downloads

Published

01-11-2023

Issue

Vol. 4 No. 6 (2023): Journal of Science & Technology

Section

Research Articles

License

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.

License Terms

Ownership and Licensing:

Authors of this research paper submitted to the journal owned and operated by The Science Brigade Group retain the copyright of their work while granting the journal certain rights. Authors maintain ownership of the copyright and have granted the journal a right of first publication. Simultaneously, authors agreed to license their research papers under the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0) License.

License Permissions:

Under the CC BY-NC-SA 4.0 License, others are permitted to share and adapt the work, as long as proper attribution is given to the authors and acknowledgement is made of the initial publication in the Journal. This license allows for the broad dissemination and utilization of research papers.

Additional Distribution Arrangements:

Authors are free to enter into separate contractual arrangements for the non-exclusive distribution of the journal's published version of the work. This may include posting the work to institutional repositories, publishing it in journals or books, or other forms of dissemination. In such cases, authors are requested to acknowledge the initial publication of the work in this Journal.

Online Posting:

Authors are encouraged to share their work online, including in institutional repositories, disciplinary repositories, or on their personal websites. This permission applies both prior to and during the submission process to the Journal. Online sharing enhances the visibility and accessibility of the research papers.

Responsibility and Liability:

Authors are responsible for ensuring that their research papers do not infringe upon the copyright, privacy, or other rights of any third party. The Science Brigade Publishers disclaim any liability or responsibility for any copyright infringement or violation of third-party rights in the research papers.

How to Cite

[1]

A. Biswas and W. Talukdar, “Guardrails for trust, safety, and ethical development and deployment of Large Language Models (LLM)”, J. Sci. Tech., vol. 4, no. 6, pp. 55–82, Nov. 2023, doi: 10.55662/JST.2023.4605.

Download Citation