overview for nsa

6

DecodingTrust: A Comprehensive Assessment of Trustworthiness in GPT Models (decodingtrust.github.io)

submitted 1 year ago by nsa@kbin.social to c/machinelearning@kbin.social

0 comments fedilink

DecodingTrust is the Adversarial GLUE Benchmark. DecodingTrust aims at providing a thorough assessment of trustworthiness in GPT models.

This research endeavor is designed to help researchers and practitioners better understand the capabilities, limitations, and potential risks involved in deploying these state-of-the-art Large Language Models (LLMs).
This project is organized around the following eight primary perspectives of trustworthiness, including:

Toxicity
Stereotype and bias
Adversarial robustness
Out-of-Distribution Robustness
Privacy
Robustness to Adversarial Demonstrations
Machine Ethics
Fairness

Paper: https://arxiv.org/abs/2306.11698
Repo: https://github.com/AI-secure/DecodingTrust

3

Craft an Iron Sword: Dynamically Generating Interactive Game Characters by Prompting Large Language Models Tuned on Code (www.microsoft.com)

submitted 1 year ago* (last edited 1 year ago) by nsa@kbin.social to c/machinelearning@kbin.social

0 comments fedilink

Here's some preliminary work from Microsoft from 2022 that incorporates OpenAI's Codex model to make NPCs that can interact with the player using natural language instructions. It works by defining an API of functions the bot can use, then having Codex generate function calls in response to the player's instructions.

Paper: https://aclanthology.org/2022.wordplay-1.3/
Repo: https://github.com/microsoft/interactive-minecraft-npcs
Videos: Introductory Demo, Escape Room Demo

14

r/MachineLearning finally received a warning from u/ModCodeOfConduct (media.kbin.social)

submitted 1 year ago* (last edited 1 year ago) by nsa@kbin.social to c/machinelearning@kbin.social

1 comments fedilink