Update How_to_use_moderation.ipynb (#1097)

This commit is contained in:
Teo Musatoiu 2024-03-11 16:58:16 +00:00 committed by GitHub
parent 1dad5f913e
commit bed41103a2
No known key found for this signature in database
GPG Key ID: B5690EEEBB952194

View File

@ -201,7 +201,7 @@
"metadata": {},
"source": [
"#### Setting moderation thresholds\n",
"Our output moderation will assess the LLM's response and block anything scoring a 0.4 or higher in any category. Setting this threshold is a common area for optimization - we recommend building an evaluation set and grading the results using a confusion matrix to set the right tolerance for your moderation. The trade-off here is generally:\n",
"OpenAI has selected thresholds for moderation categories that balance precision and recall for our use cases, but your use case or tolerance for moderation may be different. Setting this threshold is a common area for optimization - we recommend building an evaluation set and grading the results using a confusion matrix to set the right tolerance for your moderation. The trade-off here is generally:\n",
"\n",
"- More false positives leads to a fractured user experience, where customers get annoyed and the assistant seems less helpful.\n",
"- More false negatives can cause lasting harm to your business, as people get the assistant to answer inappropriate questions, or provide inappropriate responses.\n",