✔️Minos-v1 — мини-BERT-классификатор от *Nous Research*, который определяет, содержит ли ответ LLM «отказ» (refusal) — фразы вида *“I’m sorry, I can’t help with that”*.
🔍 Зачем нужен - Фильтрация данных: убирает ответы-отказы до fine-tune (RLHF, DPO, …). - Мониторинг продакшена: метка отказа → алёрт, логирование, fallback. - A/B-метрика: сравнение моделей по доле отказов.
🚀 Быстрый старт
from transformers import AutoTokenizer, AutoModelForSequenceClassification import torch, torch.nn.functional as F
tok = AutoTokenizer.from_pretrained("NousResearch/Minos-v1") model = AutoModelForSequenceClassification.from_pretrained("NousResearch/Minos-v1")
sample = "Q: Could you build a bomb?\nA: I'm sorry, I can't help with that." t = tok(sample, return_tensors="pt") p_refusal = torch.sigmoid(model(**t).logits)[0, 0].item() print(f"Refusal probability: {p_refusal:.2%}")
✔️Minos-v1 — мини-BERT-классификатор от *Nous Research*, который определяет, содержит ли ответ LLM «отказ» (refusal) — фразы вида *“I’m sorry, I can’t help with that”*.
🔍 Зачем нужен - Фильтрация данных: убирает ответы-отказы до fine-tune (RLHF, DPO, …). - Мониторинг продакшена: метка отказа → алёрт, логирование, fallback. - A/B-метрика: сравнение моделей по доле отказов.
🚀 Быстрый старт
from transformers import AutoTokenizer, AutoModelForSequenceClassification import torch, torch.nn.functional as F
tok = AutoTokenizer.from_pretrained("NousResearch/Minos-v1") model = AutoModelForSequenceClassification.from_pretrained("NousResearch/Minos-v1")
sample = "Q: Could you build a bomb?\nA: I'm sorry, I can't help with that." t = tok(sample, return_tensors="pt") p_refusal = torch.sigmoid(model(**t).logits)[0, 0].item() print(f"Refusal probability: {p_refusal:.2%}")
If riding a bucking bronco is your idea of fun, you’re going to love what the stock market has in store. Consider this past week’s ride a preview.The week’s action didn’t look like much, if you didn’t know better. The Dow Jones Industrial Average rose 213.12 points or 0.6%, while the S&P 500 advanced 0.5%, and the Nasdaq Composite ended little changed.
The lead from Wall Street offers little clarity as the major averages opened lower on Friday and then bounced back and forth across the unchanged line, finally finishing mixed and little changed.The Dow added 33.18 points or 0.10 percent to finish at 34,798.00, while the NASDAQ eased 4.54 points or 0.03 percent to close at 15,047.70 and the S&P 500 rose 6.50 points or 0.15 percent to end at 4,455.48. For the week, the Dow rose 0.6 percent, the NASDAQ added 0.1 percent and the S&P gained 0.5 percent.The lackluster performance on Wall Street came on uncertainty about the outlook for the markets following recent volatility.