• About TC
  • Affiliate Disclaimer
  • Privacy Policy
  • TOS
  • Contact
Sunday, July 6, 2025
Techcratic
  • TC
  • AI
    Artificial Intelligence

    Transforming network operations with AI: How Swisscom built a network assistant using Amazon Bedrock

    Artificial Intelligence

    EgoDex: Learning Dexterous Manipulation from Large-Scale Egocentric Video

    Artificial Intelligence

    Instruction-Following Pruning for Large Language Models

    Artificial Intelligence

    How to Combine Streamlit, Pandas, and Plotly for Interactive Data Apps

    Artificial Intelligence

    Tailor responsible AI with new safeguard tiers in Amazon Bedrock Guardrails

    Artificial Intelligence

    Automate Data Quality Reports with n8n: From CSV to Professional Analysis

    Artificial Intelligence

    NewDay builds A Generative AI based Customer service Agent Assist with over 90% accuracy

    Artificial Intelligence

    5 Things You Need to Know About Agentic AI

    Artificial Intelligence

    Normalizing Flows are Capable Generative Models

  • App Zone
    Top 3 Launcher Apps for Apple: Features, Pros, and Cons

    Top 3 Launcher Apps for Apple: Features, Pros, and Cons

    Top 3 Launcher Apps for Android: Features, Pros, and Cons

    Top 3 Launcher Apps for Android: Features, Pros, and Cons

    Top 3 Card Game Apps of 2025: Features, Pros, and Cons

    Top 3 Card Game Apps of 2025: Features, Pros, and Cons

    Top 3 Medical Apps of 2025: Features, Pros, and Cons

    Top 3 Medical Apps of 2025: Features, Pros, and Cons

    Top 3 Travel Apps of 2025: Features, Pros, and Cons

    Top 3 Travel Apps of 2025: Features, Pros, and Cons

    Top 3 Casual Game Apps for 2025: Features, Pros, and Cons

    Top 3 Casual Game Apps for 2025: Features, Pros, and Cons

    Top 3 Food Apps for 2025: Features, Pros, and Cons

    Top 3 Food Apps for 2025: Features, Pros, and Cons

    Top 3 Sport Apps for 2025: Features, Pros, and Cons

    Top 3 Sport Apps for 2025: Features, Pros, and Cons

    Top 3 Productivity Apps for 2025: Features, Pros, and Cons

    Top 3 Productivity Apps for 2025: Features, Pros, and Cons

  • Apple
    Yes, you can run Windows 11 on your Mac — and it’s only $15

    Run Windows apps on your Mac with Windows 11 Pro — now just $9.97

    How to stop LG & Samsung smart TV tracking, screen captures

    How to stop LG & Samsung smart TV tracking, screen captures

    Apple’s F1 expected to hit $300M at the box office this weekend

    Apple’s F1 expected to hit $300M at the box office this weekend

    Apple is reportedly working on a cheaper MacBook, but will it stick the landing?

    Apple is reportedly working on a cheaper MacBook, but will it stick the landing?

    Apple @ Work: Macs have never been more expensive to repair, but never been more reliable

    Apple @ Work: Macs have never been more expensive to repair, but never been more reliable

    New Gemini icon comes to Android and iPhone

    New Gemini icon comes to Android and iPhone

    Best Mac SSD and hard drive Prime Day deals 2025: Early discounts

    Best Mac SSD and hard drive Prime Day deals 2025: Early discounts

    This is the letter Donald Trump sent Apple to keep TikTok online

    This is the letter Donald Trump sent Apple to keep TikTok online

    Siri’s future, the original iPhone’s past, and Apple Music’s birthday

    Siri’s future, the original iPhone’s past, and Apple Music’s birthday

  • Retro Rewind
    Retro Rewind: Electronic Games April 1995

    Retro Rewind: Electronic Games April 1995

    Retro Rewind: Electronic Gaming Monthly Magazine Number 55 February 1994

    Retro Rewind: Electronic Gaming Monthly Magazine Number 57 April 1994

    Retro Rewind: Blast from the Past – 35 Iconic Commercials of 1988!

    Retro Rewind: Blast from the Past – 35 Iconic Commercials of 1988!

    Retro Rewind: PC World Magazine August 1998

    Retro Rewind: PC World Magazine August 1998

    Retro Rewind: Computer Shopper Magazine September 1997

    Retro Rewind: Computer Shopper Magazine September 1997

    Retro Rewind: PC Magazine December 2015

    Retro Rewind: PC Magazine December 2015

    Retro Rewind: EDGE Magazine RETRO #1: The Guide to Classic Videogame Playing and Collecting

    Retro Rewind: EDGE Magazine RETRO #1: The Guide to Classic Videogame Playing and Collecting

    Retro Rewind: Computer Gaming World Magazine Issue 73 December 1998

    Retro Rewind: Computer Gaming World Magazine Issue 73 December 1998

    Retro Rewind: Electronic Gaming Monthly Magazine Number 55 February 1994

    Retro Rewind: Electronic Gaming Monthly Magazine Number 55 February 1994

  • Tech Deals
    INNOCN 49″ Curved Gaming Monitor 144Hz Ultrawide 32:9 WDFHD 3840 x 1080P, R1800, 99%…

    INNOCN 49″ Curved Gaming Monitor 144Hz Ultrawide 32:9 WDFHD 3840 x 1080P, R1800, 99%…

    Razer Iskur V2 Gaming Chair: Adaptive Lumbar Support – Adjustable Lumbar Curve – High…

    Razer Iskur V2 Gaming Chair: Adaptive Lumbar Support – Adjustable Lumbar Curve – High…

    Critical Rolls: Boxed Set – 5e RPG Storytelling Cards, 300 Tarot Sized Cards, Tabletop…

    Critical Rolls: Boxed Set – 5e RPG Storytelling Cards, 300 Tarot Sized Cards, Tabletop…

    Nintendogs Dachshund & Friends (Renewed)

    Nintendogs Dachshund & Friends (Renewed)

    Gamer [Blu-ray]

    Gamer [Blu-ray]

    Transcend TS-RDF2 Cfast 2.0 USB 3.1 Card Reader

    Transcend TS-RDF2 Cfast 2.0 USB 3.1 Card Reader

    MaxLLTo USB 3.0 Power Charger Data SYNC Cable Cord for Toshiba External Hard Drive Disk…

    MaxLLTo USB 3.0 Power Charger Data SYNC Cable Cord for Toshiba External Hard Drive Disk…

    Seagate Bulk ST4000NM0033 Constellation ES.3 4TB SATA 6G (Renewed)

    Seagate Bulk ST4000NM0033 Constellation ES.3 4TB SATA 6G (Renewed)

    Seagate Video 2.5 HDD Hard Drive – Internal (ST500VT000)

    Seagate Video 2.5 HDD Hard Drive – Internal (ST500VT000)

  • Tech Eats
    Cheesy Broccoli Rice Mug: 5-Minute Super Comfort Food

    Cheesy Broccoli Rice Mug: 5-Minute Super Comfort Food

    Top 10 Vegetarian Recipes for 2025: Easy and Nutritious Meals for Busy People

    Top 10 Vegetarian Recipes for 2025: Easy and Nutritious Meals for Busy People

    Bacon Mug Lasagna: 5-Minute Microwave Meat Lover’s Dream

    Bacon Mug Lasagna: 5-Minute Microwave Meat Lover’s Dream

    Bacon Fried Rice Mug: 5-Minute Microwave Meal

    Bacon Fried Rice Mug: 5-Minute Microwave Meal

    Bacon & Cheddar Mug Biscuit: 2-Minute Savory Comfort

    Bacon & Cheddar Mug Biscuit: 2-Minute Savory Comfort

    Loaded Bacon Cheesy Potato Mug: 5-Minute Comfort Food

    Loaded Bacon Cheesy Potato Mug: 5-Minute Comfort Food

    Peanut Butter Banana Mug Muffin: 5-Minute Protein Snack

    Peanut Butter Banana Mug Muffin: 5-Minute Protein Snack

    Oreo Mug Cake: 2-Minute Cookie & Cake Combo!

    Oreo Mug Cake: 2-Minute Cookie & Cake Combo!

    Tiramisu Mug Cake: Coffee Lover’s Dream in 2 Minutes!

    Tiramisu Mug Cake: Coffee Lover’s Dream in 2 Minutes!

  • Tesla
    BestEvMod for Refreshed Model 3 Highland Cargo Liner Floor Liners Trunk and Frunk Mat…

    BestEvMod for Refreshed Model 3 Highland Cargo Liner Floor Liners Trunk and Frunk Mat…

    4 PCS Car Front and Rear Side Window Sunshade, 19.6″ x 31.4″ x 7.8″ + 19.6″ x 31.4″ Keep…

    4 PCS Car Front and Rear Side Window Sunshade, 19.6″ x 31.4″ x 7.8″ + 19.6″ x 31.4″ Keep…

    Car Floor Mats for Tesla Cybertruck 2023 2024 2025, Custom TPE All Weather Protection…

    Car Floor Mats for Tesla Cybertruck 2023 2024 2025, Custom TPE All Weather Protection…

    JOYTUTUS Truck Bed Divider Compatible with Cybertruck 2024 2023 Cargo Divider Organizer…

    JOYTUTUS Truck Bed Divider Compatible with Cybertruck 2024 2023 Cargo Divider Organizer…

    HANSSHOW Pet Seat Covers for Cybertruck Rear Dog Seat Protector Full-Cover Waterproof…

    HANSSHOW Pet Seat Covers for Cybertruck Rear Dog Seat Protector Full-Cover Waterproof…

    Center Console Organizer Tray Compatible with Tesla Cybertruck 2024 2025 Accessories,…

    Center Console Organizer Tray Compatible with Tesla Cybertruck 2024 2025 Accessories,…

    Cybertruck Sticker Vinyl Bumper Sticker Decal Waterproof 5″

    Cybertruck Sticker Vinyl Bumper Sticker Decal Waterproof 5″

    JOMISE Dash Cam Front and Rear, 4k FHD Dual Car Camera, 3″ IPS Dash Camera for Cars with…

    JOMISE Dash Cam Front and Rear, 4k FHD Dual Car Camera, 3″ IPS Dash Camera for Cars with…

    Model 3 Badge Emblem – Front Hood and Rear Trunk Replacement Logo for Tesla Model 3-3D…

    Model 3 Badge Emblem – Front Hood and Rear Trunk Replacement Logo for Tesla Model 3-3D…

  • UFO
    SOJOS Retro Polarized Square Sunglasses Womens Men Vintage Double Bridge Metal Frame UV Protection Sun Glasses SJ1246

    SOJOS Retro Polarized Square Sunglasses Womens Men Vintage Double Bridge Metal Frame UV Protection Sun Glasses SJ1246

    Bill Nye on Space Exploration #billnye #science #space #spaceexploration  #masterclass

    Bill Nye on Space Exploration #billnye #science #space #spaceexploration #masterclass

    Nessie and UFO, Sasquatch Rare Selfie, The Loch Ness Bigfoot T-Shirt

    Nessie and UFO, Sasquatch Rare Selfie, The Loch Ness Bigfoot T-Shirt

    Spirit Communication by Rev. Gaurav Tiwari | Indian Paranormal Society

    Spirit Communication by Rev. Gaurav Tiwari | Indian Paranormal Society

    Crumbl Conspiracy Investigation

    Crumbl Conspiracy Investigation

    New York NY City Lights | Ufo sightings in 2021 | Unidentified Flying object

    New York NY City Lights | Ufo sightings in 2021 | Unidentified Flying object

    amBand Compatible for Fitbit Versa 4/3/2/ Fitbit Versa Lite/Fitbit Sense 2/ Fitbit Sense Bands with Case, Protective Smartwatch Case Strap Rugged Sport Protector Wristbands Men Green

    amBand Compatible for Fitbit Versa 4/3/2/ Fitbit Versa Lite/Fitbit Sense 2/ Fitbit Sense Bands with Case, Protective Smartwatch Case Strap Rugged Sport Protector Wristbands Men Green

    Scientists Solve the Mystery Behind the Oumuamua 'Alien Spacecraft' Comet

    Scientists Solve the Mystery Behind the Oumuamua 'Alien Spacecraft' Comet

    Aliens From Outer Space: Ufo Landings, Crashes And Retrievals

    Aliens From Outer Space: Ufo Landings, Crashes And Retrievals

No Result
View All Result
  • TC
  • AI
    Artificial Intelligence

    Transforming network operations with AI: How Swisscom built a network assistant using Amazon Bedrock

    Artificial Intelligence

    EgoDex: Learning Dexterous Manipulation from Large-Scale Egocentric Video

    Artificial Intelligence

    Instruction-Following Pruning for Large Language Models

    Artificial Intelligence

    How to Combine Streamlit, Pandas, and Plotly for Interactive Data Apps

    Artificial Intelligence

    Tailor responsible AI with new safeguard tiers in Amazon Bedrock Guardrails

    Artificial Intelligence

    Automate Data Quality Reports with n8n: From CSV to Professional Analysis

    Artificial Intelligence

    NewDay builds A Generative AI based Customer service Agent Assist with over 90% accuracy

    Artificial Intelligence

    5 Things You Need to Know About Agentic AI

    Artificial Intelligence

    Normalizing Flows are Capable Generative Models

  • App Zone
    Top 3 Launcher Apps for Apple: Features, Pros, and Cons

    Top 3 Launcher Apps for Apple: Features, Pros, and Cons

    Top 3 Launcher Apps for Android: Features, Pros, and Cons

    Top 3 Launcher Apps for Android: Features, Pros, and Cons

    Top 3 Card Game Apps of 2025: Features, Pros, and Cons

    Top 3 Card Game Apps of 2025: Features, Pros, and Cons

    Top 3 Medical Apps of 2025: Features, Pros, and Cons

    Top 3 Medical Apps of 2025: Features, Pros, and Cons

    Top 3 Travel Apps of 2025: Features, Pros, and Cons

    Top 3 Travel Apps of 2025: Features, Pros, and Cons

    Top 3 Casual Game Apps for 2025: Features, Pros, and Cons

    Top 3 Casual Game Apps for 2025: Features, Pros, and Cons

    Top 3 Food Apps for 2025: Features, Pros, and Cons

    Top 3 Food Apps for 2025: Features, Pros, and Cons

    Top 3 Sport Apps for 2025: Features, Pros, and Cons

    Top 3 Sport Apps for 2025: Features, Pros, and Cons

    Top 3 Productivity Apps for 2025: Features, Pros, and Cons

    Top 3 Productivity Apps for 2025: Features, Pros, and Cons

  • Apple
    Yes, you can run Windows 11 on your Mac — and it’s only $15

    Run Windows apps on your Mac with Windows 11 Pro — now just $9.97

    How to stop LG & Samsung smart TV tracking, screen captures

    How to stop LG & Samsung smart TV tracking, screen captures

    Apple’s F1 expected to hit $300M at the box office this weekend

    Apple’s F1 expected to hit $300M at the box office this weekend

    Apple is reportedly working on a cheaper MacBook, but will it stick the landing?

    Apple is reportedly working on a cheaper MacBook, but will it stick the landing?

    Apple @ Work: Macs have never been more expensive to repair, but never been more reliable

    Apple @ Work: Macs have never been more expensive to repair, but never been more reliable

    New Gemini icon comes to Android and iPhone

    New Gemini icon comes to Android and iPhone

    Best Mac SSD and hard drive Prime Day deals 2025: Early discounts

    Best Mac SSD and hard drive Prime Day deals 2025: Early discounts

    This is the letter Donald Trump sent Apple to keep TikTok online

    This is the letter Donald Trump sent Apple to keep TikTok online

    Siri’s future, the original iPhone’s past, and Apple Music’s birthday

    Siri’s future, the original iPhone’s past, and Apple Music’s birthday

  • Retro Rewind
    Retro Rewind: Electronic Games April 1995

    Retro Rewind: Electronic Games April 1995

    Retro Rewind: Electronic Gaming Monthly Magazine Number 55 February 1994

    Retro Rewind: Electronic Gaming Monthly Magazine Number 57 April 1994

    Retro Rewind: Blast from the Past – 35 Iconic Commercials of 1988!

    Retro Rewind: Blast from the Past – 35 Iconic Commercials of 1988!

    Retro Rewind: PC World Magazine August 1998

    Retro Rewind: PC World Magazine August 1998

    Retro Rewind: Computer Shopper Magazine September 1997

    Retro Rewind: Computer Shopper Magazine September 1997

    Retro Rewind: PC Magazine December 2015

    Retro Rewind: PC Magazine December 2015

    Retro Rewind: EDGE Magazine RETRO #1: The Guide to Classic Videogame Playing and Collecting

    Retro Rewind: EDGE Magazine RETRO #1: The Guide to Classic Videogame Playing and Collecting

    Retro Rewind: Computer Gaming World Magazine Issue 73 December 1998

    Retro Rewind: Computer Gaming World Magazine Issue 73 December 1998

    Retro Rewind: Electronic Gaming Monthly Magazine Number 55 February 1994

    Retro Rewind: Electronic Gaming Monthly Magazine Number 55 February 1994

  • Tech Deals
    INNOCN 49″ Curved Gaming Monitor 144Hz Ultrawide 32:9 WDFHD 3840 x 1080P, R1800, 99%…

    INNOCN 49″ Curved Gaming Monitor 144Hz Ultrawide 32:9 WDFHD 3840 x 1080P, R1800, 99%…

    Razer Iskur V2 Gaming Chair: Adaptive Lumbar Support – Adjustable Lumbar Curve – High…

    Razer Iskur V2 Gaming Chair: Adaptive Lumbar Support – Adjustable Lumbar Curve – High…

    Critical Rolls: Boxed Set – 5e RPG Storytelling Cards, 300 Tarot Sized Cards, Tabletop…

    Critical Rolls: Boxed Set – 5e RPG Storytelling Cards, 300 Tarot Sized Cards, Tabletop…

    Nintendogs Dachshund & Friends (Renewed)

    Nintendogs Dachshund & Friends (Renewed)

    Gamer [Blu-ray]

    Gamer [Blu-ray]

    Transcend TS-RDF2 Cfast 2.0 USB 3.1 Card Reader

    Transcend TS-RDF2 Cfast 2.0 USB 3.1 Card Reader

    MaxLLTo USB 3.0 Power Charger Data SYNC Cable Cord for Toshiba External Hard Drive Disk…

    MaxLLTo USB 3.0 Power Charger Data SYNC Cable Cord for Toshiba External Hard Drive Disk…

    Seagate Bulk ST4000NM0033 Constellation ES.3 4TB SATA 6G (Renewed)

    Seagate Bulk ST4000NM0033 Constellation ES.3 4TB SATA 6G (Renewed)

    Seagate Video 2.5 HDD Hard Drive – Internal (ST500VT000)

    Seagate Video 2.5 HDD Hard Drive – Internal (ST500VT000)

  • Tech Eats
    Cheesy Broccoli Rice Mug: 5-Minute Super Comfort Food

    Cheesy Broccoli Rice Mug: 5-Minute Super Comfort Food

    Top 10 Vegetarian Recipes for 2025: Easy and Nutritious Meals for Busy People

    Top 10 Vegetarian Recipes for 2025: Easy and Nutritious Meals for Busy People

    Bacon Mug Lasagna: 5-Minute Microwave Meat Lover’s Dream

    Bacon Mug Lasagna: 5-Minute Microwave Meat Lover’s Dream

    Bacon Fried Rice Mug: 5-Minute Microwave Meal

    Bacon Fried Rice Mug: 5-Minute Microwave Meal

    Bacon & Cheddar Mug Biscuit: 2-Minute Savory Comfort

    Bacon & Cheddar Mug Biscuit: 2-Minute Savory Comfort

    Loaded Bacon Cheesy Potato Mug: 5-Minute Comfort Food

    Loaded Bacon Cheesy Potato Mug: 5-Minute Comfort Food

    Peanut Butter Banana Mug Muffin: 5-Minute Protein Snack

    Peanut Butter Banana Mug Muffin: 5-Minute Protein Snack

    Oreo Mug Cake: 2-Minute Cookie & Cake Combo!

    Oreo Mug Cake: 2-Minute Cookie & Cake Combo!

    Tiramisu Mug Cake: Coffee Lover’s Dream in 2 Minutes!

    Tiramisu Mug Cake: Coffee Lover’s Dream in 2 Minutes!

  • Tesla
    BestEvMod for Refreshed Model 3 Highland Cargo Liner Floor Liners Trunk and Frunk Mat…

    BestEvMod for Refreshed Model 3 Highland Cargo Liner Floor Liners Trunk and Frunk Mat…

    4 PCS Car Front and Rear Side Window Sunshade, 19.6″ x 31.4″ x 7.8″ + 19.6″ x 31.4″ Keep…

    4 PCS Car Front and Rear Side Window Sunshade, 19.6″ x 31.4″ x 7.8″ + 19.6″ x 31.4″ Keep…

    Car Floor Mats for Tesla Cybertruck 2023 2024 2025, Custom TPE All Weather Protection…

    Car Floor Mats for Tesla Cybertruck 2023 2024 2025, Custom TPE All Weather Protection…

    JOYTUTUS Truck Bed Divider Compatible with Cybertruck 2024 2023 Cargo Divider Organizer…

    JOYTUTUS Truck Bed Divider Compatible with Cybertruck 2024 2023 Cargo Divider Organizer…

    HANSSHOW Pet Seat Covers for Cybertruck Rear Dog Seat Protector Full-Cover Waterproof…

    HANSSHOW Pet Seat Covers for Cybertruck Rear Dog Seat Protector Full-Cover Waterproof…

    Center Console Organizer Tray Compatible with Tesla Cybertruck 2024 2025 Accessories,…

    Center Console Organizer Tray Compatible with Tesla Cybertruck 2024 2025 Accessories,…

    Cybertruck Sticker Vinyl Bumper Sticker Decal Waterproof 5″

    Cybertruck Sticker Vinyl Bumper Sticker Decal Waterproof 5″

    JOMISE Dash Cam Front and Rear, 4k FHD Dual Car Camera, 3″ IPS Dash Camera for Cars with…

    JOMISE Dash Cam Front and Rear, 4k FHD Dual Car Camera, 3″ IPS Dash Camera for Cars with…

    Model 3 Badge Emblem – Front Hood and Rear Trunk Replacement Logo for Tesla Model 3-3D…

    Model 3 Badge Emblem – Front Hood and Rear Trunk Replacement Logo for Tesla Model 3-3D…

  • UFO
    SOJOS Retro Polarized Square Sunglasses Womens Men Vintage Double Bridge Metal Frame UV Protection Sun Glasses SJ1246

    SOJOS Retro Polarized Square Sunglasses Womens Men Vintage Double Bridge Metal Frame UV Protection Sun Glasses SJ1246

    Bill Nye on Space Exploration #billnye #science #space #spaceexploration  #masterclass

    Bill Nye on Space Exploration #billnye #science #space #spaceexploration #masterclass

    Nessie and UFO, Sasquatch Rare Selfie, The Loch Ness Bigfoot T-Shirt

    Nessie and UFO, Sasquatch Rare Selfie, The Loch Ness Bigfoot T-Shirt

    Spirit Communication by Rev. Gaurav Tiwari | Indian Paranormal Society

    Spirit Communication by Rev. Gaurav Tiwari | Indian Paranormal Society

    Crumbl Conspiracy Investigation

    Crumbl Conspiracy Investigation

    New York NY City Lights | Ufo sightings in 2021 | Unidentified Flying object

    New York NY City Lights | Ufo sightings in 2021 | Unidentified Flying object

    amBand Compatible for Fitbit Versa 4/3/2/ Fitbit Versa Lite/Fitbit Sense 2/ Fitbit Sense Bands with Case, Protective Smartwatch Case Strap Rugged Sport Protector Wristbands Men Green

    amBand Compatible for Fitbit Versa 4/3/2/ Fitbit Versa Lite/Fitbit Sense 2/ Fitbit Sense Bands with Case, Protective Smartwatch Case Strap Rugged Sport Protector Wristbands Men Green

    Scientists Solve the Mystery Behind the Oumuamua 'Alien Spacecraft' Comet

    Scientists Solve the Mystery Behind the Oumuamua 'Alien Spacecraft' Comet

    Aliens From Outer Space: Ufo Landings, Crashes And Retrievals

    Aliens From Outer Space: Ufo Landings, Crashes And Retrievals

No Result
View All Result
Techcratic
No Result
View All Result
Home Hacker News

Alignment is not free: How model upgrades can silence your confidence signals

Hacker News by Hacker News
May 6, 2025
in Hacker News
Reading Time: 7 mins read
125
A A
0

2025-05-06 19:22:00
www.variance.co

The Flattening Calibration Curve

The post-training process for LLMs can bias behavior for language models when they encounter content that violates their safety post-training guidelines. As mentioned by OpenAI’s GPT-4 system card, model calibration rarely survives post-training, resulting in models that are extremely confident even when they’re wrong.¹ For our use case, we often see this behavior with the side effect of biasing language model outputs towards violations, which can result in wasted review times for human reviewers in an LLM-powered content moderation system.

Pre-training vs. Post-preference optimization calibration curves

‍

A Working Signal on GPT-4o

Take the below histogram of log probs sampled from a golden dataset of false positives against GPT-4o. We can see that almost all outputs have log p≈0 nats (probability ≈ 1) for outputting “true”, indicating a true violation in this dataset.

However, there are a few outliers in this dataset, almost all of which correspond to patterns of behavior we observed in our dataset when our model would stray away from formal grounded policy definitions, or hallucinations in content or policy violations.

The functional confidence signal in GPT-4o

This results in a functional enough ROC curve that’s helpful for calibrating our model to ignore these outputs, and perform tasks like flagging the content for review or suppress the output as likely spurious. 

The Upgrade That Vanished Uncertainty

However, what we found is that after switching to GPT-4.1-mini, this signal vanishes. Although we’re still able to measure log probs for other tokens in our structured outputs, each token was 100% confident that it should return true in this dataset, which completely destroyed our signal.

Why does a smaller sibling of the same model family erase so much information? It’s possible that due to the heavy distillation that occurs to train 4-1 mini for binary decisions (such as outputting a boolean field in a structured output), the dimension is collapsed entirely: the student is taught to emit the right answer and ignore entropy at all. This results in no usable confidence signal.

We tried several other approaches to recover the lost uncertainty signal, all unsuccessful:

  1. Entropy differential hypothesis: We measured entropy between content array vs. chain-of-thought mean, with the theory that hallucinated violations would be wordier/less confident. In practice, we were unable to find a signal here
  2. Span consistency check: We analyzed standard deviation of span log-probs, hoping for variation between true/false cases. In practice, both classes showed σ≈0.018 (identical).
  3. Perplexity analysis: We calculated token-level perplexity averages across all samples. In practice, we found similar metrics for every sample, safe or unsafe.
Failed attempts to recover uncertainty signals in GPT-4.1-mini

The net result is that we’ve lost our signal for hallucinations! All of these features rely on local entropy surviving RLHF, and we don’t have anywhere to look for these signals, requiring new heuristics for model upgrades to solve these failure cases, to re-introduce some uncertainty measures.

In response to this lost hallucination signal, we’ve implemented several alternative safeguards. These new methods, such as formally requiring policy explanations to be fully grounded in actual data/quotes, are powering new features in our product towards better explainability and policy iteration, but do show how there’s more to model upgrades than simply benchmark upgrades.

Our current approach relies on more explicit controls: requiring detailed explanations from the model for each policy violation, demanding specific policy citations to ground decisions, and implementing filtering systems to catch corrupted outputs when policies are hallucinated.

However, the closed-source nature of these models significantly limits our access to internal signals beyond log probabilities. As models continue to be further distilled for efficiency, even these limited signals are fading, creating a growing challenge for reliable uncertainty detection especially when working with closed-source models.

Alignment isn’t free

In our situation, the improvements to steerability and performance upgrades of 4.1 were worth it for customers and our internal workarounds were sufficient to actually increase precision with our latest release. A model upgrade is not merely a drop-in performance bump; it is a distributional shift that can invalidate an entire AI stack. Anyone shipping high-precision systems should log raw logits, tie heuristics to specific model versions, and invest in alternative product safeguards. Alignment makes models safer for users but simultaneously masks their own uncertainty from engineers; the burden of re-exposing that uncertainty falls on us.

1. *OpenAI GPT‑4 System Card*, § 6.2 “Calibration”: “We observe that RLHF improves helpfulness but can distort the model’s probability estimates; after alignment the model tends to be over‑confident on both correct and incorrect answers.

‍

Source Link


Keep your files stored safely and securely with the SanDisk 2TB Extreme Portable SSD. With over 69,505 ratings and an impressive 4.6 out of 5 stars, this product has been purchased over 8K+ times in the past month. At only $129.99, this Amazon’s Choice product is a must-have for secure file storage.

Help keep private content private with the included password protection featuring 256-bit AES hardware encryption. Order now for just $129.99 on Amazon!


Start your free Amazon Prime trial
today and unlock unlimited streaming and more!

Help Power Techcratic’s Future – Scan To Support

If Techcratic’s content and insights have helped you, consider giving back by supporting the platform with crypto. Every contribution makes a difference, whether it’s for high-quality content, server maintenance, or future updates. Techcratic is constantly evolving, and your support helps drive that progress.

As a solo operator who wears all the hats, creating content, managing the tech, and running the site, your support allows me to stay focused on delivering valuable resources. Your support keeps everything running smoothly and enables me to continue creating the content you love. I’m deeply grateful for your support, it truly means the world to me! Thank you!

BITCOIN

Bitcoin Logo

Bitcoin QR Code

bc1qlszw7elx2qahjwvaryh0tkgg8y68enw30gpvge

Scan the QR code with your crypto wallet app

DOGECOIN

Dogecoin Logo

Dogecoin QR Code

D64GwvvYQxFXYyan3oQCrmWfidf6T3JpBA

Scan the QR code with your crypto wallet app

ETHEREUM

Ethereum Logo

Ethereum QR Code

0xe9BC980DF3d985730dA827996B43E4A62CCBAA7a

Scan the QR code with your crypto wallet app

Please read the Privacy and Security Disclaimer on how Techcratic handles your support.

Disclaimer: As an Amazon Associate, Techcratic may earn from qualifying purchases.

Tags: Hacker News
Share162Share28ShareShare4ShareTweet101
Previous Post

Technical Product Manager – Oracle Data Protection Technologies

Next Post

US Dollar Supremacy Is Cracking as Erosion Deepens, Devere Warns

Hacker News

Hacker News

Stay updated with Hacker News, where technology meets entrepreneurial spirit. Get the latest on tech trends, startup news, and discussions from the tech community. Read the latest updates here at Techcratic.

Related Posts

News Alert Immediately – Instant News Alerts & Global Monitoring
Hacker News

News Alert Immediately – Instant News Alerts & Global Monitoring

July 6, 2025
1.3k
hackArcana
Hacker News

hackArcana

July 6, 2025
1.3k
Differentiable Programming with PyTorch and DSPy
Hacker News

Differentiable Programming with PyTorch and DSPy

July 5, 2025
1.3k
The Right Way to Embed an LLM in a Group Chat
Hacker News

The Right Way to Embed an LLM in a Group Chat

July 5, 2025
1.3k
Cybersecurity
Hacker News

How to get into cybersecurity

July 5, 2025
1.3k
Local First Software Is Easier to Scale
Hacker News

Local First Software Is Easier to Scale

July 5, 2025
1.3k
GNU Taler
Hacker News

GNU Taler

July 5, 2025
1.3k
Impact of PCIe 5.0 Bandwidth on GPU Content Creation Performance
Hacker News

Impact of PCIe 5.0 Bandwidth on GPU Content Creation Performance

July 5, 2025
1.3k
Load More
Next Post
US Dollar Supremacy Is Cracking as Erosion Deepens, Devere Warns

US Dollar Supremacy Is Cracking as Erosion Deepens, Devere Warns

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Your Tech Resources

  • 30 Second Tech ™
  • AI
  • App Zone ™
  • Apple
  • Ars Technica
  • CNET
  • ComputerWorld
  • Crypto News
  • Cybersecurity
  • Endgadget
  • Forbes
  • Fossbytes
  • Gaming
  • GeekWire
  • Gizmodo
  • Google News
  • Hacker News
  • Harvard Tech
  • I Like Cats ™
  • I Like Dogs ™
  • LifeHacker
  • MacRumors
  • Macworld
  • Mashable
  • Microsoft
  • MIT Tech
  • PC World
  • Photofocus
  • Physics
  • Random Tech
  • Retro Rewind ™
  • Robot Report
  • SiliconANGLE
  • SlashGear
  • Smartphone
  • StackSocial
  • Tech Art
  • Tech Careers
  • Tech Deals
  • Techcratic ™
  • TechCrunch
  • Techdirt
  • TechRepublic
  • Techs Got To Eat ™
  • TechSpot
  • Tesla
  • The Verge
  • TNW
  • Trusted Reviews
  • UFO
  • VentureBeat
  • Visual Capitalist
  • Wired
  • ZDNet

Tech News

  • 30 Second Tech ™
  • AI
  • Apple Insider
  • Ars Technica
  • CNET
  • ComputerWorld
  • Crypto News
  • Cybersecurity
  • Endgadget
  • ExtremeTech
  • Fossbytes
  • Gaming
  • GeekWire
  • Gizmodo

Tech News

  • Harvard Tech
  • MacRumors
  • Macworld
  • Mashable
  • Microsoft
  • MIT Tech
  • Physics
  • PC World
  • Random Tech
  • Retro Rewind ™
  • SiliconANGLE
  • SlashGear
  • Smartphone
  • StackSocial
  • Tech Careers

Tech News​

  • Tech Art
  • TechCrunch
  • Techdirt
  • TechRepublic
  • Techs Got To Eat ™
  • TechSpot
  • Tesla
  • The Verge
  • TNW
  • Trusted Reviews
  • UFO
  • VentureBeat
  • Visual Capitalist
  • Wired
  • ZDNet

Site Links

  • About Techcratic
  • Affiliate Disclaimer
  • Affiliate Link Policy
  • Contact Techcratic
  • Dealors Discount Store
  • Privacy and Security Disclaimer
  • Privacy Policy
  • RSS Feed
  • Site Map
  • Support Techcratic
  • Techcratic
  • Tech Deals
  • TOS
  • 𝕏
Click For A Secret Deal

Techcratic – Your All In One Tech Hub © 2020 – 2025
All Rights Reserved
∞

No Result
View All Result
  • 30 Second Tech ™
  • AI
  • App Zone ™
  • Apple
  • Ars Technica
  • CNET
  • Crypto News
  • Cybersecurity
  • Endgadget
  • Gaming
  • I Like Cats ™
  • I Like Dogs ™
  • MacRumors
  • Macworld
  • Tech Deals
  • Techcratic ™
  • Techs Got To Eat ™
  • Tesla
  • UFO
  • Wired