Police officers are starting to use AI chatbots to write crime reports. Will they hold up in court?
Lying to people is the only thing AI is good for, so its no shock that cops want to use it
Big brain tech dude got yet another clueless take over at HackerNews etc? Here's the place to vent. Orange site, VC foolishness, all welcome.
This is not debate club. Unless it’s amusing debate.
For actually-good tech, you want our NotAwfulTech community
Police officers are starting to use AI chatbots to write crime reports. Will they hold up in court?
Lying to people is the only thing AI is good for, so its no shock that cops want to use it
Finally it is today on the AS. So I can post my link. The AI Guys Are Driving Themselves Mad (nymag link)
REAL
Oof, real Qanon flavor there.
https://www.404media.co/this-is-doom-running-on-a-diffusion-model/
We can boil the oceans to run a worse version of a game that can run at 60fps on a potato, but the really cool part is that we need the better version of the game to exist in the first place and also the new version only runs at 20fps.
Oh god is this the first time we have to sneer at a 404 article? Let's hope it will be the last.
It's running at frames per second, not seconds per frame. so it's not too energy intensive compared with the generative versions.
it’s interesting that the only real “hallucination” I can see in the video pops up when the player shoots an enemy, which results in some blurry feedback animations
Ah yes, issues appear when shooting an enemy, in a shooter game. Definitely not proof that the technology falls apart when it's made to do the thing that it was created to do.
e: The demos made me motion sick. Random blobs of colour appearing at random and floor textures shifting around aren't hallucinations?
yeah, this is weirdly sneerable for a 404 article, and I hope this isn’t an early sign they’ve enshittifying. let’s do what they should have and take a critical look at, ah, GameNGen, a name for their research they surely won’t regret
Diffusion Models Are Real-Time Game Engines
wow! it’s a shame that creating this model involved plagiarizing every bit of recorded doom footage that’s ever existed, exploited an uncounted number of laborers from the global south for RLHF, and burned an amount of rainforest in energy that also won’t be counted. but fuck it, sometimes I shop at Walmart so I can’t throw stones and this sounds cool, so let’s grab the source and see how it works!
just kidding, this thing’s hosted on github but there’s no source. it’s just a static marketing page, a selection of videos, and a link to their paper on arXiv, which comes in at a positively ultralight 10 LaTeX-formatted letter-sized pages when you ignore the many unhelpful screenshots and graphs they included
so we can’t play with it, but it’s a model implementing a game engine, right? so the evaluation strategy given in the paper has to involve the innovative input mechanism they’ve discovered that enables the model to simulate a gameplay loop (and therefore a game engine), right? surely that’s what convinced a pool of observers with more-than-random-chance certainty that the model was accurately simulating doom?
Human Evaluation. As another measurement of simulation quality, we provided 10 human raters with 130 random short clips (of lengths 1.6 seconds and 3.2 seconds) of our simulation side by side with the real game. The raters were tasked with recognizing the real game (see Figure 14 in Appendix A.6). The raters only choose the actual game over the simulation in 58% or 60% of the time (for the 1.6 seconds and 3.2 seconds clips, respectively).
of course not. nowhere in this paper is their supposed innovation in input actually evaluated — at no point is this work treated experimentally like a real-time game engine. also, and you pointed this out already — were the human raters drunk? (honestly, I couldn’t blame them — I wouldn’t give a shit either if my mturk was “which of these 1.6 second clips is doom”) the fucking thing doesn’t even simulate doom’s main gameplay loop right; dead possessed marines just turn to a blurry mess, health and armor don’t make sense in any but the loosest sense, it doesn’t seem to think imps exist at all but does randomly place their fireballs where they should be, and sometimes the geometry it’s simulating just casually turns into a visual paradox. chances are this experimental setup was tuned for the result they wanted — they managed to trick 40% of a group of people who absolutely don’t give a fuck that the incredibly short video clip they were looking at was probably a video game. amazing!
if we ever get our hands on the code for this thing, I’m gonna make a prediction: it barely listens to input, if at all. the video clips they’ve released on their site and YouTube are the most coherent this thing gets, and it instantly falls apart the instant you do anything that wasn’t in its training set (aka, the instant you use this real-time game engine to play a game and do something unremarkably weird, like try to ram yourself through a wall)
The paper is so bad...
the agent's policy π ... the environment ε
What is up with AI papers using fancy symbols to notate abstract concepts when there isn't a single other instance of the concept to be referred to
They offer a bunch of tables with numbers in a metric that isn't explained, showing that they are exactly the same for "random" and "agent" policy, in other words, inputs don't actually matter! And they say they want to use these metrics for training future versions. Good luck.
For the sample size they are using 60% seems like a statistically significant rate, and they only tested at most 3 seconds after real gameplay footage.
Sidenote: Auto-regressive models for much shorter periods are really useful for when audio is cutting out. Those use really simple math, they aren't burning any rainforests
I'm willing to retract my statement that these guys don't have any ulterior motives.
The paper starts with a weirdly bad definition of "computer game" too. It almost makes me think that (gasp) the paper was written by non-gamers.
Computer games are manually crafted software systems centered around the following game loop: (1) gather user inputs, (2) update the game state, and (3) render it to screen pixels. This game loop, running at high frame rates, creates the illusion of an interactive virtual world for the player.
No rendering: Myst
No frame rate: Zork
No pixels: Asteroids
No virtual world: Wordle
No screen: Soundvoyager, Audio Defense (well these examples have a vestigial screen, but they supposedly don't really need it)
were the human raters drunk? (honestly, I couldn’t blame them — I wouldn’t give a shit either if my mturk was “which of these 1.6 second clips is doom”)
"I'unno, I'm fuckin' wasted and guessin' at random."
"So, your P(doom) is 50%."
Not a sneer, but another cool piece from Baldur Bjarnason: The slow evaporation of the free/open source surplus.
Gonna skip straight to near the end, where Baldur lays out a potential apocalypse scenario for FOSS as we know it:
Best case scenario, seems to me, is that Free and Open Source Software enters a period of decline. After all, that’s generally what happens to complex systems with less investment. Worst case scenario is a vicious cycle leading to a collapse:
Declining surplus and burnout leads to maintainers increasingly stepping back from their projects.
Many of these projects either bitrot serious bugs or get taken over by malicious actors who are highly motivated because they can’t relay on pervasive memory bugs anymore for exploits.
OSS increasingly gets a reputation (deserved or not) for being unsafe and unreliable.
That decline in users leads to even more maintainers stepping back.
Linking this to a related sneer, another major problem that I can see befalling FOSS is earning a reputation as a Nazi bar. How high that risk is I'm not sure, but between the AI bubble shredding tech's public image and our very good friends increasingly catching the public's attention, I suspect those chances are pretty high.
When the revolution comes, we have to cut off the hands wearing this abomination of a watch before we cut off the owner's head:
https://www.ablogtowatch.com/new-release-jacob-co-oil-pump-44mm-watch/
Price: $280k, limited to 88 pieces...
I can only assume the 88 is a coincidence. it's just a coincidence, right? right?
they live glasses on
I THINK ILLEGAL IMMIGRANTS ARE DOING ELECTION FRAUD
DUAL CITIZENSHIP EXISTS
LET ME MISS THE POINT EQUALLY HARD IN A DIFFERENT DIRECTION, OR AT LEAST HOPEFULLY SO
I hate to say it but pg's tweet was reasonable. Some asshole carrying water for Mike fucking Lee deserves to be blocked.
So the orange site is having a normal one over Python BFDL trying to skirt CoC by talking about mod actions against some old dude who caught a suspension for being precisely the sort of edgelord poaster I'd expect out of a Python maintainer, which the orange site was also not happy about. I even read a bunch of his posts in the thread, like where he calls people standing up to NixOS leadership "true villains".
oh my god, that weird fash fucker is absolutely pulling a NixOS and trying to burn down the Python community over a well-deserved 3 month suspension
and the only reason I know about this shit even though I’m barely involved with Python in any regard is because one of his fans/alts was spamming mastodon with a blog post defending him, and fully half of it by scroll bar position was just fluffing the fucker’s previous achievements, then at almost exactly the halfway point it started describing all the shit he did and hoo boy does he deserve a lot more than a 3 month suspension
it’s fascinating how this is almost exactly the same situation as with what’s-his-face getting suspended from Nix and the project’s older maintainers pulling ranks to get the toxic fucker back
So it's entirely unclear from that HN thread, but where did this dumbassery start?
it probably isn’t exactly where it started as the entire thing’s in bad faith, but I’ve found the blog post being spammed absolutely everywhere at the time that went into excruciating detail on tim’s history with python then tried its best (and absolutely failed) to paper over and misrepresent the shit Tim did that got him temporarily ejected
e: my strong personal impression is that Tim’s just been like this for 30 years, and nobody managed to call him out before cause he’s the Timsort guy and open source projects always seem to think technical achievement should absolve you of all the other shit you do, regardless of how much that shit damages the project technically
Defending “reverse racism” and “reverse sexism”,
lol yeah this was the line at which the post revealed its entire ass
it’s remarkable that the post spends so many paragraphs priming the reader into thinking Tim’s an irreplaceable part of the Python community and should never have been suspended, and now at least two people have gotten to that exact sentence and gone “no actually fuck Tim”
if memory serves, it gets much more mask off from there, but I remember I didn’t finish the entire thing before I closed the tab and started blocking Tim’s fans