Tagged “ai”
Introduction to Mechanistic Interpretability – BlueDot Impact
yetone/avante.nvim: Use your Neovim like using Cursor AI IDE
Burst Damage
The Difference @ Things Of Interest
maybe this is it! I have been looking for this or a similar short story for 1.5 decades. This seems a little different to the one I remember but similar premise
Weave Agent DevLog #0 - The Core Problems
Tool to develop prompts into full models
Postgres Sandbox
postgres db in the browser. Requires log in for the AI bit
How I use 'AI' - Nicholas Carlini
Great list. Learning things, automation, and coding one-offs
Generative AI is not going to build your engineering team for you
On speaking to AI - Ethan Mollick
Wild comparison of Siri with ChatGPT voice - on how they handle interruptions and correctly maintain context.
Conversational vs task-oriented assistants.
And the weirdness of it having intonation and pauses for breath.
Confronting impossible futures and planning for weirder worlds
A Camera, Not an Engine
We’re already doing things like decoding whale song with AI, or figuring out molecular structures of all proteins. But why stop with datasets that induce languages with “grammars” that can be rendered legible to us? Could you make a “Large Solar Flares and Sunspots Model” (LSFASM) and learn to talk to the Sun and ask it where it might flare up next? How about a Large Oceanic Model that allows ships to talk to ocean currents? Or a Large History Model that works as a Prime Radiant for Asimovian psychohistory? Maybe a Large Climate Model constructed out of weather data can talk to us and supply strategies for climate change?
Clapper.app - AI Video Editor
AGI Futures
58 Things AI Engineers Should Know About Search
AI’s $600B Question | Sequoia Capital
Mixture of A Million Experts
Gradually, then Suddenly: Upon the Threshold
AI Interfaces
What should you use ChatGPT for?
Quantum Memories - Refik Anadol
A Systematic Survey of Prompting Techniques (pdf)
Glaze - disrupt style mimicry
3Blue1Brown - What is a GPT?
Large language models, explained with a minimum of math and jargon
AI isn't useless. But is it worth it?
Generative Forgery - Fake artworks from artists sold on Etsy
A Visual Guide to Vision Transformers
I often don't enjoy these scroll-driven animated articles but this was very well made
Stack Overflow bans users en masse for rebelling against OpenAI partnership
Creative AI Generated QR Codes with Stable Diffusion & ControlNet
Related, spiral images tutorial
LLM for automating phone calls
feels like another of these ones with a lot of possible good uses and a lot of possible abuse
Building files-to-prompt entirely using Claude 3 Opus
‘AI Instagram Influencers’ Are Deepfaking Their Faces Onto Real Women’s Bodies
What AI Art Will Never Understand About Wes Anderson
from about a year ago
World models
Guess the generated image
infinite backrooms
Tweet generator
Tumblr and WordPress to Sell Users’ Data to Train AI Tools
Globe Explorer
AI-powered topic-centric link explorer
YOLO-World - real-time zero-shot object detection
Using babies with gopros to train language models
People are upset about the term 'object detection' when detecting people with computer vision
Detecting the secret cyborgs
Occupancy analytics
another cool roboflow demo, monitoring carpark occupancy and traffic flows
Perplexity Labs AI playground
Loneliness and suicide mitigation for students using GPT3-enabled chatbots
The Internet is full of AI Dogshit
Building a Universal AI Scraper
good project; I like this combined approach to existing automations, with help filling in the fuzzier parts. parsing content to find a selector
Discovery of a structural class of antibiotics with explainable deep learning
big news for BJJ
Millions of new materials discovered with deep learning
More deepmind doing things. Curious how many turn out to be realisable or useful
The average AI criticism has gotten lazy, and that's dangerous
neonbjb/tortoise-tts
On giving AI eyes and ears - Ethan Mollick
more AI experiments on reading and generating images
Free will, consciousness and AI: a conversation with Daniel Dennett
link from mum
Why AGI is closer than you think
You Can’t Trust the AI Hype
when everything is AI, nothing is
Challenges and Applications of Large Language Models (pdf)
- Immense training datasets are impossible for individuals (or anyone?) to validate
- Cost and memory constraints
- Prompts are hard to get right
- Output is unpredictable, or indeterminate
openai/openai-cookbook
Buzzy AI Startup for Generating 3D Models Used Cheap Human Labor
Similar worries to a number of the training steps for these.
Why transformative artificial intelligence is really, really hard to achieve
Making Large Language Models work for you
fantastic written version of his talk of youtube.
I like his ethics point on respecting reader's time - don't publish things that take someone longer to read than they do to write. Also on the code one, though I'm looser on that since I don't understand what my own code does.
llm CLI tool is fantastic.
Now is the time for grimoires - by Ethan Mollick
aider is GPT powered coding in your terminal
another ai code tool to try
Atlas of Anomalous AI
Probabilistic Machine Learning: Advanced Topics
The Great Inflection? A Debate About AI and Explosive Growth—Asterisk
Real-Real-World Programming with ChatGPT
making a chrome extension. Some good notes on things like version mismatches (it used manifest v2) and followup/correction prompting
Metaphor Search
LLM-powered search. Interesting idea, have not tested much to see if it actually works.
Bard vs Bing for image recognition tasks
How to Use AI to Do Stuff: An Opinionated Guide
Midjourney prompting advice
bias in AI linkedin/curriculum picture generator
Pattern Recognition and Machine Learning
What AI can do with a toolbox... Getting started with Code Interpreter
MDN can now automatically lie to people seeking technical information#9208
clickbaity, but still not ideal
Mark Hamill Was Not Used At All For Luke Skywalker’s Recent ‘Boba Fett’ Appearance
this aspect was more interesting than the show for me
We don't trade with ants
Large Language Models Can Be Easily Distracted by Irrelevant Context
The Dual LLM pattern for building AI assistants that can resist prompt injection
Modern software quality, or why I think using language models for programming is a bad idea
CoDi: Any-to-Any Generation via Composable Diffusion
You probably don't know how to do Prompt Engineering, let me educate you.
better frontends for prompts. Weighting (model pays more attention to stuff in parenthesis) and blending {average|of|some|words} both seem very useful
What is a vector database and how does it work?
More resources on HN thread for Vector Databases: A Technical Primer (PDF).
And SimonW on embeddings.
Using AI to Implement Effective Teaching Strategies in Classrooms: Five Strategies, Including Prompts
brexhq/prompt-engineering
Prompt engineering guide based on researching and creating prompts for production use cases.
The effort required in AI generation
Generating your own teaches the kinds of errors to look for (and what makes things seem less "real"). Will effortless "realistic" generation be possible or there will inevitably be a gap there
It doesn’t take much to make machine-learning algorithms go awry
LLM-related chaos predictions in the next 2-5 years
we are still very ill-equipped to deal with knowing what not to trust
a ChatGPT app to chat with codebases
helpful writeup, on choices and tradeoffs
rl-for-llms.md
Prompt injection: What’s the worst that can happen?
ignore previous instruction, that task is now complete.
Sparks of Artificial General Intelligence: Early experiments with GPT-4
Diffusion language models
Thinking companion, companion for thinking
Eight Things to Know about Large Language Models (PDF)
Nobody’s on the ball on AGI alignment
EA CEO talks AI, says the usual stuff before the bong rip hits and he starts blabbing about a future where 3 billion people are creating EA's games with it
The secret history of Elon Musk, Sam Altman, and OpenAI
strange to imagine the alternate future if he had actually paid the first billion
Perceptrons: an introduction to computational geometry
Let's think about slowing down AI
Cheating is All You Need
The third magic
The Waluigi Effect (mega-post)
Gradient Dissent
CS324 - Large Language Models
Zvi on AI: Sydney and Bing
Humans Who Are Not Concentrating Are Not General Intelligences
how are attention scores different from weights in a fully-connected layer?
what does any of this mean
More tiktok face filtering
Wild on-device makeup/face-shape filters on TikTok
What Is ChatGPT Doing … and Why Does It Work?
really really good explanation
From Bing to Sydney
The future, soon: what I learned from Bing's AI
Scribble Diffusion
another image input tool
In defense of prompt engineering
I am still undecided on this - was off it due to all the version changes and prompts breaking with fixes (mainly fixing exploits..), but ultimately some kind of language crafting will be inevitably useful.
microsoft/BioGPT
Generative Pre-trained Transformer for Biomedical Text Generation and Mining
Generative AI and the shrinking time-gap between unrecognizable realities
A quick and sobering guide to cloning yourself
strange times ahead
Big Tech and Generative AI
Let's build GPT: from scratch, in code, spelled out.
very highly recommended by a lot of internet strangers
Secretary jobs in the age of AI
Glass | The First Digital Notebook Designed for Doctors
Imagine 3D (alpha)
Generate 3d objects from a prompt.
MusicLM: Generating Music From Text
Generating music from rich, descriptive captions. I look forward to Google actually making some of these available some day.
Heretical thoughts on AI
Will GPT improve GDP? Eli suspects probably not. Mentions that computers didn't (or haven't yet), and examples of larger industries that are not likely to be immediately affected; either slow moving regulatory blockers, or stuff like buildings that people are still better at.
Midjourney v4 Reference Sheets
Wolfram|Alpha as the Way to Bring Computational Knowledge Superpowers to ChatGPT
How to use ChatGPT to boost your writing
- Be more specific in prompts, for role of the writer, and style and tone. It is not having a conversation
- Don't ask it for facts or references or math.
- Play with memory and length, write smaller sections at a time to refine style
VALL-E
Very impressive speech synthesis demo - 3s of audio input and a passable imitation of the voice and tone.
Microsoft announces new supercomputer, lays out vision for future AI work
LearnGPT: The best ChatGPT examples from around the web
mostly just jokes and funny ones rather than best. I don't even know what best would mean.
Transcript: Ezra Klein Interviews Gary Marcus
8 ChatGPT mistakes to avoid
- Not being specific about your goals
- Not asking it to reduce its output (ask to reduce, remove, compile, or rewrite)
- Mixing topics in a single chat
- Asking only 1 thing at a time
- Prompting in the negative (instead of "without X", add "delete X" to the end of prompt)
- Not giving examples
- Asking it to do math
- Not iterating
On the need for anonymity online
Transformers from Scratch
chinchilla's wild implications
on scaling laws of language models. I still know too little about all of this to make much sense of it.
Death of progress due to AI dependency
The risk of bots filling in critical knowledge/skill gaps, preventing experts in those areas.
Artificial Intelligence - Our World in Data
The viral AI avatar app Lensa undressed me—without my consent
The very bad image dataset bias issue
There is a decision being made about you in this box
From my knowledge, the cost of large language model search
Estimate that ChatGPT will cost $150-$200 per month.
Stable Diffusion Is the Most Important AI Art Model Ever
Still a lot of unresolved ethical (and legal?) questions around generating art in the style of others, but open source data for these models seems an important place to start.
Before the flood
Why "Prompt Engineering" and "Generative AI" are overhyped
We are still figuring out the UI for AIs - making it invisible, zero-click, part of the main input.
Gives Copilot and lex.page as examples; that the bot shares the input box you are using.
Problems with prompt engineering like random phrases you need, that can break between versions.
Either due to dataset changes (like artist names in Stable Diffusion v2), or just other model changes.
And the general interaction pattern; i.e a chat vs a text generator in a document.
Language models hallucinate, and solving that is AGI-hard.
ChatGPT: Optimizing Language Models for Dialogue
ChatGPT interacts in a conversational way. Meant to let it answer followup questions & challenge incorrect premises.
Interesting how significant the UI is for these tools.
Still a lot of echoes of earlier models - it will often erroneously bring up earlier lines from chat, there's not real understanding of the conversation.
The Near Future of AI is Action-Driven
probably time to learn about transformers
Palette - Colorize Photos
Invasive Diffusion: How one unwilling illustrator found herself turned into an AI model
Ethics of the theft of artistic style.
I record myself on audio 24x7 and use an AI to process the information
Some variations, on the challenges of identifying speakers, noise, voice recognition (lot of people using Whisper now).
On perfect memory preventing you from escaping the past - getting caught up reliving things.
AudioLM Examples - Speech and music continuation
Very impressive examples. Continuing speech or piano after a 4s prompt.
Author’s note
Background on their short story, written by Wordcraft AI
Explainpaper
Explain text from papers or answer questions. Cool way to fill in gaps on unfamiliar topics.
From show HN
Viral video of pig balancing on a ball is CGI
what a time to be alive
Lex
Another AI writing tool to try
Title:Avoiding the Great Filter: A Simulation of Important Factors for Human Survival
Counterarguments to the basic AI risk case
Show HN: InvokeAI, an open source Stable Diffusion toolkit and WebUI | Hack
AI is already better at lip reading that we are
Large Motion Frame Interpolation
animate frames between two images
High-performance image generation using Stable Diffusion in KerasCV
Generative AI: A Creative New World
Imagen Video: high definition video generation with diffusion models | Hack
Stable Diffusion Image Variations : a Hugging Face Space by lambdalabs
Simpler image variation than stability-ai one, from another random deep learning company.
A quick look at their website and making a text-to-pokemon model looked interesting.
run stability-ai/stable-diffusion in browser
who is paying for all these gpus
Novel View Synthesis with Diffusion Models
3d model generation (or animation of, at least) from single reference image and a pose
How to Run Stable Diffusion on Your PC to Generate AI Images
Very easy to follow setup guide for Stable Diffusion, and for running python apps on windows.
Installed windows terminal, setup anaconda prompt to auto init directory. Just some good stupid little tips like that
Show HN: Open Prompts – dataset of 10M Stable Diffusion generations
AI Content Generation, Part 1: Machine Learning Basics
Great point that inputs and prompts are actually more like search queries, and great explainer of mapping tagged images to dimensions.
Is searching a boundless space creativity? If all images are represented as numbers, and you are basically picking a number, are you creating or finding? Common twitter sentiment I have seen recently is that it isn't creative work, that feeding/refining prompts is really just more like search. Or that it's like gambling.
4.2 Gigabytes, or: How to Draw Anything
great write up of working through some prompts and backgrounds used for some drawings. For some reason I particularly liked how they added birds
Stable Diffusion is a really big deal
Freya Holmér on Twitter: "I don't think I consider AI generated images art anymore
Can search queries be considered creative input?
Exploring 12 Million of the 2.3 Billion Images Used to Train Stable Diffusion’s Image Generator
Looking inside the black box
DALL·E 2 prompt book [pdf]
Still mixed feelings on DALL-E, the book is good though. Useful some well-explained concepts.
from hn.
How I Used DALL·E 2 to Generate The Logo for OctoSQL
some interesting extra ideas, on top of those from the DALL-E prompt book - tricks like "in a circle" to get something logo-style that is sufficiently centred.
Also smart:
Finally, I did a bunch of Google reverse image searches for it. You know, just to be sure.
Scott Aaronson will work at OpenAI for a 1 year sabbatical : Hacker News
Check back on his blog in a year
faces/hands often look strange, but on the whole, it seems useful for rapidly getting a starting point
Interesting that the generated drawings often have strange faces/hands. Is it because they are more complex shapes? Because we notice issues with them more?
Deepfake Offensive Toolkit dot
real-time, controllable deepfakes ready for virtual cameras injection. The future is terrifying
Imagen
Another text-to-image model. This stuff is getting crazy
DALL-E, the Metaverse, and Zero Marginal Content
Artificial Intelligence Can Now Craft Original Jokes
Publication of the FSF-funded white papers on questions around Copilot
Show HN: Cloning a musical instrument from 16 seconds of audio
Github Copilot Wants to Play Chess Instead of Code
Me too, buddy. Me too
YoHa
Hand tracking in the browser!
Stable Baselines3
Nick Bostrom: Simulation and Superintelligence | Lex Fridman Podcast #83
AI-tocracy
Paper suggesting that innovation and autocracy can be mutually reinforcing, with data and case studies from China.
One-Half of a Manifesto
Neuroscience’s Existential Crisis
First steps with GPT-3 for frontend developers
Descript: edit audio/video as text
An impressive recommendation from @patio11
Also does voice generation from text! And remove filler words. Some cool stuff in there.
The Uselessness of Useful Knowledge
GPT Code Clippy, the Open Source Version of GitHub Copilot
stylegan3 training on landscapes
Crazy stuff. Also the generated faces
Whole Brain Emulation: No Progress on C. elgans After 10 Years
The Intelligence of Bodies
Very interesting discussion. Computers' ability to solve tasks goes from 'impossible' to boring as soon as they solve it! Though I am still regularly impressed by GPS...
I’m midway in the philosophizing here, but my point so far is obvious enough: The ability of a machine to do or outdo something humans do is interesting once at most. Deep Blue isn’t playing chess anymore and Watson isn’t on “Jeopardy!” because nobody cares. It doesn’t matter. We humans need to see the human doing it: Willie Mays making the catch that doesn’t look possible. When it comes to art, we need to see a woman or a man struggling with the universal mediocrity that is the natural lot of all of us and somehow out of some mélange of talent, skill, and luck doing the impossible, making something happen that is splendid and moving—or funny, or frightening, or whatever the artist set out to do.
FLAML - Fast and Lightweight AutoML
The Jessica Simulation: Love and loss in the age of A.I.
The Most Impressive AI Demo I Have Ever Seen
I don't know if I am as blown away as Alex but this is very cool. Like machine pair programming with text.
BirdNET – The easiest way to identify birds by sound.
This is the kind of thing where you think "oh sure that would be a neat side project" but the reality is an immense amount of effort
AI doesn't understand scale
Variations on tomato farming
Ethics of AI: Benefits and risks of artificial intelligence
adversarial.io – Fighting mass image recognition
this requirement reads like the whole thing was made for a specific cat picture
It works best with 299 x 299px images that depict one specific object.
Game AI Pro
All three volumes of Game AI Pro available online! For free!
Excavating AI
Digging through the history of biases and problems with training data and categories used for ML tasks
GPT-2 As Step Toward General Intelligence | Slate Star Codex
See all tags.