Posts

Showing posts with the label chat gpt

Large Language Models

This is the third post in a series from the basics of machine learning to state-of-the-art large language models (ChatGPT, Bard and friends). Here are the links to the entire series: The basics of Artificial Intelligence and Machine Learning; Deep Learning and Neural Networks; Large Language Models (this post); The Transformer Architecture. As explained in my previous post, neural networks are a type of ML model designed after the blueprint of our brain, capable of representing complex relationships and hence deep knowledge. The structure of such a neural network - how the artificial neurons are connected, or in mathematical terms the layout of the network graph - is what we call its architecture. Over the last decade or so, ML researchers have found better and better architectures for a number of different tasks, such as computer vision or language understanding. The analogy in real life is how the different parts of our own brain are wired to perform specific tasks like vision...

Deep Learning and Neural Networks

This is the second post in a series from the basics of machine learning to state-of-the-art large language models (ChatGPT, Bard and friends). Here are the links to the entire series: The basics of Artificial Intelligence and Machine Learning; Deep Learning and Neural Networks (this post); Large Language Models. Fundamentals of Neural Networks: Artificial neural networks are a type of machine learning model that can be trained to encapsulate knowledge and use it to predict attributes of data (other models are rule sets or decision trees, as discussed in the last post). They are modelled after the neuronal structure of the human brain (or any biological brain for that matter) - maybe that's why they work so well. As in our brain, artificial neural networks consist of an interconnected web of (simulated) neurons. Each neuron receives signals from a number of input neurons, and if the signals accumulate to a certain threshold, it becomes activated. If activated, it sends a...
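To make the threshold idea from the excerpt above concrete, here is a minimal sketch of a single simulated neuron in Python. The function name `neuron_fires`, the per-connection weights, and the example numbers are all assumptions made for illustration; the excerpt itself only says that incoming signals accumulate until a threshold is reached.

```python
def neuron_fires(inputs, weights, threshold):
    """Return True if the weighted sum of incoming signals reaches the threshold.

    The weights (how strongly each connection counts) are an illustrative
    assumption; the excerpt only describes signals accumulating to a threshold.
    """
    total = sum(x * w for x, w in zip(inputs, weights))
    return total >= threshold


# Three hypothetical input neurons: the first and third are sending a signal.
signals = [1.0, 0.0, 1.0]
weights = [0.5, 0.9, 0.25]

# The accumulated signal is 0.5 + 0.25 = 0.75.
print(neuron_fires(signals, weights, threshold=0.7))  # True: 0.75 reaches 0.7
print(neuron_fires(signals, weights, threshold=0.8))  # False: 0.75 stays below 0.8
```

Real networks usually replace this hard yes/no step with a smoother activation function, but the accumulate-then-compare logic is the same basic idea.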

Intro to LLMs and Generative AI

In my first post, I mentioned the emergence of generative AI - ChatGPT, Bard and friends. But what are these mysterious black boxes that can produce creative texts and are seemingly indistinguishable from human intelligence? Let's break it down. This is the first in a series of three posts explaining how these models work: The basics of Artificial Intelligence and Machine Learning (this post); Deep Learning and Neural Networks; Large Language Models. What's Artificial Intelligence and Machine Learning? Both terms are hot buzzwords these days, and there are many definitions for both. One of the founding fathers of the field, the great John McCarthy, simply defined Artificial Intelligence (AI) as "the science and engineering of making intelligent machines". Well, that didn't help! But that's really what it is: Making machines act intelligently. In other words, make them act like humans. AI is an umbrella term for different fields and techniques pursuin...

Welcome!

We live in exciting times! This is probably a universal statement that many people would have made at any given time throughout the ages, but it certainly feels true and real to me. Lots of things are happening in the world, for better and for worse. Either way, change is happening all around us, and change presents opportunities. I started to see some of these opportunities in two different worlds - the world of artificial intelligence, and the world of writing. A few years ago, many would've said these things don't mix. But with recent advancements in machine learning (the emergence of ChatGPT, Bard and other generative AI systems has probably shaken the public more than any other discovery in the field), it's not a stretch to say these two worlds are on a collision course. Whether that's a death sentence for creative writing (the Writers Guild of America seems to think so) or a great opportunity, only time will tell, but it's certainly an int...