Posts

Showing posts from June, 2023

Large Language Models

This is the third post in a series that goes from the basics of machine learning to state-of-the-art large language models (ChatGPT, Bard and friends). Here are the links to the entire series: The basics of Artificial Intelligence and Machine Learning; Deep Learning and Neural Networks; Large Language Models (this post); The Transformer Architecture.

As explained in my previous post, neural networks are an ML model designed after the blueprint of our brain, capable of representing complex relationships and hence deep knowledge. The structure of such a neural network - how the artificial neurons are connected, or in mathematical terms the layout of the network graph - is what we call its architecture. Over the last decade or so, ML researchers have found better and better architectures for a number of different tasks, such as computer vision or language understanding. The real-life analogy is how the different parts of our own brain are wired to perform specific functions like vision, memory or ot

Deep Learning and Neural Networks

This is the second post in a series that goes from the basics of machine learning to state-of-the-art large language models (ChatGPT, Bard and friends). Here are the links to the entire series: The basics of Artificial Intelligence and Machine Learning; Deep Learning and Neural Networks (this post); Large Language Models.

Fundamentals of Neural Networks

Artificial neural networks are a type of machine learning model that can be trained to encapsulate knowledge and use it to predict attributes of data (other models are rule sets or decision trees, as discussed in the last post). They are modelled after the neuronal structure of the human brain (or any biological brain for that matter) - maybe that's why they work so well. As in our brain, an artificial neural network consists of an interconnected web of (simulated) neurons. Each neuron receives signals from a number of input neurons, and if the signals accumulate to a certain threshold, it can become activated. If activated, it sends a
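
The threshold behaviour described in this excerpt can be sketched in a few lines of Python. This is only an illustration, not code from the post: the function name `neuron_fires`, the per-input `weights`, and the example numbers are all assumptions made for the sketch.

```python
def neuron_fires(inputs, weights, threshold):
    """Return True if the accumulated input signals reach the threshold.

    `weights` (how strongly each input counts) is an assumption of this
    sketch; the excerpt only says that incoming signals accumulate.
    """
    total = sum(x * w for x, w in zip(inputs, weights))
    return total >= threshold


# Hypothetical example: three input neurons, two of them sending a signal.
signals = [1.0, 0.0, 1.0]
weights = [0.5, 0.9, 0.4]
print(neuron_fires(signals, weights, threshold=0.8))
```

Here the two active inputs add up to 0.9, which is above the threshold of 0.8, so the simulated neuron becomes activated.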