Reinforcement learning (machine learning)
Reinforcement learning is one of the main algorithms used in machine learning in the context of an agent in an environment. In each timestep, this agent takes in information from their environment and performs an action. Certain actions reward the agent.
Reinforcement learning maximizes these rewards, a difficult task given that these rewards may be very infrequent and may not have an obvious correlation with each action. Common reinforcement learning tasks including teaching a robot to walk, a car to drive.
Reinforcement learning can be used in image classification by implementing attention mechanisms. In such a problem, a model can choose to focus on smaller parts of a high resolution image, moving to different parts depending on what it sees, and subsequently producing a predictive label.
Related Radiopaedia articles
Artificial intelligence
- artificial intelligence (AI)
- imaging data sets
- computer-aided diagnosis (CAD)
- natural language processing
- machine learning (overview)
- visualizing and understanding neural networks
- common data preparation/preprocessing steps
- DICOM to bitmap conversion
- dimensionality reduction
- scaling
- centering
- normalization
- principal component analysis
- training, testing and validation datasets
- augmentation
- loss function
- optimization algorithms
- ADAM
- momentum (Nesterov)
- stochastic gradient descent
- mini-batch gradient descent
- regularisation
- linear and quadratic
- batch normalization
- ensembling
- rule-based expert systems
- glossary
- activation function
- anomaly detection
- automation bias
- backpropagation
- batch size
- computer vision
- concept drift
- cost function
- confusion matrix
- convolution
- cross validation
- curse of dimensionality
- dice similarity coefficient
- dimensionality reduction
- epoch
- explainable artificial intelligence/XAI
- feature extraction
- federated learning
- gradient descent
- ground truth
- hyperparameters
- image registration
- imputation
- iteration
- jaccard index
- linear algebra
- noise reduction
- normalization
- R (Programming language)
- Python (Programming language)
- segmentation
- semi-supervised learning
- synthetic and augmented data
- overfitting
- transfer learning