| Type | Group |
| Attentional Interface | Attention-Memory |
| Memory-Attention Networks | Attention-Memory |
| One-Shot Associative Memory | Attention-Memory |
| KeyValue Memory Networks | Attention-Memory |
| Compositional Attention Network | Attention-Memory |
| Deep Memory Network | Attention-Memory |
| Structured Attention Network | Attention-Memory |
| Hyperbolic Attention Network | Attention-Memory |
| Multi-Cast Attention Network | Attention-Memory |
| Bi-Directional Attention Flow | Attention-Memory |
| Variational Autoencoder | Autoencoder |
| Autoencoder | Autoencoder |
| Denoising Autoencoder | Autoencoder |
| Sparse Autoencoder | Autoencoder |
| Contrastive Autoencoder | Autoencoder |
| Feedforward | Basic |
| Perceptron | Basic |
| Multilayer Perceptron | Basic |
| Deep Convolutional Network | CNN |
| Convolutional Deep Belief Network | CNN |
| Convolutional GAN | CNN |
| DeConvolutional Network | CNN |
| Deep Convolutional Inverse Graphics Network | CNN |
| Geometric Deep Learning | CNN |
| Convolutional Kernel Networks | CNN |
| Convolutional Autoencoder | CNN |
| Hierarchical Convolutional Deep Maxout Network | CNN |
| Deep Belief Network | DBN |
| Continuous DQN | DQN |
| Deep Q Network | DQN |
| Dueling DQN | DQN |
| Episodic-Memory DQN | DQN |
| Bidirectional LSTM | LSTM |
| Convolutional LSTM | LSTM |
| Grid LSTM | LSTM |
| Long Short Term Memory | LSTM |
| Peephole LSTM | LSTM |
| Phrasal LSTM | LSTM |
| Hierarchical LSTM | LSTM |
| Gated Recurrent Unit | LSTM |
| Adaptive Resonance Theory | Modular |
| Maximum Entropy | Modular |
| Counterpropogation | Modular |
| Spline | Modular |
| Gaussian | Modular |
| Neocognitron | Neural |
| Neural Programmer | Neural |
| Neural Turing Machine | Neural |
| Neuro-Fuzzy | Neural |
| Neuroevolution | Neural |
| Neural Associative Memory | Neural |
| Neural Hawkes Process Memory | Neural |
| Sequence-2-Sequence | Other |
| Deep Feedforward | Other |
| Deep Neural Network | Other |
| Helmholtz Machine | Other |
| Hopfield Network | Other |
| Kohonen Network | Other |
| Compound Hierarchical Deep Model | Other |
| Dense Associative Memory | Other |
| Hierarchical Temporal Memory | Other |
| Large Memory Storage and Retrieval Network | Other |
| Generative Adversarial Network | Other |
| Associative Neural Network | Other |
| Adaptive Computation Time | Other |
| Deep Coding Network | Other |
| Deep Deterministic Policy Gradient | Other |
| Deep Predictive Coding Network | Other |
| Deep Reservoir Computing | Other |
| Deep Residual Network | Other |
| Deep Stacking Network | Other |
| Diffusion Network | Other |
| Echo state Network | Other |
| Elman Jordan Network | Other |
| Extreme Learning Machine | Other |
| Instantaneously Trained Neural Network | Other |
| Learning Vector Quantization | Other |
| Liquid State Machines | Other |
| Spiking Neural Network | Other |
| Tensor Deep Stacking Network | Other |
| Radial Basis Function | Other |
| Recursive Neural Network | Other |
| Markov Chain | Probabilistic |
| Deep Bayesian Neural Network | Probabilistic |
| Deep Markov Model | Probabilistic |
| Stochastic Neural Network | Probabilistic |
| Spike and Slab RBM | RBM |
| Boltzmann Machine | RBM |
| Restricted Bolzmann Machine | RBM |
| Bidirectional RNN | RNN |
| Clockwork RNN | RNN |
| Continuous Time RNN | RNN |
| Dilated RNN | RNN |
| Hierarchical RNN | RNN |
| Recurrent Neural Network | RNN |
| Second Order RNN | RNN |
| Multi-Time Scales RNN | RNN |
| Recurrent Multilayer Perceptron | RNN |
| Deep Kernel Machine | SVM |
| Support Vector Machine | SVM |
| Shallow Neural Networks | ThoughtVectors/WordVectors |
*Shallow = one hidden layer in NN
*Deep = more than one hidden layer in NN