Passion Artificial Intelligence: Artificial Intelligence

Artificial Intelligence Author: Djibril Chimère DIAW

Artificial Intelligence, published on April 2, 2023, is a comprehensive and systematically structured academic work that presents the theoretical foundations, historical development, algorithmic architectures, and applied domains of modern artificial intelligence. Conceived as both a reference volume and a structured educational resource, the book integrates machine learning, deep learning, natural language processing, computer vision, robotics, knowledge representation, optimization, evolutionary computation, and augmented intelligence into a unified conceptual framework.

The volume opens with a historical and epistemological overview of artificial intelligence, tracing its development from the foundational period of the 1950s and 1960s through the cycles of optimism and decline in the 1970s–1980s, the resurgence of the 1990s–2000s, and the contemporary era of large-scale data-driven AI systems. This contextual grounding situates modern advances within their intellectual lineage.

A substantial portion of the work is devoted to machine learning, including supervised, unsupervised, semi-supervised, and reinforcement learning. Regression models, classification algorithms, decision trees, support vector machines, ensemble methods, clustering techniques, dimensionality reduction, Q-learning, SARSA, and policy-based approaches are examined both conceptually and algorithmically. Deep learning architectures—convolutional neural networks (CNNs), recurrent neural networks (RNNs), LSTM, GRU, and generative adversarial networks (GANs)—are presented as extensions of statistical learning theory and representation learning.

The book provides extensive coverage of natural language processing (NLP), including sentiment analysis, named entity recognition, topic modeling, dependency parsing, summarization (extractive and abstractive), question answering systems, natural language generation, and machine translation. Both rule-based and statistical paradigms are addressed, alongside neural and transformer-based language models. Key algorithmic families such as Hidden Markov Models, Conditional Random Fields, word embeddings (Word2Vec, GloVe), transformer architectures, and large generative language models are analyzed within a coherent methodological structure.

Computer vision is treated in equal depth, covering traditional feature-based approaches and modern deep learning systems for object detection, image classification, image segmentation, and image captioning. Architectures such as RCNN variants, YOLO, U-Net, ResNet, DenseNet, and transformer-based vision models are integrated into a broader discussion of perception systems.

Beyond perception and language, the volume addresses robotics (kinematics, dynamics, control systems, motion planning, human–robot interaction), expert systems, cognitive computing, speech recognition, swarm intelligence algorithms, evolutionary computation, and optimization methods. Knowledge representation and reasoning are explored through propositional logic, first-order logic, description logic, modal and temporal logic, ontologies, semantic networks, constraint satisfaction, and automated planning.

The final sections extend the discussion toward augmented intelligence, human–computer interaction, intelligent agents, recommender systems, adaptive interfaces, and decision support systems, emphasizing the collaborative dimension between human reasoning and computational intelligence.

Rather than presenting artificial intelligence as a collection of isolated techniques, this book articulates AI as an integrated scientific discipline grounded in mathematics, algorithmic reasoning, formal logic, optimization theory, and computational systems engineering. The structure progresses from foundational concepts to advanced architectures and interdisciplinary applications, making the work suitable for students, independent researchers, engineers, and professionals seeking a rigorous and comprehensive overview.

This first edition captures a particular moment in the evolution of artificial intelligence research and practice. Future editions may expand or refine specific sections as the field continues to advance. Nevertheless, the present volume is conceived as a durable intellectual contribution, preserving a structured and systematic vision of AI in the early decades of the twenty-first century.

Edition information

Publié le : 2 avril 2023
First published: [2023/04/02]
Current version: v1.0
Last updated: [2023/04/02]

Author: [Djibril Chimère DIAW]
Original language: [ENGLISH]
Digital publication: Archive.org
Collection: [Artificial Intelligence]

https://archive.org/details/artificial-intelligence_202602

Contents
Artificial Intelligence    1
Copyright    2
Author’s Note    3
About The Author    4
Dedication    6
To all mothers,    7
Artificial Intelligence (AI)    15
Milestones in AI    16
1950s-1960s :The birth of AI    16
1950s-1960s Perspectives    17
1970s-1980s : The rise and fall of AI    18
1990s-2000s : The resurgence of AI    19
2010s-present :The age of AI    20
1 Machine Learning (ML)    21
1.1 Supervised Learning    22
1.1.1 Regression    23
1.1.1.1 Linear regression    24
1.1.1.2 Polynomial regression    25
1.1.1.3 Ridge regression    26
1.1.1.4 Lasso regression    27
1.1.1.5 Support vector regression (SVR)    28
1.1.2 Classification    29
1.1.2.1 Logistic regression    30
1.1.2.2 Decision trees    31
1.1.2.3 Random forests    32
1.1.2.4 Support vector machines    33
1.1.2.5 Neural networks    34
1.2 Unsupervised Learning    35
1.2.1 Clustering    36
1.2.1.1 K-means clustering    37
1.2.1.2 Hierarchical clustering    38
1.2.1.3 Density-based clustering    39
1.2.2 Dimensionality reduction    40
1.2.2.1 Feature selection    41
1.2.2.2 Feature extraction    42
1.3 Semi-supervised Learning    43
1.3.1 Self-training    44
1.3.2 Co-training    45
1.3.3 Graph-based methods    46
1.4 Reinforcement Learning    47
1.4.1 Value-based methods    48
1.4.1.1 Q-learning    49
1.4.1.2 SARSA (State-Action-Reward-State-Action)    50
1.4.2 Policy-based methods    51
1.4.2.1 Policy gradient method    52
1.4.2.2 Actor-critic method    53
2 Deep Learning    54
2.1 Convolutional Neural Networks (CNN)    55
2.2 Recurrent Neural Networks (RNN)    56
2.2.1 Simple Recurrent Neural Networks    57
2.2.2 Gated Recurrent Units (GRU) Networks    58
2.2.3 Long Short-Term Memory (LSTM) networks    59
2.3 Generative Adversarial Networks (GAN)    60
3 Natural Language Processing (NLP)    61
3.1 Sentiment Analysis    62
3.1.1 Rule-based methods    63
3.1.2 Machine learning-based methods    64
3.2 Named Entity Recognition (NER)    65
3.3 Text Summarization    66
3.3.1 TextRank    67
3.3.2 Latent Semantic Analysis (LSA)    68
3.3.3 Latent Dirichlet Allocation (LDA)    69
3.4 Language Translation    70
3.4.1 Rule-based machine translation    71
3.4.2 Statistical machine translation    72
3.4.3 Neural machine translation    73
4 Computer Vision (CV)    74
4.1 Object Detection    75
4.1.1 Traditional computer vision methods    76
4.1.1.1 Feature detection and extraction    77
4.1.1.2 Template matching    78
4.1.1.3 Image segmentation    79
4.1.1.4 Optical flow    80
4.1.1.5 Hough transform    81
4.1.2 Region-based convolutional neural networks (RCNNs)    82
4.1.2.1 Fast R-CNN    83
4.1.2.2 Faster R-CNN    84
4.1.2.3 Mask R-CNN    85
4.1.3 Single-shot detectors    86
4.1.3.1 VGG (Visual Geometry Group)    87
4.1.3.2 ResNet (Residual Network)    88
4.1.4 You Only Look Once (YOLO)    89
4.1.4.1 Anchor boxes    90
4.1.4.2 Non-maximum suppression    91
4.2 Image Segmentation    92
4.2.1 Thresholding    93
4.2.2 Edge-based segmentation    94
4.2.3 Region-based segmentation    95
4.2.4 Watershed segmentation    96
4.2.5 Deep learning-based segmentation    97
4.2.5.1 U-Net    98
4.2.5.2 Fully Convolutional Networks (FCN)    99
4.2.5.3 DeepLab    100
4.2.5.4 SegNet    101
4.3 Image Classification    102
4.3.1 Traditional computer vision methods    103
4.3.1.1 Bag-of-Visual-Words model    104
4.3.1.2 Histogram of Oriented Gradients (HOG) feature descriptor    105
4.3.2 Deep learning-based methods    106
4.3.2.1 Convolutional neural networks (CNNs)    107
4.3.2.2 Transfer learning    108
4.3.2.3 Residual networks (ResNets)    109
4.3.2.4 Dense convolutional networks (DenseNets)    110
4.3.2.5 Inception networks    111
4.4 Image Captioning    112
4.4.1 CNN-RNN model    113
4.4.2 Encoder-Decoder model    114
4.4.3 Attention-based model    115
4.4.4 Transformer-based model    116
5 Robotics    117
5.1 Kinematics    118
5.1.1 Forward kinematic    119
5.1.2 Inverse kinematics    120
5.2 Dynamics    121
5.2.1 Forward Dynamics    122
5.2.2 Inverse Dynamics    123
5.3 Control Systems    124
5.3.1 Classical control theory    125
5.3.2 Fuzzy logic control    126
5.3.3 Neural network-based control    127
5.4 Robot perception    128
5.5 Motion planning    129
5.6 Human-robot interaction    130
6. Expert Systems    131
6.1 Rule-based Systems    132
6.2 Knowledge-based Systems    133
6.3 Decision Support Systems    134
6.4 Fuzzy Logic Systems    135
7. Cognitive Computing    136
7.1 Speech Recognition    137
7.1.1 Acoustic modeling    138
7.1.1.1 Hidden Markov Models    139
7.1.1.2 Gaussian Mixture Models (GMMs)    140
7.1.1.3 Deep Neural Networks (DNNs)    141
7.1.1.4 Convolutional Neural Networks (CNNs)    142
7.1.1.5 Recurrent Neural Networks (RNNs)    143
7.1.2 Language modeling algorithms    144
7.1.2.1 n-gram models    145
7.1.2.2 Neural language models    146
7.1.2.2.1 Recurrent neural network language model (RNNLM)    147
7.1.2.2.2 Transformer language model    148
7.1.2.3 Transformer-based models    149
7.1.2.4 Maximum Entropy Markov Models    150
7.1.2.5 Conditional Random Fields    151
7.1.2.6 Probabilistic context-free grammars    152
7.1.2.7 Word embeddings    153
7.1.2.7.1 Word2Vec    154
7.1.2.7.2 GloVe    155
7.1.2.8 Recurrent neural networks (RNNs)    156
7.1.2.9 Hidden Markov Models (HMMs)    157
7.1.3 Decoding algorithms    158
7.1.3.1 Viterbi algorithm    159
7.1.3.2 Beam search    160
7.1.3.3 CYK algorithm    161
7.1.3.5 Earley algorithm    162
7.2 Emotion Detection    163
7.2.1 Support Vector Machines (SVMs)    165
7.2.2 Random Forest    166
7.2.3 Deep Learning applications    167
7.2.4 Naive Bayes applications    168
7.2.5 K-Nearest Neighbors (KNN)    169
7.2.6 Decision Trees applications    170
7.3 Natural Language Understanding    171
7.3.1 Information Retrieval algorithm    172
7.3.1.1 Boolean retrieval    174
7.3.1.2 Vector space model (VSM)    175
7.3.1.3 Latent Semantic Analysis (LSA)    176
7.3.1.4 Latent Dirichlet Allocation (LDA)    177
7.3.1.5 PageRank    178
7.3.1.6 TF-IDF (Term Frequency-Inverse Document Frequency)    179
7.3.2 Topic Modeling algorithm    180
7.3.2.1 Non-negative Matrix Factorization (NMF)    181
7.3.2.2 Hierarchical Dirichlet Process (HDP)    182
7.3.2.3 Latent Semantic Analysis (LSA)    183
7.3.2.4 Latent Dirichlet Allocation (LDA)    184
7.3.3 Dependency Parsing    185
7.3.3.1 Arc-standard parsing algorithm    186
7.3.3.2 Arc-eager parsing algorithm    187
7.3.4 Summarization algorithm    188
7.3.4.1 Extractive summarization    189
7.3.4.1.1 TextRank    190
7.3.4.1.2 LexRank    191
7.3.4.1.3 Latent Semantic Analysis (LSA)    192
7.3.4.2 Abstractive summarization    193
7.3.4.2.1 Pointer-Generator Networks    194
7.3.4.2.2 Transformer models    195
7.3.4.2.2.1 Bidirectional Encoder Representations from Transformers,    196
7.3.4.2.2.2 GPT-2 (Generative Pre-trained Transformer 2)    197
7.3.4.2.3 GPT-3    198
7.3.5 Question Answering (QA) algorithm    199
7.3.5.1 Information Retrieval-Based QA    200
7.3.5.2 Knowledge-Based QA    201
7.3.5.3 Language Model-Based QA    202
7.3.5.4 Hybrid QA    203
7.3.6 Natural Language Generation algorithms    204
7.3.6.1 Template-based generation    205
7.3.6.2 Rule-based generation    206
7.3.6.3 Markov Chain-based generation    207
7.3.6.4 Neural network-based generation    208
7.3.6.5 GPT (Generative Pre-trained Transformer)    209
7.3.6.6 Text-to-Speech (TTS) generation    210
7.3.6.7 Planning-based generation    211
7.3.6.8 Evolutionary algorithms-based generation    212
7.3.7 Machine Translation algorithm    213
7.3.7.1 Rule-based Machine Translation    214
7.3.7.2 Statistical Machine Translation    215
7.3.7.3 Neural Machine Translation    216
7.3.7.4 Hybrid Machine Translation    217
7.3.7.5 Example-based Machine Translation    218
7.3.7.6 Transfer-based Machine Translation    219
7.3.7.7 Interactive Machine Translation    220
7.3.8 Named Entity Recognition (NER)    221
7.3.8.1 Rule-Based NER    222
7.3.8.2 Statistical NER    223
7.3.8.3 Machine Learning NER    224
7.3.8.4 Deep Learning NER    225
7.3.8.5 Hybrid NER    226
7.3.8.6 Unsupervised NER    227
8 Swarm Intelligence    228
8.1 Ant Colony Optimization    229
8.2 Artificial Bee Colony    231
8.3 Bat Algorithm    232
8.4 Bacterial Foraging Optimization    233
8.5 Cuckoo Search (CS) algorithm    234
8.6 Firefly Algorithm    235
8.7 Fish Swarm Optimization    236
8.8 Genetic Algorithm    237
8.9 Glowworm Swarm Optimization    239
8.10 Grey Wolf Optimizer    240
8.11 Harmony Search    241
8.12 Particle Swarm Optimization    242
8.13 Social Spider Optimization    243
8.14 Whale Optimization Algorithm    244
9 Knowledge Representation and Reasoning algorithms    245
9.1 Logic-based approaches    247
9.1.1 Propositional logic    248
9.1.2 First-order logic    249
9.1.3 Description logic    250
9.1.4 Modal logic    251
9.1.4.1 Propositional modal logic    252
9.1.4.2 First-order modal logic    253
9.1.4.3 Dynamic logic    254
9.1.5 Default Logic    255
9.1.6 Fuzzy Logic    256
9.1.7 Temporal logic    257
9.2 Ontologies    258
9.3 Semantic Networks    260
9.3.1 Conceptual graph    261
9.3.2 Ontology engineering    263
9.3.3 Semantic Web    265
9.4 Frames    267
9.5 Case-Based Reasoning    269
9.6 Inductive Logic Programming    270
9.7 Constraint Satisfaction    271
9.7.1 Backtracking    272
9.7.2 Constraint propagation    273
9.7.2.1 Forward checking    275
9.7.2.2 Arc consistency    276
9.7.3 Local search    277
10 Planning and Scheduling    278
10.1 Automated Planning    280
10.2 Temporal Reasoning    281
10.3 Resource Allocation    282
10.4 Activity Recognition    283
10.5 Planning    284
10.5.1 Forward planning    287
10.5.2 Backward planning    288
10.5.3 State-space search    289
10.5.4 Heuristic search    290
10.6 Scheduling    291
10.6.1 Constraint-based scheduling    292
10.6.2 Resource-constrained scheduling    293
10.6.3 Dynamic scheduling    294
10.6.4 Optimization-based scheduling    295
11 Evolutionary Computation    296
11.1 Genetic Algorithms    297
11.2 Genetic Programming    298
11.3 Evolution Strategies    299
11.4 Evolutionary Programming    300
11.5 Differential Evolution    301
11.6 Artificial Immune Systems    302
11.6.1 Clonal selection algorithm (CSA)    303
11.6.2 Artificial immune network    304
11.6.1 Clonal selection algorithm (CSA)    305
11.6.2 Artificial immune network    306
11.6.3 Negative selection algorithm    307
11.6.3 Negative selection algorithm    308
12. Augmented Intelligence    309
12.1 Human-Computer Interaction    310
12.2 Decision Support Systems    311
12.2.1 Model-driven Decision Support Systems    312
12.2.2 Data-driven Decision Support Systems    314
12.2.3 Knowledge-driven Decision Support Systems    316
12.3 Intelligent User Interfaces    318
12.3.1 Adaptive Interfaces    320
12.3.2 Intelligent Agents    321
12.3.3 Multimodal Interfaces    323
12.3.4 Augmented Reality Interfaces    324
12.3.5 Conversational Interfaces    325
12.4 Recommender Systems    326
12.4.1 Content-based recommender systems    327
12.4.2 Collaborative filtering recommender systems    328
12.4.3 Hybrid recommender systems    330
Biliography    331
Books    331
Articles    334

mardi 10 mars 2026

Artificial Intelligence

Artificial Intelligence