Fifth Semester B.Tech Core Course Syllabus
Course Code: DA321 | Course Name: Multi-modal Data Processing and Learning - I | Credits: 3-0-0-6 |
---|---|---|
Pre-requisite: None | ||
Syllabus: Introduction: Introduction to Multimodal data and applications, Challenges of multimodal data, Data collection & cleaning. Text Processing: Text normalization, Lemmatization, Morphology, Subword tokenization; Text processing and statistics: TFIDF, BM-25, Zipf’s law, Hipf’s law; Language models and smoothing techniques; Vector space models. Speech Processing: Speech production and perception, Acoustic and articulatory phonetics; Short-term analysis: Need and windowing, Energy, Zero-crossing rate, Autocorrelation function, Fourier transform, Spectrogram; Short-term synthesis: Overlap-add method; Cepstrum analysis: Basis and development, mel-cepstrum. Digital Image and Video Processing: Point processing, Neighborhood processing, Enhancement, Edge detection, Segmentation, Feature descriptors, Restoration, Morphological operations, Image transforms, Spatial and temporal data handling. Other Modalities: Biomedical signals, and Conventional multi-modal learning. | ||
Textbooks:
|
||
References:
|
Course Code: DA351 | Course Name: Computer Systems | Credits: 3-0-0-6 |
---|---|---|
Pre-requisite: | ||
Syllabus: Functional units of a computer: CPU, memory, I/O; Data representation; Processor design: Instruction set architecture, pipelining; Memory: Concept of hierarchical memory organization, cache memory, mapping functions and replacement algorithms, main memory organization; Input-Output: I/O transfers - program-controlled, interrupt-driven and DMA. Processes and threads and their scheduling, synchronization, deadlocks in concurrent processes; Memory management basics, demand paging and virtual memory implementation; File system design and implementation. OSI and TCP/IP Model; Local area networks: Multiple access techniques – wired and wireless; Concepts of switched networks, Internet addressing and routing algorithms; Transport protocols, UDP, TCP, flow control, congestion control; Application Layer: Client-Server and P2P architecture, API; Application layer protocols such as DNS, SSL, WWW, HTTP. | ||
Textbooks:
|
||
References:
|
Course Code: DA322 | Course Name: Machine Learning | Credits: 3-0-0-6 |
---|---|---|
Pre-requisite: None | ||
Syllabus: Introduction to learning: supervised and unsupervised, generative and discriminative models, classification and regression problems, performance measures, design of experiments; Feature space and dimensionality reduction: Feature selection, PCA, exploratory factor analysis, LDA, ICA; Unsupervised learning: K-means clustering, hierarchical agglomerative clustering, DBSCAN, MLE, MAP, Bayesian learning, Gaussian Mixture Models; Supervised learning: Bayesian decision theory, Logistic Regression, data balancing, simple perceptron and multi-layer perceptron, Parzen windows, k-nearest neighbor, decision trees, support vector machines; ensemble methods, bagging and boosting; Applications and case studies. | ||
Textbooks:
|
||
References:
|
Course Code: DA311 | Course Name: Machine Learning Laboratory | Credits: 0-0-3-3 |
---|---|---|
Pre-requisite: None | ||
Syllabus: Sci-kit Learn, NumPy and MatPlotLib; PCA and LDA; K-means Clustering, Hierarchical Agglomerative Clustering and DBSCAN; MLE and Bayesian learning; Linear and Logistic Regression; Perceptron; Data Balancing & Imbalance-Learning; Multi-layer perceptron; k-nearest neighbor, Classification and Regression Trees; Support Vector Machines; Random Forest, AdaBoost. | ||
Textbooks:
|
||
References:
|
Course Code: DA341 | Course Name: Applied Time Series Analysis | Credits: 3-0-0-6 |
---|---|---|
Pre-requisite: None | ||
Syllabus: Fundamental components of time series; Preliminary tests: randomness, trend, seasonality; Estimation/elimination of trend and seasonality; Mathematical formulation of time series; Stationarity concepts; Auto Covariance and Autocorrelation functions of stationary time series and its properties; Linear stationary processes and their time-domain properties: AR, MA, ARMA, seasonal, non-seasonal and mixed models; ARIMA models; Multivariate time series processes and their properties: VAR, VMA and VARMA; Parameter estimation of AR, MA, and ARMA models: LS approach, ML approach for AR, MA and ARMA models, Asymptotic distribution of MLE; Best Linear predictor and Partial autocorrelation function; Model-identification with ACF and PACF; Model order estimation techniques; Frequency domain analysis: spectral density and its properties and its estimation, Periodogram analysis. | ||
Textbooks:
|
||
References:
|
Course Code: DA331 | Course Name: Big Data Analytics: Tools and Techniques | Credits: 2-0-2-6 |
---|---|---|
Pre-requisite: None | ||
Syllabus: Fundamentals of Big Data: Understanding big data, datasets, data analysis, data analytics, big data characteristics, types of data, case studies; Big data adoption and planning considerations: data procurement, big data analytics lifecycle, case study examples; Big data storage concepts: cluster computing, file system, distributed file systems, Relational & non-relational databases, scaling up & scaling out storage; No-SQL: Data types, Creating, Updating & Deleting documents, Querying, An example No-SQL database; Distributed computing framework: Introduction, file system, MapReduce programming model, examples of distributed computing environment framework; Stream data processing: tools such as Apache Spark, Apache Storm; Analytics with distributed computing framework: supervised learning examples, unsupervised learning examples. | ||
Textbooks:
|
||
References:
|