Multimodal Artificial Intelligence and Large Language Models

A Comprehensive Guide from Theory to Practice

By L Ashok Kumar, D Karthika Renuka

Hardcover, English, 2026

2 040 kr

Forthcoming

Description

The book provides a comprehensive technical analysis of multimodal artificial intelligence systems and implementation frameworks. It offers thorough coverage of cross-modal processing methods for applications such as speech recognition and automatic image captioning.

  • Presents a detailed discussion of architecture for integrating text, image, audio, and video modalities, cross-modal processing pipelines, and data fusion techniques.

  • Showcases real-time synchronization mechanisms across different modalities and scalable design patterns for multimodal systems.

  • Discusses multimodal emotion recognition using deep learning techniques, focusing on recent advancements, challenges, and ethical considerations.

  • Investigates deployment optimization strategies to address issues with latency, resource usage, and scalability of multimodal systems.

  • Focuses on techniques for performance optimization, memory management, and distributed processing for multimodal workloads using frameworks like PyTorch and TensorFlow.

The text is primarily written for senior undergraduates, graduate students, and academic researchers in electrical engineering, electronics and communications engineering, computer science and engineering, and information technology.
