SFr. 75.00
€ 81.00


bestellen

Artikel-Nr. 42745309


Diesen Artikel in meine
Wunschliste
Diesen Artikel
weiterempfehlen
Diesen Preis
beobachten

Weitersagen:



Autor(en): 
  • Mark Liu
  • Build a Text-to-Image Generator (from Scratch) 
     

    (Buch)
    Dieser Artikel gilt, aufgrund seiner Grösse, beim Versand als 3 Artikel!


    Übersicht

    Auf mobile öffnen
     
    Lieferstatus:   i.d.R. innert 7-14 Tagen versandfertig
    Veröffentlichung:  Januar 2026  
    Genre:  EDV / Informatik 
     
    3D-Grafik und Modellierung / Mensch-Computer-Interaktion / Systemanalyse und -design
    ISBN:  9781633435421 
    EAN-Code: 
    9781633435421 
    Verlag:  Manning Publications 
    Einband:  Gebunden  
    Sprache:  English  
    Dimensionen:  H 235 mm / B 212 mm / D 23 mm 
    Gewicht:  650 gr 
    Seiten:  360 
    Bewertung: Titel bewerten / Meinung schreiben
    Inhalt:
    Get a free eBook (PDF or ePub) from Manning as well as access to the online liveBook format (and its AI assistant that will answer your questions in any language) when you purchase the print book.

    This book takes you step-by-step through creating your own AI models that can generate images from text. You’ll explore two methods of image generation—vision transformers and diffusion models—and learn vital AI development techniques as you go.

    Dive into the powerful models behind AI image generators. The best way to learn is to build something from scratch, and in this book you’ll build your very own diffusion model and vision transformer. As you work through each stage of development, you’ll develop an understanding of how these models can be customized, applied, and integrated for impressive multimodal AI.

    Build a Text-to-Image Generator (from Scratch) teaches you how to:

    • Build and train models to generate high resolution images based on text descriptions
    • Edit an existing image based on text prompts
    • Build and train a model to add captions to images
    • Build and train a vision transformer to classify images
    • Fine-tune LLMs for downstream tasks such as classification, text or image generation
    • Better differentiate real images from deepfakes

    About the technology

    AI-generated images appear everywhere from high-end advertising to casual social media feeds. Text-to-image tools like Dall-e, Midjourney, and Flux make it easy to create AI art, but how do they work? In this book, you’ll find out by building your own text-to-image generator!

    About the book

    Build a Text-to-Image Generator (from Scratch) explores both transformer-based image generation and diffusion models. You’ll work hands-on to build a pair of simple generation models that can classify images, automatically add captions, reconstruct images, and enhance existing graphics. Author Mark Liu guides you every step of the way with clear explanations, informative diagrams, and eye-opening examples you can build on your own laptop.

    What's inside

    • Build a vision transformer to classify images
    • Edit images using text prompts
    • Fine-tune image models

    About the reader

    Requires basic knowledge of generative AI models and intermediate Python skills.

    About the author

    Mark Liu is the founding director of the Master of Science in Finance program at the University of Kentucky. He is also the author of Learn Generative AI with PyTorch.

    Table of Contents

    Part 1
    1 A tale of two models: Transformers and diffusions
    2 Build a transformer
    3 Classify images with a vision transformer
    4 Add captions to images
    Part 2
    5 Generate images with diffusion models
    6 Control what images to generate in diffusion models
    7 Generate high-resolution images with diffusion models
    Part 3
    8 CLIP: A model to measure the similarity between image and text
    9 Text-to-image generation with latent diffusion
    10 A deep dive into Stable Diffusion
    Part 4
    11 VQGAN: Convert images into sequences of integers
    12 A minimal implementation of DALL-E
    Part 5
    13 New developments and challenges in text-to-image generation
    A Installing PyTorch and enabling GPU training locally and in Colab

      



    Wird aktuell angeschaut...
     

    Zurück zur letzten Ansicht


    AGB | Datenschutzerklärung | Mein Konto | Impressum | Partnerprogramm
    Newsletter | 1Advd.ch RSS News-Feed Newsfeed | 1Advd.ch Facebook-Page Facebook | 1Advd.ch Twitter-Page Twitter
    Forbidden Planet AG © 1999-2026
    Alle Angaben ohne Gewähr
     
    SUCHEN

     
     Kategorien
    Im Sortiment stöbern
    Genres
    Hörbücher
    Aktionen
     Infos
    Mein Konto
    Warenkorb
    Meine Wunschliste
     Kundenservice
    Recherchedienst
    Fragen / AGB / Kontakt
    Partnerprogramm
    Impressum
    © by Forbidden Planet AG 1999-2026