Illustrious: an Open Advanced Illustration Model

Bibliographic Details
Title: Illustrious: an Open Advanced Illustration Model
Authors: Park, Sang Hyun, Koh, Jun Young, Lee, Junha, Song, Joy, Kim, Dongha, Moon, Hoyeon, Lee, Hyunju, Song, Min
Publication Year: 2024
Collection: Computer Science
Subject Terms: Computer Science - Computer Vision and Pattern Recognition
More Details: In this work, we share the insights for achieving state-of-the-art quality in our text-to-image anime image generative model, called Illustrious. To achieve high resolution, dynamic color range images, and high restoration ability, we focus on three critical approaches for model improvement. First, we delve into the significance of the batch size and dropout control, which enables faster learning of controllable token based concept activations. Second, we increase the training resolution of images, affecting the accurate depiction of character anatomy in much higher resolution, extending its generation capability over 20MP with proper methods. Finally, we propose the refined multi-level captions, covering all tags and various natural language captions as a critical factor for model development. Through extensive analysis and experiments, Illustrious demonstrates state-of-the-art performance in terms of animation style, outperforming widely-used models in illustration domains, propelling easier customization and personalization with nature of open source. We plan to publicly release updated Illustrious model series sequentially as well as sustainable plans for improvements.
Document Type: Working Paper
Access URL: http://arxiv.org/abs/2409.19946
Accession Number: edsarx.2409.19946
Database: arXiv
FullText Text:
  Availability: 0
CustomLinks:
  – Url: http://arxiv.org/abs/2409.19946
    Name: EDS - Arxiv
    Category: fullText
    Text: View this record from Arxiv
    MouseOverText: View this record from Arxiv
  – Url: https://resolver.ebsco.com/c/xy5jbn/result?sid=EBSCO:edsarx&genre=article&issn=&ISBN=&volume=&issue=&date=20240930&spage=&pages=&title=Illustrious: an Open Advanced Illustration Model&atitle=Illustrious%3A%20an%20Open%20Advanced%20Illustration%20Model&aulast=Park%2C%20Sang%20Hyun&id=DOI:
    Name: Full Text Finder (for New FTF UI) (s8985755)
    Category: fullText
    Text: Find It @ SCU Libraries
    MouseOverText: Find It @ SCU Libraries
Header DbId: edsarx
DbLabel: arXiv
An: edsarx.2409.19946
RelevancyScore: 1112
AccessLevel: 3
PubType: Report
PubTypeId: report
PreciseRelevancyScore: 1112.25915527344
IllustrationInfo
Items – Name: Title
  Label: Title
  Group: Ti
  Data: Illustrious: an Open Advanced Illustration Model
– Name: Author
  Label: Authors
  Group: Au
  Data: <searchLink fieldCode="AR" term="%22Park%2C+Sang+Hyun%22">Park, Sang Hyun</searchLink><br /><searchLink fieldCode="AR" term="%22Koh%2C+Jun+Young%22">Koh, Jun Young</searchLink><br /><searchLink fieldCode="AR" term="%22Lee%2C+Junha%22">Lee, Junha</searchLink><br /><searchLink fieldCode="AR" term="%22Song%2C+Joy%22">Song, Joy</searchLink><br /><searchLink fieldCode="AR" term="%22Kim%2C+Dongha%22">Kim, Dongha</searchLink><br /><searchLink fieldCode="AR" term="%22Moon%2C+Hoyeon%22">Moon, Hoyeon</searchLink><br /><searchLink fieldCode="AR" term="%22Lee%2C+Hyunju%22">Lee, Hyunju</searchLink><br /><searchLink fieldCode="AR" term="%22Song%2C+Min%22">Song, Min</searchLink>
– Name: DatePubCY
  Label: Publication Year
  Group: Date
  Data: 2024
– Name: Subset
  Label: Collection
  Group: HoldingsInfo
  Data: Computer Science
– Name: Subject
  Label: Subject Terms
  Group: Su
  Data: <searchLink fieldCode="DE" term="%22Computer+Science+-+Computer+Vision+and+Pattern+Recognition%22">Computer Science - Computer Vision and Pattern Recognition</searchLink>
– Name: Abstract
  Label: Description
  Group: Ab
  Data: In this work, we share the insights for achieving state-of-the-art quality in our text-to-image anime image generative model, called Illustrious. To achieve high resolution, dynamic color range images, and high restoration ability, we focus on three critical approaches for model improvement. First, we delve into the significance of the batch size and dropout control, which enables faster learning of controllable token based concept activations. Second, we increase the training resolution of images, affecting the accurate depiction of character anatomy in much higher resolution, extending its generation capability over 20MP with proper methods. Finally, we propose the refined multi-level captions, covering all tags and various natural language captions as a critical factor for model development. Through extensive analysis and experiments, Illustrious demonstrates state-of-the-art performance in terms of animation style, outperforming widely-used models in illustration domains, propelling easier customization and personalization with nature of open source. We plan to publicly release updated Illustrious model series sequentially as well as sustainable plans for improvements.
– Name: TypeDocument
  Label: Document Type
  Group: TypDoc
  Data: Working Paper
– Name: URL
  Label: Access URL
  Group: URL
  Data: <link linkTarget="URL" linkTerm="http://arxiv.org/abs/2409.19946" linkWindow="_blank">http://arxiv.org/abs/2409.19946</link>
– Name: AN
  Label: Accession Number
  Group: ID
  Data: edsarx.2409.19946
PLink https://login.libproxy.scu.edu/login?url=https://search.ebscohost.com/login.aspx?direct=true&site=eds-live&scope=site&db=edsarx&AN=edsarx.2409.19946
RecordInfo BibRecord:
  BibEntity:
    Subjects:
      – SubjectFull: Computer Science - Computer Vision and Pattern Recognition
        Type: general
    Titles:
      – TitleFull: Illustrious: an Open Advanced Illustration Model
        Type: main
  BibRelationships:
    HasContributorRelationships:
      – PersonEntity:
          Name:
            NameFull: Park, Sang Hyun
      – PersonEntity:
          Name:
            NameFull: Koh, Jun Young
      – PersonEntity:
          Name:
            NameFull: Lee, Junha
      – PersonEntity:
          Name:
            NameFull: Song, Joy
      – PersonEntity:
          Name:
            NameFull: Kim, Dongha
      – PersonEntity:
          Name:
            NameFull: Moon, Hoyeon
      – PersonEntity:
          Name:
            NameFull: Lee, Hyunju
      – PersonEntity:
          Name:
            NameFull: Song, Min
    IsPartOfRelationships:
      – BibEntity:
          Dates:
            – D: 30
              M: 09
              Type: published
              Y: 2024
ResultId 1