Overview

Kotatsu Parsers is a library that provides a standardized interface for accessing manga from various web sources. It's designed to work in both JVM and Android applications, offering a unified API for fetching manga listings, details, chapters, and page images regardless of the source website structure.

This document provides a high-level overview of the Kotatsu Parsers architecture, its key components, and how they interact. For information about installation and usage, see Installation and Usage.

Purpose

The library solves several challenges in manga content aggregation:

Providing a consistent interface for diverse manga sources
Handling different website structures and APIs
Managing pagination, search, and filtering
Resolving direct image URLs for manga pages
Configuring parsers for different domains and user preferences

Architecture Overview

The Kotatsu Parsers library is organized around a core MangaParser interface, with implementations for various manga websites. The library follows a modular architecture that separates the parser logic from the data models and network operations.

Key Components

MangaParser Interface

The MangaParser interface is the core of the library, defining the methods that all parsers must implement:

interface MangaParser : Interceptor {
    val source: MangaParserSource
    val availableSortOrders: Set<SortOrder>
    val searchQueryCapabilities: MangaSearchQueryCapabilities
    val config: MangaSourceConfig
    val domain: String

    suspend fun getList(query: MangaSearchQuery): List<Manga>
    suspend fun getDetails(manga: Manga): Manga
    suspend fun getPages(chapter: MangaChapter): List<MangaPage>
    suspend fun getPageUrl(page: MangaPage): String
    suspend fun getFilterOptions(): MangaListFilterOptions
    suspend fun getFavicons(): Favicons
    fun onCreateConfig(keys: MutableCollection<ConfigKey<*>>)
    suspend fun getRelatedManga(seed: Manga): List<Manga>
}

This interface provides methods for:

Listing manga with search and filter capabilities
Retrieving detailed information about a manga
Getting pages for a specific chapter
Resolving the direct URL for a page image
Getting filter options supported by the source

Parser Base Classes

The library provides several abstract base classes that implement common parsing patterns:

AbstractMangaParser: A base implementation with common utilities
LegacyPagedMangaParser: For sites with page-based navigation
LegacySinglePageMangaParser: For sites with all content on a single page

The library provides a powerful search and filtering system with two approaches:

Modern: Using MangaSearchQuery with criteria like Include, Exclude, and Match
Legacy: Using MangaListFilter (now deprecated but still supported)

Each parser declares its search capabilities through searchQueryCapabilities, which defines what search fields and criteria types it supports.

Configuration System

Parsers can be configured through a configuration system based on ConfigKey. Every parser must define a configKeyDomain which specifies the default domain and possible alternatives. This allows the application to handle domain changes without updating the library.

Implementation Examples

The library includes parsers for many popular manga websites:

API-Based Parsers

MangaDex is an example of a parser using a JSON API:

@MangaSourceParser("MANGADEX", "MangaDex")
internal class MangaDexParser(context: MangaLoaderContext) : 
    AbstractMangaParser(context, MangaParserSource.MANGADEX) {

    override val configKeyDomain = ConfigKey.Domain("mangadex.org")

    // Implements methods to work with MangaDex API
}

HTML-Based Parsers

Many parsers work with HTML content using JSoup:

@MangaSourceParser("COMICK_FUN", "ComicK")
internal class ComickFunParser(context: MangaLoaderContext) :
    LegacyPagedMangaParser(context, MangaParserSource.COMICK_FUN, 20) {

    override val configKeyDomain = ConfigKey.Domain("comick.io")

    // Implements methods to parse HTML content
}

Common Base Parsers

The library includes specialized base parsers for common website platforms:

MadaraParser: For sites using the Madara WordPress theme
WpComicsParser: For WordPress comic sites
GroupleParser: For Grouple-based sites
MangaReaderParser: For sites using MangaReader-like structures

This approach reduces code duplication and makes it easier to add support for new sites that use these common platforms.

Conclusion

The Kotatsu Parsers library provides a robust and flexible framework for accessing manga from various online sources. Its architecture makes it easy to add new sources while maintaining a consistent interface for applications that use the library.

For information on how to include and use the library in your project, see Installation and Usage. To learn about specific components in more detail, refer to the corresponding sections in this wiki.