Document Layout Analysis
Document Layout Analysis (DLA) is a computer vision subtask that aims to determine the physical structure of a document, i.e., to identify the components that make it up. A component can be a single connected component, meaning a region of adjacent pixels that form one area, or a group of text lines. A text line is a set of characters, symbols, and words that lie close to one another and can be joined by a straight line, typically oriented horizontally or vertically. DLA is valuable for automated document processing, information extraction, and content understanding.
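The two kinds of components described above can be illustrated with a small sketch. It is not a definitive DLA method. It assumes a binarized page given as a grid of 0s and 1s, 4-connectivity between pixels, and horizontal text lines only. The function names connected_components and group_into_lines and the max_gap threshold are hypothetical, chosen here for illustration.

```python
from collections import deque


def connected_components(grid):
    """Find 4-connected regions of foreground (1) pixels.

    Returns one bounding box per region as (top, left, bottom, right),
    with inclusive coordinates, in row-major order of discovery.
    """
    rows, cols = len(grid), len(grid[0])
    seen = [[False] * cols for _ in range(rows)]
    boxes = []
    for r in range(rows):
        for c in range(cols):
            if grid[r][c] != 1 or seen[r][c]:
                continue
            # Breadth-first flood fill from this unvisited foreground pixel.
            seen[r][c] = True
            queue = deque([(r, c)])
            top, left, bottom, right = r, c, r, c
            while queue:
                y, x = queue.popleft()
                top, bottom = min(top, y), max(bottom, y)
                left, right = min(left, x), max(right, x)
                for ny, nx in ((y - 1, x), (y + 1, x), (y, x - 1), (y, x + 1)):
                    if (0 <= ny < rows and 0 <= nx < cols
                            and grid[ny][nx] == 1 and not seen[ny][nx]):
                        seen[ny][nx] = True
                        queue.append((ny, nx))
            boxes.append((top, left, bottom, right))
    return boxes


def group_into_lines(boxes, max_gap=2):
    """Greedily merge boxes into horizontal text lines.

    A box joins an existing line when their row ranges overlap and at most
    max_gap empty columns separate the box from the line's right edge.
    max_gap is an assumed, illustrative threshold.
    """
    lines = []
    for box in sorted(boxes, key=lambda b: (b[1], b[0])):  # left to right
        for i, line in enumerate(lines):
            rows_overlap = box[0] <= line[2] and line[0] <= box[2]
            gap = box[1] - line[3] - 1  # empty columns between line and box
            if rows_overlap and gap <= max_gap:
                lines[i] = (min(line[0], box[0]), min(line[1], box[1]),
                            max(line[2], box[2]), max(line[3], box[3]))
                break
        else:
            lines.append(box)
    return lines


# Toy binarized page: two "glyphs" on an upper line, three on a lower line.
page = [
    [1, 1, 0, 1, 0, 0, 0, 0],
    [1, 1, 0, 1, 0, 0, 0, 0],
    [0, 0, 0, 0, 0, 0, 0, 0],
    [0, 0, 0, 0, 0, 0, 0, 0],
    [1, 0, 1, 1, 0, 0, 1, 0],
]
boxes = connected_components(page)
lines = group_into_lines(boxes, max_gap=2)
print(len(boxes), "components ->", len(lines), "text lines")
```

On this toy page the five pixel regions collapse into two line-level boxes. Lowering max_gap to 1 leaves the rightmost region of the lower line as a separate component, which shows how sensitive this kind of grouping is to the chosen distance threshold.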