lexical-analysis
A grammar describes the syntax of a programming language, and might be defined in Backus-Naur form (BNF). A lexer performs lexical analysis, turning text into tokens. A parser takes tokens and builds a data structure like an abstract syntax tree (AST). The parser is concerned with context: does the sequence of tokens fit the grammar? A compiler is a combined lexer and parser, built for a specific grammar.
Here are 805 public repositories matching this topic...
Language: All
Sort: Most stars
Bai Du NLP:Fen Ci ,Ci Xing Biao Zhu ,Ming Ming Shi Ti Shi Bie ,Ci Zhong Yao Xing
-
Updated
May 25, 2021 - C++
xmnlp:Ti Gong Zhong Wen Fen Ci , Ci Xing Biao Zhu , Ming Ming Ti Shi Bie ,Qing Gan Fen Xi ,Wen Ben Jiu Cuo ,Wen Ben Zhuan Pin Yin ,Wen Ben Zhai Yao ,Pian Pang Bu Shou ,Ju Zi Biao Zheng Ji Wen Ben Xiang Si Du Ji Suan Deng Gong Neng
-
Updated
Nov 12, 2022 - Python
DFA regular expression library & friends
-
Updated
Feb 10, 2026 - C
INTERSPEECH 2023-2024 Papers: A complete collection of influential and exciting research papers from the INTERSPEECH 2023-24 conference. Explore the latest advances in speech and language processing. Code included. Star the repository to support the advancement of speech technology!
-
Updated
Dec 25, 2024
Chinese text segmentation with R. RYu Yan Zhong Wen Fen Ci (Wen Dang Yi Geng Xin :https://qinwenfeng.com/jiebaR/ )
-
Updated
Jul 13, 2020 - C++
A lexical analyzer based on DFA that is built using JS and supports multi-language extensions / Yi Ge Ji Yu DFADe Zhi Chi Duo Yu Yan Kuo Zhan De JSBan Kai Yuan Ci Fa Fen Xi Qi
-
Updated
Dec 4, 2025 - JavaScript
LLVM Tutorial: Kaleidoscope (Implementing a Language with LLVM)
-
Updated
Dec 29, 2022 - C++
Allocators, I/O streams, math, geometry, image and audio processing for D
-
Updated
Jan 1, 2026 - D
This is an implementation of a compiler for a simple C-like language that uses the C tools Flex and Bison.
-
Updated
Feb 26, 2026 - C
OysterKit is a framework that provides a native Swift scanning, lexical analysis, and parsing capabilities. In addition it provides a language that can be used to rapidly define the rules used by OysterKit called STLR
-
Updated
Aug 15, 2019 - Swift
A compiler that accepts any valid program written in C. It is made using Lex and Yacc. Returns a symbol table, parse tree, annotated syntax tree and intermediate code.
-
Updated
Apr 24, 2023 - Yacc
[EMNLP 2021] LingFeat - A Comprehensive Linguistic Features Extraction ToolKit for Readability Assessment
-
Updated
Mar 7, 2023 - Python
A module to compute textual lexical richness (aka lexical diversity).
-
Updated
Aug 27, 2023 - Python
clex is a simple lexer generator
-
Updated
Feb 9, 2026 - C
Santiago is a lexing and parsing toolkit for Rust
-
Updated
Feb 19, 2026 - Rust
A procedural programming language built in Rust which compiles to QBE
-
Updated
Feb 23, 2026 - Rust
Modular static malicious JavaScript detection system
-
Updated
Jan 18, 2021 - Python
A simple compiler written from scratch in C++ for an undergraduate course in program translation.
-
Updated
Apr 29, 2018 - C++
- Followers
- 11k followers
- Website
- github.com/topics/parsing
- Wikipedia
- Wikipedia