tokenizer

A simple, generic code tokenizer written in C. It does not target any specific language; it just groups characters into words and strings.
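To illustrate what "grouping words and strings" can look like, here is a minimal sketch in C. It is not taken from this repository: the names `TokKind`, `Token`, `is_word_char`, and `next_token` are hypothetical, and the rules (words are runs of letters, digits, and underscores; strings are quoted with `"` or `'` and may contain backslash escapes; everything else is a one-character token) are assumptions that happen to match the example output in this README.

```c
#include <assert.h>
#include <ctype.h>
#include <stddef.h>

/* Hypothetical token kinds; the real tokenizer may classify differently. */
typedef enum { TOK_END, TOK_WORD, TOK_STRING, TOK_OTHER } TokKind;

/* A token is a slice of the input: no copying, no allocation. */
typedef struct {
    TokKind kind;
    const char *start;
    size_t len;
} Token;

/* Assumption: a "word" is made of letters, digits and underscores. */
static int is_word_char(unsigned char c) {
    return isalnum(c) || c == '_';
}

/* Reads one token starting at src[*pos] and advances *pos past it. */
Token next_token(const char *src, size_t *pos) {
    assert(src != NULL && pos != NULL);
    const char *p = src + *pos;
    Token t = { TOK_END, p, 0 };
    if (*p == '\0') return t;  /* end of input: zero-length token */

    const char *q = p;
    if (is_word_char((unsigned char)*q)) {
        /* Group a run of word characters into a single token. */
        while (is_word_char((unsigned char)*q)) q++;
        t.kind = TOK_WORD;
    } else if (*q == '"' || *q == '\'') {
        /* Group everything up to the matching quote, skipping escapes. */
        char quote = *q++;
        while (*q != '\0' && *q != quote) {
            if (*q == '\\' && q[1] != '\0') q++;
            q++;
        }
        if (*q == quote) q++;  /* include the closing quote if present */
        t.kind = TOK_STRING;
    } else {
        /* Punctuation and whitespace become one-character tokens. */
        q++;
        t.kind = TOK_OTHER;
    }
    t.len = (size_t)(q - p);
    *pos += t.len;
    return t;
}
```

Calling `next_token` in a loop until it returns `TOK_END` walks the whole input. Because each token is only a pointer and a length into the original buffer, the sketch needs no allocation; printing a token is `printf("%.*s\n", (int)t.len, t.start)`.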

build

make build

example

# test.py

print("hello world")

def foo() -> int:
    return 0

Output of tokenizer test.py -gs:

(NEWLINE)
print
(
"hello world"
)
(NEWLINE)
(NEWLINE)
(NEWLINE)
#
(SPACE)
comment
(NEWLINE)
def
(SPACE)
foo
(
)
(SPACE)
-
>
(SPACE)
int
:
(NEWLINE)
(SPACEx4)
return
(SPACE)
0
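
In the output above, a single space is printed as (SPACE) and the four-space indent as (SPACEx4), while newlines appear one per line. The sketch below shows one way such labels could be produced. It is an assumption based only on that output, not the repository's code, and the helper names `space_run_length` and `format_space_run` are hypothetical. This README does not document what the -gs flags do, so the sketch makes no claim about them.

```c
#include <assert.h>
#include <stddef.h>
#include <stdio.h>

/* Counts consecutive space characters starting at src[pos]. */
size_t space_run_length(const char *src, size_t pos) {
    assert(src != NULL);
    size_t n = 0;
    while (src[pos + n] == ' ') n++;
    return n;
}

/* Writes "(SPACE)" for one space, or "(SPACExN)" for a run of N > 1,
   into out. Returns snprintf's result: the length the label needs. */
int format_space_run(size_t count, char *out, size_t out_size) {
    assert(count > 0 && out != NULL);
    if (count == 1) return snprintf(out, out_size, "(SPACE)");
    return snprintf(out, out_size, "(SPACEx%zu)", count);
}
```

A caller would measure the run with `space_run_length`, print the label from `format_space_run`, and then advance its position by the run length so the spaces are consumed as one token.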