|
|
|
|
|
linguist.page@gmail.com
Home
»
Computational Linguistics
»
Computing Fundamentals
What Is a Computer?
✓
(4)
Data vs Information vs Knowledge
✓
Analog vs Digital
✓
Hardware vs Software
✓
Input / Process / Output / Storage
✓
Number Systems & Data Representation
(11)
Decimal System
✓
Binary System
✓
Octal System
✓
Hexadecimal System
✓
Conversions Between Systems
✓
Binary Arithmetic
✓
Two's Complement
✓
Floating Point Representation (IEEE 754)
✓
ASCII - American Standard Code for Information Interchange
Unicode (UTF-8, UTF-16, UTF-32)
Encoding vs Decoding
Hardware Components
(9)
CPU (cores, clock speed, cache, ALU, CU)
RAM (volatile memory, addressing)
ROM
Storage (HDD, SSD, NVMe)
Motherboard & Bus Systems
GPU (why it matters for NLP/AI)
TPU (Tensor Processing Units)
Input/Output devices
Network Interface Card
Operating Systems
(10)
What an OS does
Kernel
Processes & Threads
Memory management
File systems (FAT32, NTFS, ext4)
Directories & Paths (absolute vs. relative)
Permissions & Users
System calls
Linux vs. Windows vs. macOS
Shell / Terminal concept
The Linux Command Line (CLI)
✓
(29)
Powering Off, Restarting, and Exiting
✓
Finding Your Bearings: Who, When, and Where Am I
✓
Getting Help: Manuals and Command Lookup
✓
Moving Around: Navigating the Filesystem
✓
Managing Files and Folders
✓
Reading File Contents
✓
Searching Text with grep
✓
Finding Files with find
✓
Counting, Sorting, and Deduplicating Lines
✓
Cutting, Pasting, and Splitting Files
✓
Editing Text in Place with sed
✓
Processing Fields with awk
✓
Comparing Files
✓
Character Encoding and Locale
✓
Piping, Redirection, and Chaining Commands
✓
Compressing and Extracting Archives
✓
Checking System Resources and Hardware
✓
Managing Processes
✓
Networking and Downloading Data
✓
Remote Access and File Transfer (SSH)
✓
Permissions, Ownership, and Superuser Access
✓
Installing Software (Package Managers)
✓
Environment Variables
✓
Scheduling, Timing, and Calendars
✓
Shell Scripting Basics
✓
Text Editors
✓
Python and Virtual Environments
✓
Running Scripts in the Background
✓
Version Control with Git
✓
Networks & the Internet
(9)
IP addresses (IPv4, IPv6)
DNS
HTTP / HTTPS
TCP/IP model
Client-Server model
APIs & REST
JSON & XML formats
How browsers work
Localhost & Ports
File Formats Relevant to NLP
(7)
Plain text (.txt)
CSV & TSV
JSON
XML / HTML
PDF (and its complexity)
CONLL format
Annotation formats (BIO, IOB2)