Tabs and Spaces Still Need Human Consistency

5 min read

Code Formatting Tabs vs Spaces AI Coding Tools Style Guides

The Long-Running Indentation War

For decades, developers have argued whether tabs or spaces make the better indent. Tabs save bytes and let each person set a width that suits them. Spaces guarantee identical alignment in every editor. The debate was mostly philosophical, until machine learning, powered coding assistants arrived.

So Why Does Indentation Still Matter? Many languages (Python, YAML, HCL, CoffeeScript) treat indentation as part of "syntax". Even in C-style braces, clean indents aid code review tools and automated formatters. When indentation is off, parsers can misread scope, and code intelligence engines may fail to build an accurate abstract syntax tree (AST).

AI Learns From Training Data so Large language models (LLMs) ingest terabytes of open-source code. If that corpus is biased toward spaces, the resulting model becomes better at predicting space-aligned structures. The reverse holds for tabs. An LLM does not intrinsically "count" tabs more cheaply than spaces; it simply sees a single control character versus several. What matters is 'how often' each style appears in the data and whether the model was fine-tuned to normalize both. The Key takeaway: The model’s formatting preference mirrors its diet.

Consistency Beats Preference

Mixed indentation, tabs sprinkled among spaces, confuses humans "and" AI. Any pattern learner struggles when stylistic signals fluctuate. You may see code completion drift, mis-aligned snippets, or linter noise. Maintaining a single, enforced style (via .editorconfig, prettier, black, or language-specific linters) ensures both humans and models operate on a stable baseline.

# Good: consistent spaces (4-wide)
def greet():
    print("Hello")

# Bad: models may misinfer scope
	def greet():
        print("Hello")

So has AI Solved the Debate? Modern tools such as GitHub Copilot, Tabnine, and Claude can detect the dominant indent style of a file and adapt on the fly. Some IDE plug-ins offer a "convert indentation" command that harmonizes an entire repository before an LLM sees it. In that sense, AI can "bridge" the gap: it will happily output tabs if the context starts with tabs, and spaces if spaces rule.

Yet two caveats remain:

First, training bias means that if your codebase is mostly tabs but the model learned on spaces, its first few completions may be incorrectly spaced until context primes it. Second, without CI enforcement, a teammate or bot could commit mixed styles, triggering the very inconsistency that weakens AI output.

Practical Workflow Tips

Lock in a style guide by adding an .editorconfig or linter rule at the repo root. Use tools like pre-commit, husky, or GitHub Actions to run prettier --write . or black . and guarantee uniformity. When prompting ChatGPT, include a properly indented code snippet first so the model mimics that style. Batch-convert legacy files once, then make linters mandatory to prevent regressions. Finally, to avoid obscure logic changes, adopt "whitespace-ignored" diff settings or separate style-only PRs.

AI code assistants haven’t crowned a universal winner; rather, they make the choice less painful, if your project is internally consistent. Pick tabs or spaces, codify the rule, and your LLM collaborator will respect it. The true battle today is not tabs vs spaces but consistency vs chaos. Win that war, and both humans and machines will read, write, and maintain your software with far fewer hiccups.

Need senior engineering leadership?

Engage a partner-led engineering firm that agrees on fixed fees, written scope, and accountability for outcomes instead of hours.

Talk to a partner

Access to semperMade's services is highly selective and subject to approval.