Why, really?
- Become shell literate: Attempt at trying to persuade you that this whole thing makes sense. Full disclosure: the author of the article is a well-known free software advocate, so he is far from impartial in his article. That said, he is certainly not alone in suggesting it; here is another example from Letters To A New Developer
UNIX philosophy
- Unix philosophy (by Doug McIlroy, creator of Unix pipes) is to:
- Write programs that do one thing and do it well
- Write programs to work together
- Write programs to handle text streams, because that is a universal interface
Materials
- Learn long-term from:
- If you are in a hurry:
- If you are in a real hurry:
Worth reading or watching
- The UNIX Command Language: This paper from 1976 (!), written by no one else but Ken Thompson, is the first paper ever published on the Unix shell.. If for nothing else, it's almost certainly worth reading for its amazing clarity of presentation and concise treatment.
- AT&T Advertisement for UNIX: Watch Brian Kernighan describe (in a very down-to-earth fashion) what's great about UNIX, especially how pipes play an important role in that.
- Introduction to text manipulation on UNIX-based systems: A very extensive in-depth guide into what's possible with just the standard tools, when it comes to text processing on UNIX-like systems. (Spoiler alert: a lot!)
- Linux Filesystem Hierarchy: A deeper discussion on the various parts of the standard Linux filesystem, describing several of the directories in much higher detail than the slides ever could. Less in-depth also on Wikipedia.
- How dotfiles came to be: A short story by Rob Pike about how dotfiles came to be and what it says about the unintended effects of cutting corners and just "hacking around" a problem.
- Learn regular expressionsLearn regular expressions
Intro
Regular Expressions = "regex" or "regexp"
Comes from the ed editor but you'll mostly encounter the grep program
A quick way of describing a particular pattern of characters in text
Allows for extremely effective search and replace
Comes from the ed editor from UNIX but you'll mostly encounter the grep program
Knowing [regular expressions] can mean the difference between solving a problem in 3 steps and solving it in 3,000 steps. When you’re a nerd, you forget that the pr...: Quite a bit of information on regular expressions. - Learn VimLearn Vim
Philosophy
Optimise for reading code/text, not writing
Be programmable by itself (keystrokes are composable commands)
Avoid the mouse (too slow) or even the arrow keys (too much hand movement)
In practice this means using different "modes of operation" for different kinds of tasks.
Vim modes
NORMAL (<ESC>, sometimes double)
Default mode the editor starts in; one should spend most of the time here
Each keypress is equivalent to an editor command
IN...: quite a bit of details about Vim. - Symlinks, Hardlinks, Reflinks and ML projects: This article goes deeper into how these concepts of links can be used for various Machine Learning (ML) projects where you work with a ton of data.
- ShellCheck: This page allows you to easily find bugs in your shell scripts.
- UNIX: A History and a Memoir by Brian W Kernighan: A historical account of how UNIX came to be by someone who was there when it happened. It will help you paint the proper picture of what is meant when people say stuff like "UNIX legacy" or "the UNIX era".
- The Cuckoo's Egg: Tracking a Spy Through the Maze of Computer Espionage by Cliff Stoll: A true story of a physicist who tracked one of the first documented "hackers" who he found snooping around his systems. The best part is that it's all real, down to the (obviously UNIX) commands that were used.
Make yourself at home
- Use iTerm2, the terminal for MacOS
- Use
tmux
, terminal multiplexer with simple setup that can be also used for pair-programming - Use
ohmyzsh
as framework for managing zsh configuration or in particularzsh-autosuggestions
for autosuggestion in zsh shell - Later, use aliases or functions and define them in
.bashrc
or.bash_profile
, e.g.alias cx='chmod +x'
ormcd() { mkdir -p $1; cd $1 }
- Customise prompt through
starship
or like here:
export VIRTUAL_ENV_DISABLE_PROMPT=1
function virtualenv_info {
[ $VIRTUAL_ENV ] && echo ' ('`basename $VIRTUAL_ENV`')'
}
# https://github.com/git/git/blob/master/contrib/completion/git-prompt.sh
source ~/.git-prompt.sh
function git_info {
__git_ps1 "(%s) "
}
# https://zsh.sourceforge.io/Doc/Release/Prompt-Expansion.html
setopt PROMPT_SUBST ; PS1='%F{green}@%*%f %F{cyan}[%n]%f%F{yellow}$(virtualenv_info)%f%F{red} $(git_info)%f%F{magenta}%~%f $ '
Commands
- Anatomy of a typical command:
# cmd1 -options arg1 (pipe) cmd2 arg2
ls -lh /etc | grep 'conf'
!!
: repeat the last commandcd -
: change to previous directory; btw check outz
orj
, toowhich
: check if a command is available and if so, from where it is runman
/info
(ortldr
or eventealdeer
as modern alternatives): show docscat
(orbat
as modern alternative): read your files.touch file.txt
: create an empty filectrl-r
: recall from history;history
: print history-
Almost all of these are core utilities commands
less
: Read your long files (pagination, scrolling, etc.):
cat long_text.txt | less
read
: Read a value from input:
echo "What's your name?"
read Name # not declaring a new variable
echo Hello, $Name!
ls
(orlsd
orexa
as modern alternatives): List your files properly but make sure to read the info right if needed.:
ls -lh
chmod
: Change permissions to make a file executable (or you can use octal representation.)
# chmod {u,g,o,a}{+,-,=}{r,w,x} file
chmod u+x samlapi.py # add execute rights for user
sudo
: Do as superuser to check yourPATH
:
sudo nano /etc/paths
ps
(orprocs
as modern alternative): List all process statuses:
ps -e
kill
: Kill a process or just stop a process and start again, right where you left off, based on this:
# kill
ps -e | grep -i "[process name]"
kill -9 [pid]
# stop and start again
kill -SIGSTOP [pid] # or -19
kill -SIGCONT [pid] # or -18
jobs
: List your running process and use<Ctrl-Z>
with bg
, fg
to send them to background/foreground:
$ sleep 1000
^Z
[1] + 18653 suspended sleep 1000
$ jobs
[1] + suspended sleep 1000
$ bg %1
[1] - 18653 continued sleep 1000
$ jobs
[1] - running sleep 1000
bc
: Calculate better:
echo "9.45 / 2.327" | bc -l # => 4.06102277610657498925
cut
,paste
: Taken
th column from CSV and sum it:
cat file.csv | cut -d, -f[n] | paste -sd+ | bc
iconv
: Convert text form one character encoding to another:
iconv -f [encoding] -t [encoding] -o [outputfile] [inputfile]
find
(orfd
as modern alternative orfzf
for fuzzy finding),xargs
: "Parametrise" standard input line by line and "apply" a command on each line, esp. useful with preceding pipe, e.g. for removing empty files; more info here:
find . -type f -empty | xargs rm # or xargs -I{} rm {}
ssh
(ormosh
as a modern alternative): Access remote servers through a "Secure SHell" whilst generating keys viassh-keygen
(stored in~/.ssh/id_ed25519
) and even usingssh-agent
for not needing to type the passphrase every time or~/.ssh/config
to create aliases for hosts, e.g. as on GitHub:
ssh foobar@192.168.1.42 ls -l
ssh-keygen -o -a 100 -t ed25519 -f ~/.ssh/id_ed25519
ssh-copy-id -i .ssh/id_ed25519 foobar@remote
# copy a file
cat localfile | ssh remote_server tee serverfile # tee: STDIN > STDOUT + file
# copy more files
rsync source_folder destionation_folder # or you can try rclone
# local port forwarding
ssh -L 9999:localhost:8888 foobar@remote_server # link local 9999 to remote 8888
curl
: Outputs the file it reads from the network tostdout
; handy to usepup
if working with HTML andjq
if with JSON:
curl uniba.sk > index.html
sed
(orsd
as modern alternative): Take in a stream of text line by line and transform it in one, esp. for substitution (s
) but also deletion (d
) or printing (p
).
# cat [filename] | sed [addr]X[options] # addr=lines, x=cmd
cat text.txt | sed -E 's/Unix/UNIX/g' # s/[regex]/[replacement]/[flags]
cat text.txt | sed -E 's/(her)/[\1]/g' # \m to reference groups in ()
cat text.txt | sed -E 's/[0-9]+/[&]/g' # & to reference the whole match
awk
: Scan file for lines that match any of a set of patterns and make an action. It basically translates the below into simplepattern { action1[; action2] }
(yet quite powerful).
for line in file.readlines():
for pattern, actions in patterns_actions:
if pattern.match(line):
eval(actions)
cat text.txt | awk '/regex/' # rows that match regex
cat text.txt | awk -F, '{ print $n }' # print n-th column for each row where , is Field Sep
cat text.txt | awk '$4 == "F" { print $1 }' # 1st col for rows with "F" in col4
ls *.txt -l | awk '$5 >= 100 { sum += $5 } END { print sum }' # sum sizes of txt files over 100 B
cat people.txt | awk '{ p[$4]++ } END { for(i in p) print i, ":", p[i] }' # agg
Scripting
- Some notes on it below but if you want to go deeper, Advanced Bash-Scripting Guide would be where to go
- Especially when starting, it might be a good idea to use
spellcheck
#!/bin/bash
: Use a shebang as a header to define the absolute path to the file's interpreter directly in the script (and not later on the command line), i.e../script.sh
instead ofbash script.sh
printenv
: See all pre-set variablesexport VAR=exported
: Export variable to be exposed to child processes, otherwise variables are local to the process in which they are defined.echo '$name' vs echo "$name"
: First returns verbatim, second interpolates first. Well, (almost) always double-quote your variable references.echo ${my_array[0]}
or sometimes just simplyecho ${MY_VAR}
: Parameter expansion is also usefula=$(echo "hello")
: Command expansion to save output to variable.{1..10}
: Brace expansion even more$(( 10 + 5 ))
: Simply calculate or maybe just usebc
0
exit code is success ("true"),1
is fail "false").-
cmd1 | cmd2
,cmd1 && cmd2
,cmd1 || cmd2
,cmd1 ; cmd2
: Multiple commands can be run together. - If statements:
#!/bin/bash
if [ "$USER" == "root" ]; then
echo "You may proceed";
elif groups | grep -q sudo; then
echo "Please become root to run this"
else
echo "Sorry, only root is allowed to run this";
fi
# Most often, `test` (`man 1 test`) is used; or [ ], or even , for short
# BTW, there is a difference: http://mywiki.wooledge.org/BashFAQ/031
if [ -d .tmp ]; then
echo "The directory .tmp exists; proceeding."
fi
- Cases:
#!/bin/bash case "$1" in root) echo "Welcome, you can come in" ;; mrshu) echo "Please provide password" ;; *) echo "Name not recognized";; esac
- While loop:
#!/bin/bash
while ps -ef | grep -v grep | grep firefox; do
echo "Firefox not running, will check in 10 seconds"
sleep 10
done
- For loop:
#!/bin/bash
for i in $(seq 1 5); do
echo "Checking number $i"
done
- Inplace files:
sh << EOF
echo "Hello World"
END<enter>
- Fallback for no args
if [ $# -eq 0 ]; then
#no args
else
#some args
fi
Modern alternatives
- Modern Alternatives of Command-Line Tools
- Modern shells:
When (not) to use it
In general, bash scripts are useful for short and simple one-off scripts when you just want to run a specific series of commands. bash has a set of oddities that make it hard to work with for larger programs or scripts:
- bash is easy to get right for a simple use case but it can be really hard to get right for all possible inputs. For example, spaces in script arguments have led to countless bugs in bash scripts.
- bash is not amenable to code reuse so it can be hard to reuse components of previous programs you have written. More generally, there is no concept of software libraries in bash.
- bash relies on many magic strings like
$?
or$@
to refer to specific values, whereas other languages refer to them explicitly, likeexitCode
orsys.args
respectively.
Therefore, for larger and/or more complex scripts we recommend using more mature scripting languages like Python or Ruby.