Well, GNU grep was last released 16 months ago, and the last change to its master branch was 4 weeks ago: http://git.savannah.gnu.org/cgit/grep.git
FreeBSD's grep was last updated back in August: https://github.com/freebsd/freebsd/tree/master/usr.bin/grep
OpenBSD's grep was last updated 11 months ago: http://cvsweb.openbsd.org/cgi-bin/cvsweb/src/usr.bin/grep/
Oddly, it looks like the Darwin grep was last updated in 2012: https://opensource.apple.com/source/text_cmds/text_cmds-99/g...
Strange that Apple would be shipping such an ancient grep.
I don't think it is that strange. Command line tools such as grep don't appear to be a development priority for Apple. Their focus appears to be on features visible to the average user, who uses the GUI instead of the command line.
Command line tools are mainly used by developers and power users, and the existing tools are generally good enough for most purposes, and people who want something better can always install the GNU versions using Homebrew/MacPorts/etc. There isn't much market demand for improvements in this area, so it makes sense Apple wouldn't invest in it.
A tldr of the middle would be cool. Maybe there was a pattern.
I'd like to add another OS not mentioned that will hopefully become a well-appreciated artifact soon too, from Redox OS: https://gitlab.redox-os.org/redox-os/coreutils/blob/master/s...
I can't find it quickly now, but jackpot51 also has a very answer somewhere on Reddit about how their networking stack's DNS query command departs from a commonly deployed C program for Windows and Unix, iirc. fascinating
Also I want to point readers to the commentary of some of the Unix authors:
“Old programs have become encrusted with dubious features. Newer programs are not always written with attention to proper separation of function and design for interconnection.”
http://harmful.cat-v.org/cat-v/unix_prog_design.pdf
My point being: Unix (and derivatives) encompass a set of people who disagree about what constitutes Unix philosophy.
References:
- 1971 draft (pre 1st edition) of the paper that would become the well-known 1974 CACM UNIX paper (earliest documentation on `cat` that I can find): https://www.tuhs.org/Archive/Distributions/Research/McIlroy_... (tune in on page 28)
- 6th edition cat(1) man page (1975): http://man.cat-v.org/unix-6th/1/cat
- 7th edition cat(1) man page (1979): http://man.cat-v.org/unix_7th/1/cat
Is it me or does 6 months seem like an awfully long time for re-writing such a small and simple program?
Maybe they ceded this part of the OS to Homebrew? I know I never try to update anything stock in the OS. It's so much easier/faster to `brew install xxxxxx` than mess with the OS which might get overwritten with an official update anyway.
$ uname -v
Darwin Kernel Version 13.4.0: Mon Jan 11 18:17:34 PST 2016; root:xnu-2422.115.15~1/RELEASE_X86_64
$ grep --version
grep (BSD grep) 2.5.1-FreeBSD
I don't have historical information, but that's at least consistent.Well, AF_UNIX sockets are sockets with paths in the filesystem. You must either connect() to them or bind() to them instead of open()ing them. Apparently, BSD-derived versions of cat will try to connect() to to a file if open() fails.
With GNU cat, if you try to cat a socket, it will go like this:
$ ls -l test.sock
srwxr-xr-x 1 luke users 0 Nov 12 21:07 test.sock
$ cat test.sock
cat: test.sock: No such device or address
but BSD-derived cats will successfully open the socket for reading. That behavior can be accomplished on other systems by using socat instead; BSD cat behaves somewhat like: $ socat UNIX:test.sock STDOUT $ uname -v
Darwin Kernel Version 8.11.1: Wed Oct 10 18:23:28 PDT 2007; root:xnu-792.25.20~1/RELEASE_I386
$ grep --version
grep (GNU grep) 2.5.1
Other man pages: ed says 1993, sed says BSD 2004, cat says 3rd Berkeley Distribution 1995.* 2002: 833 LoC (http://landley.net/aboriginal/history.html)
* 2013: 36kLoC, 2/3rds of them .h files (https://news.ycombinator.com/item?id=11340510#11341175)
* 2018: 37kLoC of .c file dependencies going into libcoreutils.a and some LoC of .h files (coreutils has 60kLoC of .h files)
The methodology for counting lines likely isn't consistent across those data points. But the trend is still unmistakeable. Maybe I'll tree-shake all the dead code out and come up with an accurate line count one of these days..
I found a comment claiming that prior to 10.8 (2012, Mountain Lion) it used GNU grep, but nothing I'd feel comfortable citing.
They are basically right though.
The counterexample of some Unix utilities means nothing. You're not getting a CS degree in order to develop the next version of cat, are you?
We have some things with a long history and they are easy to identify. It is just hindsight being 20/20.
For every one of those things, there are countless that can't be seen or felt. They aren't here; they got washed away.
Who uses the Michigan Terminal System?
Or a web framework from ten years ago?
The aunt and the cousin are thinking that 'computer technology' exists at the level of abstraction of the sandcastles in the metaphor. To some extent it does, but the vastly greater part of it is at the level of abstraction of the knowledge and theory of building sand castles, as gained over the course of many iterations.
One of the most common themes one hears, when reading what people write about computer science, is how few new ideas in computer science are actually involved in nearly anything anyone does on a computer (or teaches at the undergraduate level).
$ type grep
grep is hashed (/usr/bin/grep)
It does seem to be the original grep for this machine (it's a Mac Mini) - it has the same Jan 2006 date as most of the files in /usr/bin, and nothing has an earlier date. There's no other file called grep elsewhere.Well, I still write ASP.NET Web Forms on a regular basis. Ten years is not that old, or is it? Though it is harder and harder to find developers for it, the young people simply don't start with Web Forms.
They are only right in the same way that a physics major is obsoleted by advances in physics: lhc, discovery of dark matter & energy, increasing expansion of the universe, etc.
A CS major isn't about learning the latest Angular framework derivative. A CS major is about learning fundamental aspects of computer science.
"The Design and Implementation of the 4.4 BSD Operating System"
"The Design and Implementation of the FreeBSD Operating System"
"Mac OS X Internals: A Systems Approach"
"Solaris Internals: Solaris 10 and OpenSolaris Kernel Architecture"
"HP-UX 11i Internals"
"IA-64 Linux Kernel: Design and Implementation"
exa -l --git will list N/M git status flags in the output and:
exa --git-ignore will obey .gitignore when you're listing files :)
Works like a charm in my experience.
Xenix manuals and later Steven's books were my introduction into UNIX world.
* http://jdebp.eu./FGA/operating-system-books.html
One of these days I shall get around to expressing my opinions, which as you can see are still missing. Indeed, the list itself is a decade out of date. (-:
I have some SCO UNIX manuals on the other side of the room as I type this.
Indeed, the observation that some Unix utilities have their roots in the seventies misses the point in this regard. I'd say this is a testament to the success of the unix approach or whatever you want to call it. It's not really about computer science.
#!/bin/sh
Yes, that's really it. Fire up the shell, get it to exit with 0, which is taken as success. That's all that's really necessary for its spec.GNU's is around 29 KiB compiled, and it uses some of that to support --version and --help flags. MacOS's is around 17 KiB compiled and ignores flags.
gcc -I. -I./lib /
src/version.c /
lib/progname.c /
lib/safe-read.c /
lib/safe-write.c /
lib/quotearg.c /
lib/xmalloc.c /
lib/localcharset.c /
lib/c-strcasecmp.c /
lib/mbrtowc.c /
lib/xalloc-die.c /
lib/c-ctype.c /
lib/hard-locale.c /
lib/exitfail.c /
lib/closeout.c /
lib/close-stream.c /
lib/fclose.c /
lib/fflush.c /
lib/fseeko.c /
lib/version-etc.c /
lib/xbinary-io.c /
lib/version-etc-fsf.c /
lib/binary-io.c /
lib/fadvise.c /
lib/full-write.c /
src/cat.c
Those .c files add up to 5021 lines.The .c files include 44 header files:
lib/binary-io.h
lib/c-ctype.h
lib/closeout.h
lib/close-stream.h
lib/config.h
lib/c-strcaseeq.h
lib/c-strcase.h
lib/ctype.h
lib/error.h
lib/exitfail.h
lib/fadvise.h
lib/fcntl.h
lib/fpending.h
lib/freading.h
lib/full-write.h
lib/gettext.h
lib/hard-locale.h
lib/ignore-value.h
lib/limits.h
lib/localcharset.h
lib/locale.h
lib/minmax.h
lib/progname.h
lib/quotearg.h
lib/quote.h
lib/safe-read.h
lib/stdio.h
lib/stdio-impl.h
lib/stdlib.h
lib/string.h
lib/sys/ioctl.h
lib/sys-limits.h
lib/sys/types.h
lib/unistd.h
lib/unused-parameter.h
lib/verify.h
lib/version-etc.h
lib/wchar.h
lib/wctype.h
lib/xalloc.h
lib/xbinary-io.h
src/die.h
src/ioblksize.h
src/system.h
The header files add up to 19.7k lines.So the total line count for files GNU cat actually needs to build is at least ~25k.
(I didn't bother checking for headers including other headers.)
Next step: do this for various versions of GNU coreutils.
I created a wiki page to measure the number of lines of code* of various types of software https://softwarecrisis.miraheze.org/wiki/Linecount - LOC is a very very rough proxy for what I actually want to measure, but the results are so stunning that even a inaccurate indirect measurement tells a lot. You can see that for 2 projects that do essentially the same thing there might be a 1000x difference in LOC.
It's fascinating what can happen to such a simple program like 'cat'. The same effect is amplified further when you look at projects like gcc. I tried to ask the question on a couple sites like stackexchange and reddit why does gcc take half an hour to build instead of a fraction of a second but this question was not taken well. I got a lot of resistance to it, X-Y answers, deleted etc. I don't think that the common software engineer wants to take the idea seriously that the day to day tools we use have a million fold inefficiency built into them by accident. I also noticed that 'make' has no profiler, nobody has even really done a breakdown of what takes how long to build in the gcc tree.
There are a lot of brilliant engineers who understand this problem and want to solve it though. We see that in Alan Kay's STEPS project, aligrudi's work, musl, toybox, maybe sbase and many of the independent bootstrapping projects that have popped up. There's a lot of inertia and weight to the standard GNU toolkit to push back against but I believe these problems are all solvable and by solving them we can create programming languages and tools with leverage far beyond what currently exists. I just hope such projects can be integrated rather than be forgotten.
Neither of these statements are true. grep on FreeBSD is still GNU grep, and it has a distinct version text from bsdgrep:
$ grep -V
grep (GNU grep) 2.5.1-FreeBSD
Copyright 1988, 1992-1999, 2000, 2001 Free Software Foundation, Inc.
This is free software; see the source for copying conditions. There is NO
warranty; not even for MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.
$ bsdgrep -V
bsdgrep (BSD grep) 2.6.0-FreeBSD
$ uname -rK
13.0-CURRENT 1300003Amusingly, the BSD socket behavior can be disabled with the compiler macro -DNO_UDOM_SUPPORT, but as far as I can tell it is not documented nor hooked into the rest of the build system in any way since its introduction in 2001:
https://svnweb.freebsd.org/base?view=revision&revision=83482
That's certainly a Unix truism! It seems everyone has their own subjective beliefs about what Unix should be and decides their own beliefs constitute "the" Unix philosophy.
MacOS:
* https://unix.stackexchange.com/questions/352977/
* https://unix.stackexchange.com/a/398249/5132
The very version of FreeBSD from some years ago:
% bsdgrep --version
bsdgrep (BSD grep) 2.5.1-FreeBSD
% grep --version
grep (GNU grep) 2.5.1-FreeBSD
Copyright 1988, 1992-1999, 2000, 2001 Free Software Foundation, Inc.
This is free software; see the source for copying conditions. There is NO
warranty; not even for MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.
%
More on that:* https://unix.stackexchange.com/a/65609/5132
Kyle Evans and others on making bsdgrep into grep:
strcmp("bsdgrep (BSD grep) 2.5.1-FreeBSD", "grep (GNU grep) 2.5.1-FreeBSD") != 0
Also, I'm personally in contact with Kyle Evans and am familiar with the general interest in making bsdgrep grep. But I also know that it hasn't happened yet.