- cross-posted to:
- [email protected]
- cross-posted to:
- [email protected]
man -k
to the rescue:mbsrtowcs
,strxfrm
andwcstold
are C functions.wcsoll is a mispronunciation of wcscoll
Oh no. You tell them forbidden knowledge of reading manual.
Who wants to write C functions for the rest with me?
Why are they even named like this?
When I read code, I want to be able to read it…
Is this from a time when space was expensive and you wanted to reduce the space of the source files on the devs PC???
For me (with a native language != english), this made it a lot harder to get into programming in the first place.
I recently held a science slam about this topic! It’s a mix of the first computer scientists being mathematicians, who love their abbreviations, and limited screen size, memory and file size. It’s a trend in computing that has been well justified in the past, but has been making it harder for people to work together. And the need to use abbreviations has completely gone with the age of auto completion and language servers.
mathematicians, who love their abbreviations
Man, I hate that so much. I swear this was half the reason I struggled with maths and physics, that these guys need to write this:
Rather than this:
At some point, they even collectively decided that not having to write a multiplication dot is more important than being able to use more than a single letter for your variables. Just what the fuck?
Thing is, you usually define all your variables. At least we do in engineering (of physical variety, rather than software).
Mostly because we can’t expect everyone reading the calculation to know, and that not everyone uses the same symbols.
Not explaining each variable is bad practice, other than for very simple things. (I do expect everyone and their dog reading a process eng calc to know PV=nRT, at a minimum).
Just like (in my opinion) not defining industry specific abbreviations is also bad practice.
Mathematicians don’t do this? Shame on them.
I mean, it was rather physics that was worse in this regard.
Mathematicians do define their variable quite rigorously. Everything is so abstract, at some point you do just need to write down “this thing is a number”. Problem with maths folks is rather that they get more creative with their other symbols. So, “this thing is a number” is actually written as “∃x, x ∈ ℝ”.
But yeah, in the school/university physics I experienced, it was assumed that you knew that U is voltage, ρ (rho) is density, ω (omega) is angular velocity etc…
At one point, I had to memorize six pages of formulas and it felt like every letter (Latin, Greek, uppercase, lowercase, some Fraktur for good measure) was a shorthand for something.What’s PV? Asking for my friend’s dog.
(Pressure) * (volume) = (# moles) * (gas constant) * (temperature)
The ideal gas law.
In another thread I admit I didn’t explain my position here well enough. I would only not explain this equation given sufficient context (e.g. I’ve shown all those variables in a table, and my intended audience is people familiar with basic chemistry, which I’d expect would be everyone reading the report for this particular example, since this is high school chemistry, and the topic of all reports I work on is chemical engineering.)
People can read the conclusions if they’re not familiar with chemistry, and for the detail, they’re not my intended audience anyway.
Generally I still hold the position that you should define variables as much as possible, unless it’s overly cumbersome, given your intended audience would clearly understand anyway.
In context this simple equation is obvious even if you change the symbols, as long as there is sufficient context to draw from.
Still bad, i’m not a computer with a lookup table in memory.
I do expect everyone and their dog reading a process eng calc to know PV=nRT, at a minimum
What is “eng”?
Lol fair point regarding Eng: “Engineering”.
But Nah, I think assumed knowledge of PV=nRT is fair in context, since if you don’t know what it is, you’ll only be reading the conclusion, not getting into the weeds of a calculation document.
I’m not going even going to be explaining if I have a column that’s says volumetric flow rate, with V=m/ρ. If I give mass flow rate and density (with units, of course), and use these extremely common symbols, and someone doesn’t understand, then they have no real business getting to this level of detail anyway.
I do agree that in most cases not defining your variables is bad practice, but there is some nuance, depending on the intended audience and how common a formula is, and the format of whatever it is you’re writing.
So, in the end you just do assume everyone to know the “common sense” one-letter notation for everything. Well, not everything, but the essential ten thousands of entities for sure /s
This sounds like No true Scotsman fallacy to me
I find it a bit contradicting to the very point you made about defining variables. If anything, one might be some home-grown genius that has real business getting into details but only ever used Chinese characters as variables
Edit: forgot to set language
Understand your frustration with how I’ve communicated my position, sorry about that:
My justification for the examples I’ve given is there still needs to be other context, is based on complexity of the equation, and the intended audience of that equation.
An example of me not explaining a very simple equation would be perhaps a table of various cases:
| — | mass flow (kg/hr) | density (kg/m³) | Volumetric flow (m³/hr), V = m/ρ | | Case 1 | blah blah | blah blah | blah | | Etc. | … | … | … |
Realising now that markdown tables don’t seem to work 😅, hopefully this is still clear.
It may be a touch better to put variable symbols in the other columns, but:
- You still have the final answer (the purpose of my report, I’m not writing a thesis here)
- It should be plainly obvious by the units, and the fact those are the previous two variables, to someone who has the ability to understand (and is the intended audience of that little equation)
As a recent example for this, in a data sheet I recently prepared, I literally just put a * in the references column and said “*calculated from other data sheet values” for the volumetric flow rate, because the intended audience would know how to do that, and the purpose was for me to communicate how that value was determined.
Me putting in the V = m/ρ in the hypothetical example I gave above is a just a little mind jog for the reader.
Where more complicated equations are used, of course these are properly referenced, usually even with the standard or book it’s come from.
I’ll redefine my position to: Clearly define all variables, unless it’s abundantly obvious to your intended audience from context.
My intended audience of the conclusions or final values are the layman. My intended audience of everything else is someone with a very basic chemical engineering understanding.
Your last point is a strawman:
I find it a bit contradicting to the very point you made about defining variables. If anything, one might be some home-grown genius that has real business getting into details but only ever used Chinese characters as variables
Because I’m writing in English, for an English speaking audience, and there is no such thing as a home-grown genius getting into my area because it’s a legal requirement that they have an honours degree. Even still, the two assumed knowledge equations I mentioned, which I would only not reference with sufficient context, would STILL be recognisable with totally random symbols.
| mass flow (kg/hr) | density (kg/m³) | Volumetric flow (m³/hr), 容 = 质/密 | Yup, a bit odd in an English context, but with the units information (always mandatory, of course) completely understandable.
Try to write the above with pen and ink and then tell me if you can read it back yourself.
Single letters is not a good system but it was the less bad one.
Try writing 20 algebraic manipulations of the equation on paper and you’ll quickly understand why it’s written that way.
The bottom is absolutely not more readable, and it’s much more difficult to work with.
Using full names like that might be fine for explaining a physical rule, or stating the final result of some calculation - but it certainly would be cumbersome and difficult for actually carrying out the calculations. In many cases we already fill pages with algebra showing how things can be related and rearranged to arrive at new results. That kind of work would be intractable with full word names for the variables, partially because you’d be constantly spilling off the end of the page trying to write the steps; but also because having all that stuff would actually obfuscate what you are trying to do - which is algebra. And during that process, the meanings and values of the pronumerals is not as important has how they interact with each other. So the names are just a distraction.
For setting up an equation, and for stating the final result, the meanings of the variables are very important; but during the process of manipulating the equations to get the result you want the meanings of the letters are often ignored. You only need to know that it is something that can be multiplied, or inverted, or subtracted, or whatever. Eg. suppose I want to rearrange to get the velocity. I don’t care that I’m dividing both sides by the air density times the drag coefficient and the area… I’m just dividing ρCA, which is an algebraic blob whose interpretation can be saved for some other time.
This is absolutely true, but it still seems to me that we’re throwing the baby out with the bath water when we just stick to extremely terse symbols for everything regardless of context.
Reading articles would be so much easier if they used even slightly longer names – thankfully more and more computer science articles do tend to use more human readable naming nowadays, at least.
Sure, longer names make manipulation harder a bit more annoying if you’re doing it by hand, but if you do need to manipulate something you can then abbreviate the terms (and I’m 60% sure I’ve seen some papers that had both a longer form and a shorter form for terms, so one for explaining shit and one for the fiddly formal stuff)
Of course using terse terms is totally fine when it’s clear from the context what eg. ∆x means.
/s?
Nope.
Bruh how large should our notebook pages be? Also how should we speak about the equation? What if the terms should be represented in a matrix? What if mathematical functions e^x, sin, ln etc. are present? Would you write sine of e^(velocity of the particle B) ? Notations are necessary for readability
You only have to define it once in a document, book, whatever. Also, it’s not like you’d ever need to do this for handwritten notes, only for a wider audience, or if you intend for something to be read by not just you.
No one is suggesting you don’t use symbols, just that you define them, and not assume the reader uses the same symbols as you. Which, so often, they don’t. (How many different ones have you come across just in highschool and uni. I came across multiple)
I’m no physicist, but surely there is a huge range of symbols for the same thing, especially the more niche you get.
I’m not a mathematician, but I agree with you because this is precisely how one would abbreviate repeating terms in a paper (e.g. The Museum of Modern Art (MoMA) and the Metropolitan Museum of Art (The Met) are both located in New York, New York (colloquially, New York City, or NYC). While MoMA has an art collection of about 200,000 pieces, The Met houses 1.5 million works of art.)
It’s been really holding me back in learning coding. I felt pretty comfortable at first learning javascript, but as I got further the code was increasingly hard to look back to and understand, to the point I had to spend a lot of time understanding my own code.
Does it truely matter after the code has been compiled if it has more full words or not?
It matters as soon as a requirement change comes in and you have to change something. Writing a dirty ass incomprehensible, but working piece of code is ok, as long as no one touches it again.
But as soon as code has to be reworked, worked on together by multiple people, or you just want to understand what you did 2 weeks earlier, code readability becomes important.
I like Uncle Bobs Clean Code (with a grain of salt) for a general idea of what such an approach to make code readable could look like. However, it is controversial and if overdone, can achieve the opposite. I like it as a starting point though.
I’ve heard it’s because old screens were like 60 character wide
Also punched cards had around 80 columns, which put a hard limit on the number of characters per line.
I recall reading somewhere the earlier compilers had a hard limit on the length of function names, due to memory constraints.
strncpy becomes stringnumbercopy. You can see why short version is used.
Why not just add function overloading to the language and have a function named
copy
that takes a string and an optional character count?And with a bit of namespacing and/or object orientation and usage of dots, it becomes perfectly readable.
There are also camel case and underscores in other languages…
BTW: How on earth should a newcomer know that the letter “n” in that word stands for number without having to google it? The newcomer could even assume that it’s a letter of the word string… And even, if you know that it stands for number, it’s still hard for me to understand what it means in this context… I actually had to google it… But that’s probably some C++ convention I don’t know about, because I don’t program in C++…
C is a little older than namespacing and object orientation. C++ wasn’t even a glimmer in Bjarne’s eye when these conventions were laid down.
And yes, having to google it is part of the design. Originally C programmers would have had to read actual manuals about this stuff. Once you learn the names you don’t really forget so it works well enough even now for ubiquitous standard library functions.
And yet, C was an ergonomic revelation to programmers of the time. Now it’s the arcane grandpa that most youngsters don’t put up with.
man strncpy
How on earth should a newcomer know that the letter “n” in that word stands for number without having to google it?
By looking at the difference between
strcpy
andstrncpy
. Preferably, though, you should simply learn C before writing C.The gist of is is that
strcpy
takes a null-terminated string and copies it somewhere, whilestrncpy
takes a zero-terminated string and copies it somewhere but will not write more thann
bytes.strncpy
literally has exactly one more parameter thanstrcpy
, that beingn
, hence the name. Ifn
is smaller than the string length (as in: distance to first null byte) then you’re bound to have garbage in your destination, and to check for that you have to dereference the pointerstrncpy
returns and check if it’s actually null. Yay C error handling.In retrospect null-terminated strings were a mistake, but so were many other things, at some point you just have to accept that there’s hysterical raisins everywhere.
If
n
is smaller than the string length (as in: distance to first null byte) then you’re bound to have garbage in your return destinationWha? N is just maximum length of string to copy. Data after dst+n is unchanged.
In retrospect null-terminated strings were a mistake, but so were many other things, at some point you just have to accept that there’s hysterical raisins everywhere.
All hail length-prefixed strings!
Data after dst+n is unchanged.
Sure but that means the part before that is garbage because you have a null terminated string without terminator.
Or at least that’s how I see it. If your intention isn’t to start and end with a null-terminated string you should be using memcpy. Let us not talk about situations where
CHAR_BIT != 8
that’s not POSIX anyway.Even better, just avoid doing string manipulation in C.
Let us not talk about situations where
CHAR_BIT != 8
that’s not POSIX anyway.Yeah, let’s not talk about 20-bit one’s complement ints.
It’s from a time keyboards were so hard that you needed to do push ups on your finger tips if you wanted to endure a 9 to 5 programming job.
The reason people love IBM Model Ms nowadays is because the springs have been worn in now and can easily be pressed without additional training.
Rhowch, cwtch, mwyn have to be Welsh. Classicly Welsh sounding words, and mbrsrtowcs, strxfrm can’t possibly be Welsh. Source: my welsh uncle taught me to pronounce Welsh place names.
Wcstold, wcsoll wmffre could be either but sound really weird as Welsh to me.
Wmffre is actually the Welsh spelling of the name “Humphrey”
I love the Welsh, but holy shit that’s not what those letters are supposed to be for. They and the Irish just made a bunch of shit up when they started to standardize spelling. It makes me understand how Russians feel when Westerners use Cyrillic letters improperly.
the letters are “supposed to be” for Latin, a language with only five different vowel sounds.
everyone since has just been making a bunch of shit up.
I get that, and i also understand that English shifted it’s vowels compared to similar languages. But aside from French, my American brain can kinda figure out how to pronounce Germanic and Romance languages, whereas languages such as Welsh and Polish seems to have applied completely different rules to the Latin alphabet than everyone else.
at least welsh and polish actually read the letters that we write down, unlike some languages
So, you’ve got no issues with “g” being sometimes kinda “h”, “j” being same kind of “h” always, “h” not being a sound a all, “d” sounding like “th”, and “z” sounding like “th” but another “th”, not the one for “d”. Oh, and “c” sounding either like “k” or like the latter “th”
I know some people that claim that everyone should use Latin alphabet, because you then know what things sound like, but that is the most bullshit take I ever heard. I guess that knowing how to write letters helps, but it looks like every other language pronounces those letters different, and English makes extra effort to pronounce different even the same things
Having read your comment I’d like your views on “Wrwgwai” - the South American country of Uruguay.
It’s easy. W is a vowel in Welsh. It sounds similar to ö in German and it can be modified as ŵ to elongate the sound such as in the word dŵr which means water.
Wrwgwai or Wcrain (for example) are the natural way to spell those countries using the Welsh alphabet. Its a highly phonetic language believe it or not.
In English it is literally called ‘double u’
Even tho it’s more like a double v. Always struck me as odd.
Afaik, comes from Latin that had no “U” and “V” was both vowel and consonant until some point in time.
Nice.
Easy. The ones with vowels are C library functions.
I prefer names like these to names that are common words. Even the name of the language is annoying because the letter C isn’t exactly uncommon in other contexts. I can’t blame the people who named the language because they did it long before search engines were a thing, but what excuse do people now have?
So you’re saying we should create a programming language called “Welsh” with C-like function names?
No because that would imply that Welsh is not just as valid a language as English and I don’t want to be wedi’i gywiro’n gwrtais.
Are there any of them that are both?
I can’t remember it, but I read one Microsoft blog post (in Vista era?) about how one team at Microsoft would develop some amazing new Windows component. They’d proudly name it AmazingNewService.dll. And then the operating system team would come in and say “that’s all fine and good, but you have to conform to the naming convention.” 8+3 filenames. First two letters probably “MS”, because of reasons. …and 15 years later, people still regularly go “What the fuck is MSAMNSVC.DLL?”
Why are they still so hung up on 8.3 long after Win95?
I get not wanting to have spaces in a filename. Those suck.
Is there something low-level that still doesn’t like long filenames?
Well this was Vista era, they were probably doing that to ensure some sort of expectation from particularly tricky legacy apps. Windows prefers not to break old apps if at all possible.
sounds interesting~, any chance that I can find this post?
Like I said this was in the Vista era. Or possibly before the Vista release, part of the Longhorn hype train (Longhorn got some super hyped features, such as an epic next-generation filesystem to replace NTFS, which Microsoft ultimately canned, and Vista ended up, you know, being Vista).
This was so long ago that I unfortunately don’t remember what exact feature this was about, but it was about some new Windows component.
Thanks anyway, first time know the 8+3 filename thing in MS
wcs?
Welsh?
Ugh apparently my asterisks turned into italics.
All the words with wcs should be functions. Not sure about the rest of them
I still vote for Welsh
Does \ to escape work? *testing*
You need to escape them with backslashes like so: \*
Rhowch, mwyn, and wnffre are Welsh. The rest is nonsense.
What are those in English?
Rhowch = give / enter supply? Kinda? Like “enter password” is where i’ve seen it.
Mwyn = mine as in dig
Wmffre = welsh version of humphrey and i only know that due to my great-grandfather.
Cwtch is the most obviously Welsh word there.
Obviously.
I’m bad at naming things too🖐️
I thought I would be better at this game than I am.