Skip to main content

Fun with Modulo, Leap Years! and Unicode Hell (ruby program pt. 5)

Unicode Hell
This is cool. I innocently picked a simple problem (reversing text) to learn the Ruby programming language. I got more than I bargained for. Turns out that Unicode is supposed to be handled on the upcoming (since 5 years ago) Ruby 2.0.... great. So I'll have to implement an ad-hoc solution on my own. Part of the problem is that there are so many ways to encode text in bytes. And the other is that there doesn't seem to be a simple standard. The first problem is that Unicode text can be part of multiple character sets with different lengths. The most common UTF-16 should solve my program's problems but will trip up with weird characters in very obscure languages like say Cree. The second problem is how do you figure out which character set is a text in? I'm stumped for now.

Leap Years!
If you read my post on modulo you know I'm very excited about learning about this new programming operation and its use in jargon. (I'll have the protein burger! That's a normal burger modulo bun.) But right after that, I was trying to calculate a quick way to determine if a year is a leap year. So I looked up in Wikipedia and found out that they have a small program that tells you and it uses modulo! I was psyched. I read it and understood it right away.

Here is the pseudo-code from Wikipedia:

My leap year Ruby Program
if year modulo 400 is 0
       then is_leap_year
else if year modulo 100 is 0
       then not_leap_year
else if year modulo 4 is 0
       then is_leap_year
else 
       not_leap_year

Notice that if a year divides by 100 but not 400, it is not a leap year. Kinda tricky, huh? All this so March 21 can be around spring equinox.

Here is my program:
I tried to write it as close to the pseudo-code as I could. Notice how similar they are. But mine is code. It runs. By the way, elsif is a contraction of else if; I could have written it either way. (Elsif sounds like an elven maiden name, right?) The stuff below the yellow highlight gets text from the user, chops off the last end-of-line character or return. Converts the input to an integer, (a very important step that I missed originally) and out puts the result.

Now what I want to do is make it a method call that works after a period. So you can go year.leap? and get true if it's a leap year, false if it's not and nil if it's a string.

Fun with Modulo
All this is fun really got me thinking about one of pet peeves about the way math is explained. I hear (I used it above) "If a year divides by 100 but not 400, it's not a leap year." Well that's hogwash; any year will be divisible by 400. Take the non-leap year 1000 (Y1K!). Divide 1000/400 and get 2½. Oh the horror! The world goes crazy! A fraction! A decimal! (2.5) What are we to do?? Talk about confusing for a kid (or an adult preparing for the GRE).

Well now I know how to say this concept without doing some weird mathematical impossibility. If a year modulo 100 is 0 but modulo 400 is not, it's not a leap year. It just sits down, proper and still, and doesn't go around gallivanting, jumping and leaping about like that year 2K.

ps. Y1K was a leap year, because the calendar calculation I'm using is from the Gregorian Calendar, created in 1500's. But people forget how they were in their youth and judge the next generation harshly nonetheless... 

Comments

Popular posts from this blog

How to configure Ubuntu's keyboard to work like a Mac's

Typing accents on a PC is a complicated Alt + three numbered code affair. One feels like a sorcerer casting a spell. "I summon thee accented é! I press the weird magical key Alt, and with 0191 get the flipped question mark!" For a bilingual person this meant that writing on the computer was a start-and-stop process. With Mac's it a whole lot easier, just Alt + e and the letter you wanted for accents and alt + ? for the question mark. No need to leave the keyboard for the number pad and no need to remember arcane number combinations or have a paper cheat sheet next to the keyboard, as I've seen in virtually every secretaries computer in Puerto Rico.

Linux has a interesting approach to foreign language characters: using a compose key. You hit this key which I typically map to Caps Lock and ' and the letter you want and voilá you get the accent. Kinda makes sense: single quotation mark is an accent, double gets you the ümalaut, works pretty well. Except for the ñ, wh…

Fixing Autocomplete in Github's Atom Text Editor for Ruby

I really like Github's Atom Text Editor. I really like that it's multi-platform allowing me to master one set of skills that is transferable to all platforms and all machines. 

On thing that just burns me of the default set-up in Atom is the Autocomplete feature that seems to change my words as a type them. Because Ruby uses the end of line as a terminus for a statement you usually finish a word with pressing the return button and you get really annoying changes to your finished typed word a la MS Word. I find myself yelling "No that's not what I wrote!" at the screen in busy coffee shops.

I disabled autocomplete for a while but it is a very useful function. Then I found out they changed the package that gave the autocomplete to a new one called "Autocomplete Plus" that gives you more options. All that I needed to change to make autocomplete sane again:

1. Open Atom's Preferences
2. Search the bundled packages for "Autocomplete Plus"

3. Go to t…

Contrasting Styles of Writing: English vs. Spanish

There is interestingly enough a big difference between what's considered good writing in Spanish and English. V.S. Naipul winner of the 2001 Nobel prize for literature publish an article on writing. In it he emphasizes the use of short clear sentences and encourages the lack of adjectives and adverbs. Essentially he pushes the writer to abandon florid language and master spartan communication. This is a desired feature of English prose, where short clipped sentences are the norm and seamlessly flow into a paragraph. In English prose the paragraph is the unit the writer cares about the most.

This is not the case in Spanish where whole short stories (I'm thinking this was Gabriel Garcia Marquez but maybe it was Cortázar) are written in one sentence. Something so difficult to do in English that the expert translator could best manage to encapsulate the tale in two sentences. The florid language is what is considered good writing in Spanish but unfortunately this has lead to what …