Never seen this being done before.
I contacted a Bank of America representative over the telephone, and they needed to authenticate me before we could get down to business. I was asked my name, and was then asked to confirm a series of questions. I was not asked to state these facts myself - they stated it themselves, and asked me to confirm it.
Silly way to verify a person's identity, one would think. Anyone could pass of as me, if they had my name and my account number - and if they simply confirmed every detail that the CSR gave them. However, the CSR did make one minor mistake while stating the details - my phone number was off by one number, and I corrected them promptly. Looking back, it is quite clear that the mistake was deliberate. In this way, I did not say out any of my personal details out loud (which would have been terrible in public), and I pretty much authenticated myself by correcting one random mistake that they chose to make.
September 11, 2011
April 03, 2011
"Quadratic space will kill you faster than quadratic time"
Learnt it the hard way. Ouch.
(Prof Steven Skiena: Lecture on homology)
March 20, 2011
A distributed pipeline for processing text
Usually, Hadoop is the way to go.
However, I have joined a project that has been underway for more than a year, and the processes have been written in mostly an ad-hoc way - shell, python, and Java standalone programs. Converting each of these to mappers and reducers would have been an arduous task.
I decided to re-write the pipeline in SCons. There are many things about this pipeline that represent a conventional build. There are dependencies, and usually newer functionality/processing is added to the later stages of the pipeline. Luckily, SCons takes in regular python functions as "Builders", which I hooked into xml-rpc functions, and we soon had SCons running the pipeline on multiple servers (just five, actually - that's all we'd get for our pipeline). The file-system is an NFS share, which simplifies things a great deal.
Python, however, has been a bit on the slower side. Also, invoking the Java VM every time you need to process a file feels like too much of an overhead. So while the pipeline is functional, and processes the corpus much faster than before (5-6 hours vs 20+ earlier), we are considering re-writing the XML-RPC server in Java. The standalone programs can be easily ported to the server implementation, and invoking shell scripts from Java shouldn't be very different from invoking them from python - things should only improve. I wonder, however, if I should have written this in Hadoop to start with.
However, I have joined a project that has been underway for more than a year, and the processes have been written in mostly an ad-hoc way - shell, python, and Java standalone programs. Converting each of these to mappers and reducers would have been an arduous task.
I decided to re-write the pipeline in SCons. There are many things about this pipeline that represent a conventional build. There are dependencies, and usually newer functionality/processing is added to the later stages of the pipeline. Luckily, SCons takes in regular python functions as "Builders", which I hooked into xml-rpc functions, and we soon had SCons running the pipeline on multiple servers (just five, actually - that's all we'd get for our pipeline). The file-system is an NFS share, which simplifies things a great deal.
Python, however, has been a bit on the slower side. Also, invoking the Java VM every time you need to process a file feels like too much of an overhead. So while the pipeline is functional, and processes the corpus much faster than before (5-6 hours vs 20+ earlier), we are considering re-writing the XML-RPC server in Java. The standalone programs can be easily ported to the server implementation, and invoking shell scripts from Java shouldn't be very different from invoking them from python - things should only improve. I wonder, however, if I should have written this in Hadoop to start with.
December 20, 2010
Correlation != Causation
(from PeteSearch)
My dog Thor hates getting wet, but even when there's rain lashing against the windows he still starts off dancing in circles when it's time for his walk. It's only when I pull out his yellow rain jacket that he slumps and stares at me mournfully. He seems convinced that if I just left the jacket off, the rain would go away.
Much as I try and convince him of the error in his logic, he's unmoved, and it's hard to blame him. Humans will happily swallow studies that use the weasel word 'link' to claim something that is associated with an outcome is its cause … I half-expect to get up one morning and discover that Thor's eaten the raincoat, in the hope of bringing back the sun …
December 19, 2010
Random funny
(from http://neil.fraser.name/news/2007/04/07/ ... the carving there is great too!)
Early Friday morning I walked by a garbage can outside one of the Google cafés. A bird which had been feeding on scraps saw me coming, exploded out of the garbage can and flew off. A passing coworker looked at me incredulously and said "Someone threw out a perfectly good bird."
October 11, 2010
The other side
This is my first attempt at making noodles, the Indo-Chinese way. I put a bit too much of soy sauce, but otherwise, it was nice. In fact, it was good enough for Hao to prefer it over the cucumbers that he had cooked! I wasn't that great with the chopsticks though, and used them as long as the volume of noodles was large enough for me to aim at.
September 09, 2010
Finally ...
... I had someone from China mistake me for a Chinese person. This, after years of being told that I look like Chinese person while in India. Inevitable, I guess.
September 07, 2010
Cabbage and rice tastes great!
Or maybe it is the hunger speaking. I didn't have breakfast - skipped directly to lunch.
This was a day after I visited NYC. Thanks to Shrikanth, I have some photos from my university now.
Have a look. NYC photographs will follow soon.
Edit: Let's try a slideshow -
This was a day after I visited NYC. Thanks to Shrikanth, I have some photos from my university now.
Have a look. NYC photographs will follow soon.
Edit: Let's try a slideshow -
July 16, 2010
The weaker sex?
From http://esr.ibiblio.org/?p=2118:
More like the wiser one.
... women, in general, are not willing to eat the kind of shit that men will swallow to work in this field. Now let’s talk about death marches, mandatory uncompensated overtime, the beeper on the belt, and having no life. Men accept these conditions because they’re easily hooked into a monomaniacal, warrior-ethic way of thinking in which achievement of the mission is everything. Women, not so much. Much sooner than a man would, a woman will ask: "Why, exactly, am I putting up with this?"
... if we really want to fix the problem of too few women in computing, we need to ask some much harder questions about how the field treats everyone in it.
More like the wiser one.
June 11, 2010
Using unicode to annotate emails
I have been using unicode characters like ★, ✘, ✔, ⚠ to annotate my gmail labels. I used it today to mark the subject of one of my mails as a high-priority mail. Wonder if it'll work.
Subscribe to:
Posts (Atom)