Author: mark

  • Competitive intelligence tools

    In an earlier post I suggested that too much competitive analysis too early might be a bad idea. But it got me thinking about the tools that are available for gathering competitive intelligence about a business and what someone else might be using to gather data about my business.

    Archive.org

    One of my favorites! Use archive.org to see how your competitors website evolved from the early days until now. If they have a robots.txt blocking iarchive (archive.org’s web crawler) then you’re not going to see anything, but most websites don’t block the crawler. Here’s Google’s early home page from 1998.

    For extra intel, combine Alexa with archive.org to find out when your competitors traffic spiked, and then look at their pages during those dates on Archive.org to try and figure out what they did right.

    Yahoo Site Explorer

    Site explorer is useful for seeing who’s linking to your competitor i.e. who you should be getting backlinks from.

    Netcraft Site Report

    Netcraft have a toolbar of their own. Take a look at the site rank to get an indication of traffic. Click the number to see who has roughly the same traffic. The page also shows some useful stuff like which hosting facility your competitor is using.

    Google pages indexed

    What interests me more than pagerank is the number of pages of content a website has and which of those are indexed and are ranking well. Search for ‘site:example.com’ on Google to see all pages that Google has indexed for a given website. Smart website owners don’t optimize for individual keywords or phrases, but instead provide a ton of content that Google indexes. They then work on getting a good overall rank for their site and getting search engine traffic for a wide range of keywords. I blogged about this recently on a friends blog and it’s called the long tail approach.

    If I’m looking at which pages my competitor has indexed, I’m very interested in what specific content they’re providing. So often I’ll skip to result 900 or more and see what the bulk of their content is. You may dig up some interesting info doing this.

    Technorati Rank, Links and Authority

    If you’re researching a competing blog, use Technorati. Look at the rank, blog reactions (inbound links really) and the technorati authority. Authority is the number of blogs linking to the blog you’re researching in the last 6 months.

    Alexa

    Sites like Alexa, Comscore and Compete are incredibly inaccurate and easy to game. Just read this piece by the CEO of plenty of fish. Alexa provides an approximation of traffic. It’s also subject to anomalies that throw the stats wildly off. Like the time that Digg.com overtook Slashdot.org in traffic. Someone on Digg posted an article about the win and all the Digg visitors went to Alexa to look at the stats and many installed the toolbar. The result was a big jump in Digg’s traffic according to Alexa when nothing had changed.

    Google PageRank

    PageRank is only updated about once every 2 or more months. New sites could be getting a ton of traffic and have no pagerank, while older sites can have huge pagerank but very little content and only rank well for a few keywords. Install Google Toolbar to see pagerank for any site. You may have to enable it in advanced options.

    nmap

    This may get you blocked by your ISP and may even be illegal, so I’m just mentioning it for informational purposes and because this may be used on you. nmap is a port scanning tool that will tell you what services someone is running on their server, what operating system they’re running, what other machines are on the same subnet and so on. It’s a favorite used by hackers to find potential targets. It also has the potential to slow down or harm a server. It’s also quite easy to detect if someone is running this on your server and find out who they are. So don’t go and load this on your machine and run it.

    Compete

    Compete is basically an Alexa clone. I never use this site because I’ve checked sites that I have real data on and compete seems way off. They claim to provide demographics too, but if the basics are wrong, how can you trust the demographics.

    whois

    I use unix command line whois, but you can use whois.net if you’re not a geek. We use a domain proxy service to preserve our privacy, but many people don’t. You’ll often dig up some interesting data in whois, like who the parent company of your competitor is, or who founded the company and is still the owner of the domain name. Try googling any corporation or personal names you find and you might come up with even more data.

    HTML source of competitors site

    Just take a glance at the headers and footers and any comments in the middle of the pages. Sometimes you can tell what server platform they’re running or sometimes a silly developer has commented out code that’s part of a yet unreleased feature.

    Personal blogs of competitors and staff

    If you’re researching linebuzz.com and you’re my competitor, then it’s a good idea to keep an eye on this blog. I sometimes talk about the technology we use and how we get stuff done. Same applies for your competitors. Google the founders and management team, find their blogs and read them regularly.

    dig (not Digg.com)

    dig is another unix tool that queries dns servers. Much of this data is available from netcraft.com mentioned above. But you can use dig to find out who your competitor uses for email with ‘dig mx example.com’ and you can do a reverse lookup on an ip address which may help you find out who their ISP is (netcraft gives you this)

    Another useful thing that dig does is give you an indication how your competitor is set up for web hosting – if they’re using round-robin DNS or a single IP with a load sharer.

    traceroute

    Another unix tool. Run: ‘/usr/sbin/traceroute www.example.com’ and you’ll get a list of the path your traffic takes to get to your competitors servers. Look at the last few router hostnames before the final destination of the traffic. You may get data on which country and/or city your competitors servers are based in and which hosting provider they use. There’s a rather crummy web based traceroute here.

    Google alerts

    Set up Google news, blog and search alerts for both your competitors brands and your own because your competitors may mention you in a blog comment or somewhere else.

    There is a lot more information available via SEC filings, Secretary of State websites and so on – perhaps the subject of a future entry.

  • Business innovation for developers

    Many entrepreneurs, particularly the MBA set, start with competitive analysis. Sure, it’s a valid approach and you might find a gap in the market that you can easily fill or a product or service that could do with some improvement. But if Larry and Sergei did that before they started playing with the PageRank algorithm, they might not have gotten as far as the first keystroke.

    Here’s a list of 98 social networking websites on Wikipedia – in case you’re looking at getting into that space.

    Many of Einstein’s most original ideas occured to him outside of academia while at the patent office from 1903 to 1911, including his paper on electrodynamics of moving bodies which proposed the idea of special relativity.

    When I chat with friends and fellow entrepreneurs, I’ll throw out an idea and the reaction is often a comparison to other ideas. “So and so is working on something similar” or “you should take a look at such and such”. So existing ideas and products are the departure point for our conversation.

    If you’re a developer, your strength is in your ability write original code, not in your ability to analyze the market-place.

    If you have an idea and you have the ability to implement it yourself, I recommend developing it somewhat before doing any competitive analysis or exposing it to your friends and family. Just take an extra week to play with it. You might come up with a completely original idea.

  • Negotiating your aquisition

    Just a tiny bit of wisdom I picked up along the way. As always, I am not a lawyer and this is not legal advice.

    If you’re selling your business, you’re going to be handed an M&A agreement. That agreement is probably going to have something like 15 pages of representations and warranties – things that you claim are true about your business. At the end of the reps and warranties, there’s going to be a penalty section. The penalty section defines what happens to you if any of the reps and warranties are found to be incorrect or if a lawsuit arises out of something that occurred before the sale of your business.

    You may hire an M&A attorney. Said attorney may tell you that it’s going to take 3 weeks and many billable hours to go through the reps and warranties with a fine tooth comb. You may find yourself unwilling to pay for said attorney’s private jet, in which case I strongly recommend focusing on the penalties at the end of the reps and warranties. The more the penalties are reduced, the less the reps and warranties matter.

    It’s a great indication if you find yourself on the same side of the table as your acquirer arguing with the lawyers because you’re both so excited to see the deal happen. If you don’t have that experience, then think twice about what you’re getting into.

  • Extreme sports conflagration – and ground effect

    I just noticed one of my youtube videos has over 20,000 views. I put it together a while ago using windows movie editor and a flash ripper. The soundtrack is Breaking Benjamin, Blow me Away. The tube is at Teahupoo and is one of the biggest ever ridden. The human glider is in Verbier, Switzerland and is Loïc Jean-Albert cruising 15 ft above the snow. The train surfer is unidentified and is in Germany surfing a bullet train at 330Km/h. The skater is Bob Burnquist in his own back yard on a $250,000 ramp of his own making. The climber jumping off the cliff is the late great climbing legend, Dan Osman.

    One of the comment threads on youtube wonders if there’s an air cushion that helps Jean-Albert stay so close to the ground with such accuracy. The phenomenon is called Ground Effect and pilots are taught to add a little extra downward pressure to counteract it when landing. It’s also very dangerous on takeoff because your plane can become airborne before you reach takeoff velicity and as soon as you climb away from the ground you stall and crash.

    Ground Effect is caused when a wing’s height above the ground is less than it’s span. It’s caused by a reduction in induced drag. Wing-tip vortices are reduced and there’s also a reduction in upwash and downwash. That means that when you’re experiencing ground-effect, the amount of thrust required to produce lift is reduced.

    If Jean-Albert is in fact experiencing ground-effect, then the amount of altitude he has to trade for speed is reduced. But I doubt it. When a wing is at a height equal to it’s span, the decline in induced drag is only 1.4%. When it’s at a height equal to one fourth of it’s span, the loss of induced drag is about 24%. So if Jean-Albert is 6ft tall, he would have to be 6ft from the ground to begin experiencing ground-effect and 2ft from the ground to get any real benefit from it.

    At that altitude, he’s probably going to get smacked by a mogul on his way down.

  • How to start a startup – for developers

    Creating your own startup is easy. You don’t need an MBA. You don’t need an expensive law firm. You don’t need an ‘older more mature CEO’ to hold your hand. And if you can afford 6 months without an income and $5k to get started, you might not even need a VC or Angel investor.

    You just need to be OK with researching to find the info you need and acting on it. I’ve written this as a very basic guide to getting started for under $5k – or $4985 to be exact.

    Inc yourself – $415 + $50 for Nolo books

    You need to create a corporation so that you have a degree of legal protection. You get to choose from a C Corp, an S Corp or an LLC. Forget about partnerships or Sole Trader. I’ve always used a C Corp because it’s the only structure that lets foreigners invest in your corporation, and that’s a requirement of mine.

    I’ve always Inc’d myself and not used a law firm. I don’t think it’s necessary and I’ve sold C Corps that I’ve Inc’d myself without any admin issues. Visit NOLO.com and buy a book for your state and do it yourself.

    We are incorporated in Delaware and Kerry just sent me the following data for your enjoyment:

    Delaware startup costs: $90 filing fee, $50 for a registered agent ($140 total)
    Delaware annual fees due end of year: $60 filing and franchise taxes (up to 3000 shares authorized), $50 registered agent ($110 total)

    You also need to file as a foreign corporation where the corp does business or has a physical presence. Washington fees are as follows:

    Washington startup fees: $205 filing/license fees, $70 registered agent ($275 total)
    Washington annual fees due end of year: $90 filing/license fees, $70 registered agent ($160 total)

    Trademark yourself – $2,200

    Sure you’ve got an awesome domain name, but if it’s cocacola.com you’re in trouble. Use a lawyer to do a trademark search. It’s going to cost you $300 to $500 bucks and it’s worth every cent. You can do an initial search on the USPTO website for free, but get a law firm to do this for you because there are more databases than just the USPTO.

    It might come back with some pizza joint out in Smallville USA that has a name vaguely similar to yours – just ignore that. You’re worried about companies that target the same market as you do.

    Then use a lawyer to file your trademark with the USPTO. Once you have it on file and can start counting down to that magical 5 year date when your trademark becomes incontestable. This will cost about $1,700
    I had the unfortunate experience of getting sued over trademark infringement a few years ago, so this is now one of the first things I take care of.

    Provisionally Patent your technology – $2,000

    If you’re wicked smart and invented something new, then file a provisional patent. This buys you 1 year during which you must file a full patent. The provisional patent can cost as little or as much as you like.

    You can write and file your own provisional patent for as little as $200. That’s a bad idea because when your company is worth something and it’s time to sell, and you’re doing due diligence with your future acquiror, they’re going to ask which law firm did your provisional patent and you’re going to say proudly that you did it yourself and the conversation will move on as if your little patent doesn’t exist.

    Use a law firm.

    The full patent that you must file within a year may cost as much as $25,000, but by that time you’re hopefully making so much cash out of your wicked smarts that you can afford that.

    Do your own accounts – $300 (quickbooks pro license + course material)

    I put myself on an accounting course a while back.I also did a quickbooks course. Once complete, I had enough knowledge to manage my own corporate accounts and promptly turned our books over to my amazing wife who now handles all our accounts and legal stuff. (evil grin) She’s actually lying downstairs reading a book on intellectual property as I write this. Crazy woman!

    If you’re going to do your own accounts, learn the fundamental double entry system and basic accounting equation. Then also learn how to use your accounting software – Quickbooks is awesome for corporate accounts. Quickbooks abstracts a lot of the internal stuff away from you, like the double entry system – and that’s ok for a while, but eventually you’re going to hit something a little complicated and you need to have that basic knowledge to bail you out. Or you can just buy a CPA for $50/hour and get them to help you out when you get stuck.

    Alternatively…

    Just outsource your accounts. Quite a few of my friends do and that works for them too. Just make sure you get a competent accountant and at least have enough knowledge to ask the right questions and double check some of their work.

    Learn basic finance – $20.00

    Read this book. Read the whole book. Do not pass go, do not think you can outsource understanding your balance sheet or understanding the time value of money. You need a basic knowledge of finance.

    Now, drop everything and just do it

    Once you’ve got all this basic but necessary stuff out of the way, build your business as fast as you can!!

    If you think it’s going to take a month to create your prototype, give yourself 2 weeks and do it in 3. Your time is your most expensive commodity, so haul ass and make it happen!!

    Just don’t forget to listen to your customers and let them shape your product because they are, after all, the only reason your business exists. More about that in a future entry.

    Disclaimer: This is not legal advice and I am not a lawyer.

  • SEO, the long tail approach

    I recently guest blogged an entry titled the long tail approach to SEO on a friends new blog. It’s based on some of my own experience and was inspired by a post on webmasterworld from someone who is about to exceed 1 million uniques from SEO with around 100k pages of content, which isn’t much at all.

  • Lessons from three weeks of intensive I18N

    When we launched LineBuzz on May 10, we had no idea that most of our press coverage was going to be Japanese. A site called 100Shiki.com put us up as dot-com of the day. All of a sudden we had lots of Japanese users. A few days later, a very popular blogger in China gave us a mention and we had lots of Chinese users too. Within a week we had over 15 languages on the site.

    Three intense weeks later we launched an I18N version of the site.

    Here’s a brief summary of some of the key issues we had to deal with when i18n’ing an app that has 50/50 client-server code and lots of communication between the two.

    The code that is LineBuzz is very text intensive by the nature of the application. We provide inline comments without a browser plugin. One of the unique things about LineBuzz is that it doesn’t matter which page you post an inline comment on. The comment will appear anywhere on the website where the text and its surrounding paragraph appears.

    So as you can imagine, we use a lot of regular expressions, character code conversions and text lengths.

    Safari – not the worlds best browser

    The first thing that broke was Safari. Safari’s regex engine in Javascript is seriously busted. It doesn’t support unicode characters at all. IIRC it simply returns true for any regex with unicode. So their claim that it’s the worlds best browser really irks me. So I had to write a fix-safari layer for anything that involved processing unicode.

    No round-trip for jp charsets

    The next thing that bit me was Japanese character set support. The Japanese use two main character sets: EUC-JP and Shift_JIS. The latter is a product of windows and the former is from unix. These both caused a major headache because they don’t round-trip convert to Unicode. Translated, that means that you can’t convert these characters to a unicode character set like UTF-8 and then convert them back to their native character set and expect the original to equal the converted characters. The solution: Store the raw character data for all character sets as binary and only convert to unicode if I absolutely must. I use UTF-8 on linebuzz.com, so that’s a scenario where I convert from binary to UTF-8.

    When is a space not a space

    Another thing that bit me was space character codes and spaces in regex. In unicode there are about 20 different space characters. Some regex engines are smart and recognize them all. Others only recognize the traditional ascii space character. So routines that for example, removed spaces, had to be hand tailored to deal with every unicode space.

    String.charCodeAt() == lies lies lies!!

    Character codes differ across operating systems. Some character sets contain characters that have a different character code on OS X than on Unix. Yes, even in the same browser using the same javascript engine (firefox for example), the character codes are different. So any routines that rely on consistent character codes across platforms have to deal with this little nightmare.

    All this is behind us now and the Linebuzz code handles any character set in any language beautifully.