Re: [ai-94] Questions on computational linguistics

From: user 1796355
Sent on: Friday, September 11, 2009 9:59 PM
On Sat, Sep 12, 2009 at 12:51 AM, Manch <[address removed]> wrote:
> Thanks Ben. I think I may have seen StockMood in one of the NewTech meetups.
> If not StockMood then something very similar.
>
> Can you comment on how difficult it is to build your systems? The 2 systems
> you mentioned are interesting in different aspects: financial news has a
> relatively narrow domain. On the other hand, tweets are much shorter and
> much simpler in semantics.

Building the systems, hmm...

There's a scalable, properly-structured database aspect, then a
"getting the data in real-time" aspect ... those are standard but not
necessarily trivial...

You can use standard algorithms like SVM or GP for the actual classification...
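
As a rough illustration of the classification step, here is a minimal sketch of a linear SVM trained by subgradient descent on the hinge loss. It is a toy written for this note, not the code behind either system mentioned in this thread; the function names, learning rate, and regularization constant are all assumptions. In practice you would plug in an existing SVM library instead.

```python
def train_linear_svm(X, y, epochs=200, lr=0.1, reg=0.01):
    """Toy linear SVM: subgradient descent on L2-regularized hinge loss.

    X is a list of feature vectors (lists of floats); y holds labels +1 / -1.
    Hypothetical helper for illustration only.
    """
    w = [0.0] * len(X[0])
    b = 0.0
    for _ in range(epochs):
        for xi, yi in zip(X, y):
            score = sum(wj * xj for wj, xj in zip(w, xi)) + b
            # The regularization term shrinks the weights on every step.
            w = [wj * (1.0 - lr * reg) for wj in w]
            # Hinge loss is active only when the margin is below 1.
            if yi * score < 1.0:
                w = [wj + lr * yi * xj for wj, xj in zip(w, xi)]
                b += lr * yi
    return w, b


def predict(w, b, x):
    """Return +1 (e.g. positive sentiment) or -1 (negative)."""
    score = sum(wj * xj for wj, xj in zip(w, x)) + b
    return 1 if score >= 0 else -1
```

The labels +1 / -1 would come from the marked-up training corpus, and the feature vectors from whatever text-to-vector mapping you settle on.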

You need to build a training corpus, via having experts or Mechanical
Turkers mark up some texts with sentiment ratings...

Then the real creativity comes in how you map texts into feature
vectors.  Pure statistical word-frequency TF-IDF stuff?  Tagging
sentences, extracting common noun phrases, etc.?  This is the part that
can be quick or time-consuming depending on how good you need the
results to be...
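
For the pure word-frequency option, a minimal TF-IDF sketch might look like the following. The tokenizer, the smoothed IDF formula, and the function names are assumptions chosen for illustration; tagging or noun-phrase extraction would replace or extend the `tokenize` step.

```python
import math
import re
from collections import Counter


def tokenize(text):
    # Crude tokenizer (an assumption): lowercase alphanumeric runs.
    return re.findall(r"[a-z0-9']+", text.lower())


def tfidf_vectors(docs):
    """Map each document to a TF-IDF vector over a shared vocabulary."""
    tokenized = [tokenize(d) for d in docs]
    n_docs = len(tokenized)

    # Document frequency: in how many documents each term appears.
    df = Counter()
    for tokens in tokenized:
        df.update(set(tokens))
    vocab = sorted(df)

    # Smoothed IDF, so that terms found in every document keep weight 1.
    idf = {t: math.log((1 + n_docs) / (1 + df[t])) + 1.0 for t in vocab}

    vectors = []
    for tokens in tokenized:
        counts = Counter(tokens)
        total = len(tokens) or 1  # guard against empty documents
        vectors.append([counts[t] / total * idf[t] for t in vocab])
    return vocab, vectors
```

Terms that appear in only a few documents get a higher weight than terms that appear everywhere, which is usually what you want for telling texts apart.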

And text preprocessing.  Do you separate headlines from article text?
How do you weight each?  etc.  Lots of small domain-dependent
decisions...
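
One simple way to handle the headline question, sketched under the assumption that you count terms separately and give headline terms a fixed boost. The function name and the default weight of 3.0 are hypothetical; the right weight is one of those domain-dependent decisions you would tune against your training corpus.

```python
from collections import Counter


def weighted_term_counts(headline, body, headline_weight=3.0):
    """Combine headline and body term counts, boosting headline terms.

    headline_weight is an illustrative default, not a recommended value.
    """
    counts = Counter()
    for word in headline.lower().split():
        counts[word] += headline_weight
    for word in body.lower().split():
        counts[word] += 1.0
    return counts
```

The resulting counts can then feed whatever feature mapping you use downstream in place of raw term counts.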

So, it's a standard datafeed-based DB-backend Web product, with
integration of some existing machine learning code, plus an open-ended
amount of work on statistical and linguistics based text processing...

To get good results ... anywhere from a few man-months up to infinity ;-)

ben g
