Pippilongstrings

From EU-wiki


Pippilongstrings: Reverse-engineering the EU

When you read a directive for the first time, it is impossible to know whether you are reading an EU meme of ancient origin or just some random EU gibberish. Pippilongstrings will help you find out.

The purpose of this project is to identify strings of text that occur many times in the treaties, the acquis communautaire and ECJ case law. Pippilongstrings assumes that such strings carry meaning. Just as particular genes encode the function of a protein, the law has its own DNA sequences, which express principles without which the law would not be regular or predictable.
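
As a rough illustration of the idea, the sketch below collects word sequences of a fixed length that occur in more than one document. It is not taken from le-n-x and does not claim to describe how that app works; the function names `tokenize` and `repeated_strings`, the parameters `n` and `min_docs`, and the two sample sentences are all made up for this example.

```python
import re
from collections import defaultdict


def tokenize(text):
    """Lowercase the text and split it into word tokens."""
    return re.findall(r"\w+", text.lower())


def repeated_strings(docs, n=4, min_docs=2):
    """Return word n-grams that occur in at least `min_docs` documents.

    `docs` maps a document name to its text. The result maps each
    qualifying n-gram (joined with spaces) to the sorted list of
    document names that contain it.
    """
    seen_in = defaultdict(set)
    for name, text in docs.items():
        tokens = tokenize(text)
        # Slide a window of n words over the document.
        for i in range(len(tokens) - n + 1):
            seen_in[" ".join(tokens[i:i + n])].add(name)
    return {
        gram: sorted(names)
        for gram, names in seen_in.items()
        if len(names) >= min_docs
    }


# Invented sample sentences, not quotations from any real act.
sample = {
    "doc_a": "Member States shall ensure that the measures are effective",
    "doc_b": "In addition, Member States shall ensure that access is granted",
}
shared = repeated_strings(sample, n=4)
```

With these two sentences, `shared` holds the four-word strings the documents have in common, each mapped to `["doc_a", "doc_b"]`. A real corpus would probably need longer windows, a higher `min_docs` threshold and some handling of overlapping matches.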

BETA VIEW: http://www.erikjosefsson.eu/sites/default/files/pippi_CETA_beta.html

Software

http://github.com/stef/le-n-x is a Django web app that does natural language parsing.

Comms

Pad: http://etherpad.com/pippilongstrings

IRC: Pippilongstrings/20100121, Pippilongstrings/20100224

Minutes: Pippilongstrings/20100121/protocol

Corpus

Pippilongstrings/Testing Corpus (already processed)

Corpus candidates:

  • Eur-Lex
  • ECJ judgments?

Misc and other

Articles: http://findarticles.com/p/articles/mi_m1387/is_1_48/ai_57046531/

Other services: http://www.urkund.se/SE/om_urkund.asp

Result from FTA/Korea: https://secure.urkund.com/view/2090856-923220-207872

Archived discussion

http://www.ellispub.com/ojolplus/help/celex.htm#sectors2

> If I understand you correctly, the table needs three elements in the worst case:
>
>     type            number              non-celex-url
> ___________________________________________________________
> |  Regulation  |   97/145/EC    |                          |
> |  Directive   |   2007/66/EC   |   OJ:L:2007:335:0031:01  |
> |  Directive   |   2002/19/EC   |                          |
> |  Decision    |   2003/111/EC  |                          |
>
> ???
Yes. Except that the column name "non-celex-url" feels a bit wrong, since
the content is not a URL.
Do we need to have the table on the wiki? The simplest option would be to
keep it in a text file. But if it is important to have it on the wiki, that
would work too.
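
If the table were kept in a plain text file, as suggested in the reply, reading it back could look roughly like the sketch below. This is only an assumption about the layout: it expects pipe-delimited rows like the ones quoted above, and the names `ActRef`, `parse_act_table` and the field name `oj_ref` (standing in for the disputed "non-celex-url" column) are hypothetical.

```python
from typing import List, NamedTuple


class ActRef(NamedTuple):
    kind: str    # e.g. "Regulation", "Directive", "Decision"
    number: str  # e.g. "2007/66/EC"
    oj_ref: str  # third column; empty string when the cell is blank


def parse_act_table(text: str) -> List[ActRef]:
    """Parse pipe-delimited rows into ActRef records.

    Lines that do not start with "|" (header, ruler, blank lines)
    are skipped, as are rows that do not have exactly three cells.
    """
    rows = []
    for line in text.splitlines():
        line = line.strip()
        if not line.startswith("|"):
            continue
        # Drop the leading pipe and one trailing pipe, if present.
        body = line[1:-1] if line.endswith("|") else line[1:]
        cells = [cell.strip() for cell in body.split("|")]
        if len(cells) != 3:
            continue
        rows.append(ActRef(*cells))
    return rows


table_text = """
    type            number              non-celex-url
___________________________________________________________
|  Regulation  |   97/145/EC    |                          |
|  Directive   |   2007/66/EC   |   OJ:L:2007:335:0031:01  |
|  Directive   |   2002/19/EC   |                          |
|  Decision    |   2003/111/EC  |                          |
"""
acts = parse_act_table(table_text)
```

Each entry in `acts` is a small record, so the second row can be read as `acts[1].kind`, `acts[1].number` and `acts[1].oj_ref`.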

