Pippilongstrings
From EU-wiki
Pippilongstrings: Reverse-engineering the EU
When you read a directive for the first time, it is impossible to know whether you are reading an EU meme of ancient origin or just some random EU gibberish. Pippilongstrings will help you find out.
The purpose of this project is to identify strings of text that occur many times in the treaties, the acquis communautaire, and ECJ case law. Pippilongstrings assumes that such strings carry meaning. Just as particular genes encode the function of a protein, the law has its own DNA sequences, which express principles without which the law would be neither regular nor predictable.
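The idea of recurring strings can be illustrated with a minimal sketch. It is not taken from the project's software; it simply assumes that a "string" is a run of n consecutive words and that "occurring many times" means appearing in at least min_docs documents. The function names (tokenize, ngrams, frequent_strings) and the sample sentences are invented for illustration and are not quotations from any legal text.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into word tokens."""
    return re.findall(r"\w+", text.lower())


def ngrams(tokens, n):
    """Return every run of n consecutive tokens as a tuple."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def frequent_strings(documents, n=4, min_docs=2):
    """Map each n-word string to the number of documents containing it,
    keeping only strings found in at least min_docs documents."""
    doc_counts = Counter()
    for text in documents:
        # Use a set so a string is counted once per document.
        doc_counts.update(set(ngrams(tokenize(text), n)))
    return {" ".join(g): c for g, c in doc_counts.items() if c >= min_docs}


# Hypothetical sample documents, made up for this sketch.
docs = [
    "Member States shall take all appropriate measures to ensure compliance.",
    "The Commission notes that Member States shall take all appropriate steps.",
    "Nothing in this text repeats.",
]

result = frequent_strings(docs, n=4, min_docs=2)
for phrase, count in sorted(result.items()):
    print(count, phrase)
```

Counting documents rather than raw occurrences keeps one long, repetitive text from dominating the result. A real corpus would probably call for longer strings and a more scalable structure, such as a suffix array, but that is beyond this sketch.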
BETA VIEW: http://www.erikjosefsson.eu/sites/default/files/pippi_CETA_beta.html
Software
http://github.com/stef/le-n-x is a Django web app that does natural language parsing.
Comms
Pad: http://etherpad.com/pippilongstrings
IRC: Pippilongstrings/20100121, Pippilongstrings/20100224
Minutes: Pippilongstrings/20100121/protocol
Corpus
Pippilongstrings/Testing Corpus (already processed)
Corpus candidates:
- Eur-Lex
- TBA verdicts EPO/TBA
- ECJ verdicts?
Misc and other
Articles: http://findarticles.com/p/articles/mi_m1387/is_1_48/ai_57046531/
Other services: http://www.urkund.se/SE/om_urkund.asp
Result from FTA/Korea https://secure.urkund.com/view/2090856-923220-207872
Archived discussion
http://www.ellispub.com/ojolplus/help/celex.htm#sectors2
> If I understand you correctly, in the worst case the table needs three elements:
>
> type         number       non-celex-url
> ___________________________________________________________
> | Regulation | 97/145/EC  |                       |
> | Directive  | 2007/66/EC | OJ:L:2007:335:0031:01 |
> | Directive  | 2002/19/EC |                       |
> | Decision   | 2003/111/EC|                       |
>
> ???
Yes. Except that the column name "non-celex-url" feels a bit off, since the content is not a URL. Do we need to have the table on the wiki? The simplest thing would be to keep it in a text file. But if it is important to have it on the wiki, that would work too.
