|
|
libcats.org
Practical Text Mining with PerlRoger BilisolyProvides readers with the methods, algorithms, and means to perform text mining tasksThis book is devoted to the fundamentals of text mining using Perl, an open-source programming tool that is freely available via the Internet (www.perl.org). It covers mining ideas from several perspectives — statistics, data mining, linguistics, and information retrieval — and provides readers with the means to successfully complete text mining tasks on their own.The book begins with an introduction to regular expressions, a text pattern methodology, and quantitative text summaries, all of which are fundamental tools of analyzing text. Then, it builds upon this foundation to explore: * Probability and texts, including the bag-of-words model * Information retrieval techniques such as the TF-IDF similarity measure * Concordance lines and corpus linguistics * Multivariate techniques such as correlation, principal components analysis, and clustering * Perl modules, German, and permutation tests Each chapter is devoted to a single key topic, and the author carefully and thoughtfully introduces mathematical concepts as they arise, allowing readers to learn as they go without having to refer to additional books. The inclusion of numerous exercises and worked-out examples further complements the book's student-friendly format.Practical Text Mining with Perl is ideal as a textbook for undergraduate and graduate courses in text mining and as a reference for a variety of professionals who are interested in extracting information from text documents.
Популярные книги за неделю:
Система упражнений по развитию способностей человека (Практическое пособие)Автор: Петров Аркадий НаумовичКатегория: Путь к себе
Размер книги: 818 Kb
Сотворение мира (3-х томник)Автор: Петров Аркадий НаумовичКатегория: Путь к себе
Размер книги: 817 Kb
Elementary surveying. An introduction to geomaticsАвтор: Ghilani C.D., Автор: Wolf P.R.Категория: P_Physics, PGp_Geophysics
Размер книги: 43.64 Mb
Только что пользователи скачали эти книги:
Investigations in Universal Grammar: A Guide to Experiments on the Acquisition of Syntax and SemanticsАвтор: Stephen Crain, Автор: Rosalind Thornton
Размер книги: 4.70 Mb
Handbook of Research on Teaching Literacy Through the Communicative and Visual ArtsАвтор: James Flood, Автор: Diane Lapp, Автор: Shirley Brice Heath, Автор: Shirley Brice HeathКатегория: Искусство
Размер книги: 101.63 Mb
Management of Cardiac Arrhythmias, Second Edition (Contemporary Cardiology)Автор: Gan-Xin Yan, Автор: Peter R. KoweyКатегория: Экономика
Размер книги: 13.47 Mb
Books, Bytes and Business: The Promise of Digital PublishingАвтор: Bill Martin, Автор: Xuemei Tian
Размер книги: 21.32 Mb
Multiple Sensorial Media Advances and Applications: New Developments in MulsemediaАвтор: George Ghinea, Автор: Frederic Andres, Автор: Stephen Gulliver
Размер книги: 7.04 Mb
|
|
|