From sravich.c at gmail.com Wed Mar 12 12:55:02 2008 From: sravich.c at gmail.com (sravich.c c) Date: Wed Mar 12 12:50:43 2008 Subject: [Mead] Problem with QueryCosine.pl Message-ID: <3eabb5e00803121055mc0ef3ddsb2cc135ea68f2dc3@mail.gmail.com> Sir, I have problem using QueryCosine feature Am trying to run mead using following command line ./mead.pl -meadrc /mead/data/GA3/sample.meadrc GA3 With this I get these errors: Global symbol "$idffile" requires explicit package name at /mead/bin/feature-scripts/QueryCosine.pl line 83. Global symbol "$idffile" requires explicit package name at /mead/bin/feature-scripts/QueryCosine.pl line 88. Global symbol "$idffile" requires explicit package name at /mead/bin/feature-scripts/QueryCosine.pl line 93. Execution of /mead/bin/feature-scripts/QueryCosine.pl aborted due to compilation errors. FATAL: Feature Calculation returned 65280 no element found at line 1, column 0, byte 0 at /usr/lib/perl5/vendor_perl/5.8.8/i386-linux-thread-multi/XML/Parser.pm line 187 our sample.meadrc ************ compression_basis sentences compression_absolute 6 feature QueryTitleCosine /mead/bin/feature-scripts/QueryCosine.pl -q t /mead/data/GA3/GA3.query /mead/data/GA3/docsent classifier /mead/bin/default-classifier.pl Centroid 1 Position 1 Length 9 QueryTitleCosine 1 reranker /mead/bin/cst-rerankers/mmr-reranker.pl 0.6 MEAD-cosine enidf *********** I tried to run write-idf but in vain Can any one help me in solving this problem Thanking you With Regards, Sravich -------------- next part -------------- An HTML attachment was scrubbed... URL: http://lists.si.umich.edu/pipermail/mead/attachments/20080312/40ece894/attachment.htm From sravich.c at gmail.com Thu Mar 13 00:55:10 2008 From: sravich.c at gmail.com (sravich.c c) Date: Thu Mar 13 00:50:48 2008 Subject: [Mead] Re: Problem with QueryCosine.pl In-Reply-To: <3eabb5e00803121055mc0ef3ddsb2cc135ea68f2dc3@mail.gmail.com> References: <3eabb5e00803121055mc0ef3ddsb2cc135ea68f2dc3@mail.gmail.com> Message-ID: <3eabb5e00803122255l4b176142l32006def419b1fd@mail.gmail.com> By passing enidf and lang specifications to QueryCosine.pl solved my problem. I hope this modification is correct...or....are there any dependencies? Changes made are 1)added "enidf ENG" to feature specification in meadrc "feature QueryTitleCosine /mead/bin/feature-scripts/QueryCosine.pl *enidf ENG *-q t /mead/data/GA3/GA3.query /mead/data/GA3/docsent" 2)added following lines to QueryCosine.pl my $idffile = shift; my $lang = shift; . Awaiting reply... With Regards, Sravich On Wed, Mar 12, 2008 at 11:25 PM, sravich.c c wrote: > Sir, > I have problem using QueryCosine feature > > Am trying to run mead using following command line > ./mead.pl -meadrc /mead/data/GA3/sample.meadrc GA3 > > With this I get these errors: > > Global symbol "$idffile" requires explicit package name at > /mead/bin/feature-scripts/QueryCosine.pl line 83. > Global symbol "$idffile" requires explicit package name at > /mead/bin/feature-scripts/QueryCosine.pl line 88. > Global symbol "$idffile" requires explicit package name at > /mead/bin/feature-scripts/QueryCosine.pl line 93. > Execution of /mead/bin/feature-scripts/QueryCosine.pl aborted due to > compilation errors. > FATAL: Feature Calculation returned 65280 > > no element found at line 1, column 0, byte 0 at > /usr/lib/perl5/vendor_perl/5.8.8/i386-linux-thread-multi/XML/Parser.pm line > 187 > > our sample.meadrc > ************ > compression_basis sentences > compression_absolute 6 > feature QueryTitleCosine /mead/bin/feature-scripts/QueryCosine.pl -q t > /mead/data/GA3/GA3.query /mead/data/GA3/docsent > classifier /mead/bin/default-classifier.pl Centroid 1 Position 1 Length 9 > QueryTitleCosine 1 > reranker /mead/bin/cst-rerankers/mmr-reranker.pl 0.6 MEAD-cosine enidf > > *********** > I tried to run write-idf but in vain > Can any one help me in solving this problem > > Thanking you > With Regards, > Sravich > > -------------- next part -------------- An HTML attachment was scrubbed... URL: http://lists.si.umich.edu/pipermail/mead/attachments/20080313/cde82800/attachment.htm From shanusushmita at gmail.com Tue Mar 18 13:01:46 2008 From: shanusushmita at gmail.com (Shanu Sushmita) Date: Tue Mar 18 12:57:02 2008 Subject: [Mead] using mead in real time web scenario Message-ID: Hi, I am having a speed problem when trying to generate a summary for a group of web documents in real time. The time taken to produce summary for one cluster of web pages varies from 5-10 mins on average. Just wanted to confirm if this is expected time or is there some problem with my system. Thanks, Shanu -------------- next part -------------- An HTML attachment was scrubbed... URL: http://lists.si.umich.edu/pipermail/mead/attachments/20080318/915c02c2/attachment.htm From radev at umich.edu Tue Mar 18 13:21:11 2008 From: radev at umich.edu (radev@umich.edu) Date: Tue Mar 18 13:16:23 2008 Subject: [Mead] using mead in real time web scenario In-Reply-To: Message-ID: <20080318182111.DB1A46008301E@belobog.si.umich.edu> Hi, Shanu, If your output is large, you can use some preprocessing to make it shorter, e.g., keep only the first n sentences. Of course, this approach depends on the input type and on many other factors, including your choice of features. Drago > > --===============1629592310== > Content-Type: multipart/alternative; > boundary="----=_Part_17156_9489149.1205863306742" > > ------=_Part_17156_9489149.1205863306742 > Content-Type: text/plain; charset=ISO-8859-1 > Content-Transfer-Encoding: 7bit > Content-Disposition: inline > > Hi, > > I am having a speed problem when trying to generate a summary for a group of > web documents in real time. The time taken to produce summary for one > cluster of web pages varies from 5-10 mins on average. > > Just wanted to confirm if this is expected time or is there some problem > with my system. > > Thanks, > Shanu > > ------=_Part_17156_9489149.1205863306742 > Content-Type: text/html; charset=ISO-8859-1 > Content-Transfer-Encoding: 7bit > Content-Disposition: inline > > Hi,

I am having a speed problem when trying to generate a summary for a group of web documents in real time. The time taken to produce summary for one cluster of web pages varies from 5-10 mins on average.

Just wanted to confirm if this is expected time or is there some problem with my system.
>
Thanks,
Shanu
> > ------=_Part_17156_9489149.1205863306742-- > > --===============1629592310== > Content-Type: text/plain; charset="us-ascii" > MIME-Version: 1.0 > Content-Transfer-Encoding: 7bit > Content-Disposition: inline > > _______________________________________________ > Mead mailing list > Mead@lists.si.umich.edu > http://lists.si.umich.edu/mailman/listinfo/mead > > --===============1629592310==-- > > > -- Dragomir R. Radev Associate Professor SI, CSE, Ling U. Michigan, Ann Arbor http://www.eecs.umich.edu/~radev radev@umich.edu From radev at umich.edu Wed Mar 19 13:57:48 2008 From: radev at umich.edu (radev@umich.edu) Date: Wed Mar 19 13:52:55 2008 Subject: [Mead] Re: Problem with QueryCosine.pl In-Reply-To: <3eabb5e00803122255l4b176142l32006def419b1fd@mail.gmail.com> Message-ID: <20080319185748.77F246008311C@belobog.si.umich.edu> Let me ask Thuy and Bryan. > > --===============1835381872== > Content-Type: multipart/alternative; > boundary="----=_Part_11448_11373549.1205387710868" > > ------=_Part_11448_11373549.1205387710868 > Content-Type: text/plain; charset=ISO-8859-1 > Content-Transfer-Encoding: 7bit > Content-Disposition: inline > > By passing enidf and lang specifications to QueryCosine.pl solved my > problem. > I hope this modification is correct...or....are there any dependencies? > > Changes made are > 1)added "enidf ENG" to feature specification in meadrc > > "feature QueryTitleCosine > /mead/bin/feature-scripts/QueryCosine.pl *enidf ENG *-q t > /mead/data/GA3/GA3.query /mead/data/GA3/docsent" > > 2)added following lines to QueryCosine.pl > my $idffile = shift; > my $lang = shift; > .. > Awaiting reply... > > With Regards, > Sravich > > On Wed, Mar 12, 2008 at 11:25 PM, sravich.c c wrote: > > > Sir, > > I have problem using QueryCosine feature > > > > Am trying to run mead using following command line > > ./mead.pl -meadrc /mead/data/GA3/sample.meadrc GA3 > > > > With this I get these errors: > > > > Global symbol "$idffile" requires explicit package name at > > /mead/bin/feature-scripts/QueryCosine.pl line 83. > > Global symbol "$idffile" requires explicit package name at > > /mead/bin/feature-scripts/QueryCosine.pl line 88. > > Global symbol "$idffile" requires explicit package name at > > /mead/bin/feature-scripts/QueryCosine.pl line 93. > > Execution of /mead/bin/feature-scripts/QueryCosine.pl aborted due to > > compilation errors. > > FATAL: Feature Calculation returned 65280 > > > > no element found at line 1, column 0, byte 0 at > > /usr/lib/perl5/vendor_perl/5.8.8/i386-linux-thread-multi/XML/Parser.pm line > > 187 > > > > our sample.meadrc > > ************ > > compression_basis sentences > > compression_absolute 6 > > feature QueryTitleCosine /mead/bin/feature-scripts/QueryCosine.pl -q t > > /mead/data/GA3/GA3.query /mead/data/GA3/docsent > > classifier /mead/bin/default-classifier.pl Centroid 1 Position 1 Length 9 > > QueryTitleCosine 1 > > reranker /mead/bin/cst-rerankers/mmr-reranker.pl 0.6 MEAD-cosine enidf > > > > *********** > > I tried to run write-idf but in vain > > Can any one help me in solving this problem > > > > Thanking you > > With Regards, > > Sravich > > > > > > ------=_Part_11448_11373549.1205387710868 > Content-Type: text/html; charset=ISO-8859-1 > Content-Transfer-Encoding: 7bit > Content-Disposition: inline > > By passing enidf and lang specifications to QueryCosine.pl solved my problem.
I hope this modification is correct...or....are there any dependencies?

Changes made are
1)added "enidf ENG" to feature specification in meadrc
>
            "feature QueryTitleCosine  /mead/bin/feature-scripts/QueryCosine.pl  enidf ENG -q t /mead/data/GA3/GA3.query /mead/data/GA3/docsent"

2)added following lines to QueryCosine.pl
             my $idffile = shift;
>              my $lang = shift;
.
Awaiting reply...

With Regards,
Sravich

On Wed, Mar 12, 2008 at 11:25 PM, sravich.c c <sravich.c@gmail.com> wrote:
>
Sir,
I have problem using QueryCosine feature

Am trying to run mead using following command line
>      ./mead.pl -meadrc /mead/data/GA3/sample.meadrc GA3

With this I get these errors:

Global symbol "$idffile" requires explicit package name at /mead/bin/feature-scripts/QueryCosine.pl line 83. >
Global symbol "$idffile" requires explicit package name at /mead/bin/feature-scripts/QueryCosine.pl line 88.
Global symbol "$idffile" requires explicit package name at /mead/bin/feature-scripts/QueryCosine.pl line 93. >
Execution of /mead/bin/feature-scripts/QueryCosine.pl aborted due to compilation errors.
FATAL: Feature Calculation returned 65280

no element found at line 1, column 0, byte 0 at /usr/lib/perl5/vendor_perl/5.8.8/i386-linux-thread-multi/XML/Parser.pm line 187 >

our sample.meadrc
************
compression_basis    sentences
compression_absolute    6
feature QueryTitleCosine  /mead/bin/feature-scripts/QueryCosine.pl  -q t /mead/data/GA3/GA3.query /mead/data/GA3/docsent >
classifier /mead/bin/default-classifier.pl Centroid 1 Position 1 Length 9 QueryTitleCosine 1
reranker /mead/bin/cst-rerankers/mmr-reranker.pl 0.6 MEAD-cosine enidf

***********
I tried to run write-idf but in vain
> > > Can any one help me in solving this problem

Thanking you
With Regards,
Sravich
>
>

> > ------=_Part_11448_11373549.1205387710868-- > > --===============1835381872== > Content-Type: text/plain; charset="us-ascii" > MIME-Version: 1.0 > Content-Transfer-Encoding: 7bit > Content-Disposition: inline > > _______________________________________________ > Mead mailing list > Mead@lists.si.umich.edu > http://lists.si.umich.edu/mailman/listinfo/mead > > --===============1835381872==-- > > > -- Dragomir R. Radev Associate Professor SI, CSE, Ling U. Michigan, Ann Arbor http://www.eecs.umich.edu/~radev radev@umich.edu From jorge.garcia-flores at cea.fr Thu Mar 27 10:26:07 2008 From: jorge.garcia-flores at cea.fr (GARCIA FLORES Jorge 704360 IRSN) Date: Thu Mar 27 10:22:28 2008 Subject: [Mead] summarizing french Message-ID: <200803271626.07217.jorge.garcia-flores@cea.fr> Hi. In the French Atomic Energy Comission we would like to summarize french documents with MEAD. I wonder if ther's already a french version of the IDF database, or at least a way to summarize documents with foreign characters (UTF-8 encoding)... right now its impossible to treat documents with accented characters (MEAD produces an error). Thanks in advance for your answer Jorge Garcia-Flores Post-doc au CEA/IRSN Centre de Fontenay-aux-Roses Laboratoire d'Ing?nierie de la Connaissance Multim?dia Multilingue (LIC2M) (Multimedia and Multilingual Knowledge Engineering Laboratory) Bat. 38-2 ; 18, rue du Panorama ; BP 6 92265 Fontenay aux Roses Cedex ; France From radev at umich.edu Thu Mar 27 10:33:33 2008 From: radev at umich.edu (radev@umich.edu) Date: Thu Mar 27 10:28:09 2008 Subject: [Mead] summarizing french In-Reply-To: <200803271626.07217.jorge.garcia-flores@cea.fr> Message-ID: <20080327153333.8022E60083127@belobog.si.umich.edu> Mead should be 8-bit (UTF-8) compliant. Waht sort of error are you getting? Drago > > Hi. In the French Atomic Energy Comission we would like > to summarize french documents with MEAD. I wonder if ther's already a french > version of the IDF database, or at least a way to summarize > documents with foreign characters (UTF-8 encoding)... right now its > impossible to treat documents with accented characters (MEAD > produces an error). > > Thanks in advance for your answer > > Jorge Garcia-Flores > Post-doc au CEA/IRSN > Centre de Fontenay-aux-Roses > Laboratoire d'Ing?nierie de la Connaissance Multim?dia Multilingue (LIC2M) > (Multimedia and Multilingual Knowledge Engineering Laboratory) > Bat. 38-2 ; 18, rue du Panorama ; BP 6 > 92265 Fontenay aux Roses Cedex ; France > _______________________________________________ > Mead mailing list > Mead@lists.si.umich.edu > http://lists.si.umich.edu/mailman/listinfo/mead > > -- Dragomir R. Radev Associate Professor SI, CSE, Ling U. Michigan, Ann Arbor http://www.eecs.umich.edu/~radev radev@umich.edu From shanusushmita at gmail.com Thu Mar 27 11:23:43 2008 From: shanusushmita at gmail.com (Shanu Sushmita) Date: Thu Mar 27 11:18:22 2008 Subject: [Mead] summarizing a reat time web page Message-ID: HI, In order to summarize a group of web pages in real time, is there a possibility to generate the summary for them without having to download them. I mean generating summary on the fly. I saw the demo of mead where when given the url of the web page, the MEAD summarizer generates the summary for it. I guess NewsInessence also did real time summarization? Does it require downloading the webpage first and converting it into docsent format to feed into summariser module? In that case it would eb too hard to use it in the real time with web. NO? any idea or suggestion? Shanu -------------- next part -------------- An HTML attachment was scrubbed... URL: http://lists.si.umich.edu/pipermail/mead/attachments/20080327/1ba67689/attachment.htm From jorge.garcia-flores at cea.fr Thu Mar 27 11:22:56 2008 From: jorge.garcia-flores at cea.fr (GARCIA FLORES Jorge 704360 IRSN) Date: Thu Mar 27 11:19:17 2008 Subject: [Mead] summarizing french In-Reply-To: <20080327153333.8022E60083127@belobog.si.umich.edu> References: <20080327153333.8022E60083127@belobog.si.umich.edu> Message-ID: <200803271722.56264.jorge.garcia-flores@cea.fr> FIRST SCENARIO (without touching Document.pm). Here's the error produced when I run MEAD with a one file cluster (an UTF-8 file with french characters) 11111111111111111111111111111111111111111111111111111111 /home/jg704360/evaluation/mead/bin$ ./mead.pl MORCAS Using system rc-file: /home/jg704360/evaluation/mead/bin/../.meadrc Warning: Can't find user rc-file Cluster: /home/jg704360/evaluation/mead/bin/../data/MORCAS/MORCAS.cluster iconv: S?quence d'?chappement ill?gale ? la position 196 no element found at line 6, column 53, byte 196 at /usr/local/lib/perl/5.8.8/XML/Parser.pm line 187 no element found at line 1, column 0, byte 0 at /usr/local/lib/perl/5.8.8/XML/Parser.pm line 187 no element found at line 1, column 0, byte 0 at /usr/local/lib/perl/5.8.8/XML/Parser.pm line 187 11111111111111111111111111111111111111111111111111111111111 SECOND SCENARIO (with Document.pm::read_document, line 42 commented open (INSTREAM, "iconv -f BIG5 -t UTF-8 $document_filename |"); and changed for open (INSTREAM, "$document_filename"); I get a blank summary, except for those lines where there are no french characters 222222222222222222222222222222222222222222222222222222 /home/jg704360/evaluation/mead/bin$ ./mead.pl MORCAS Using system rc-file: /home/jg704360/evaluation/mead/bin/../.meadrc Warning: Can't find user rc-file Cluster: /home/jg704360/evaluation/mead/bin/../data/MORCAS/MORCAS.cluster [1] [2] [3] [4] [5] [6] [7] [8] [9] [10] [11] [12] [13] [14] [15] [16] [17] [18] [19] [20] [21] [22] [23] [24] [25] [26] [27] [28] [29] [30] Cette loi peut, bien entendu, s'appliquer aux sportifs, spectateurs, organisateurs et journalistes venus assister aux Jeux. [31] D'autres groupes sociaux, ethniques, religieux ou politiques profiteront de l'afflux de journalistes pour attirer l'attention sur leur situation et leurs revendications. 22222222222222222222222222222222222222222222222222222222222222222 Any ideas? Greetings Jorge On Thursday 27 March 2008 16:33, radev@umich.edu wrote: > Mead should be 8-bit (UTF-8) compliant. Waht sort of error are you > getting? > > Drago > > > Hi. In the French Atomic Energy Comission we would like > > to summarize french documents with MEAD. I wonder if ther's already a > > french version of the IDF database, or at least a way to summarize > > documents with foreign characters (UTF-8 encoding)... right now > > its impossible to treat documents with accented characters > > (MEAD produces an error). > > > > Thanks in advance for your answer > > > > Jorge Garcia-Flores > > Post-doc au CEA/IRSN > > Centre de Fontenay-aux-Roses > > Laboratoire d'Ing?nierie de la Connaissance Multim?dia Multilingue > > (LIC2M) (Multimedia and Multilingual Knowledge Engineering Laboratory) > > Bat. 38-2 ; 18, rue du Panorama ; BP 6 > > 92265 Fontenay aux Roses Cedex ; France > > _______________________________________________ > > Mead mailing list > > Mead@lists.si.umich.edu > > http://lists.si.umich.edu/mailman/listinfo/mead From radev at umich.edu Thu Mar 27 15:25:53 2008 From: radev at umich.edu (radev@umich.edu) Date: Thu Mar 27 15:20:27 2008 Subject: [Mead] summarizing a reat time web page In-Reply-To: Message-ID: <20080327202553.C147360083127@belobog.si.umich.edu> > In order to summarize a group of web pages in real time, is there a > possibility to generate the summary for them without having to > download I don't think that you can do that. Or, perhaps you mean, without writing them to disk? If that's what you mean, it should be possible to run mead through a pipe. > them. I mean generating summary on the fly. I saw the demo of mead where > when given the url of the web page, the MEAD summarizer generates the > summary for it. I guess NewsInessence also did real time summarization? Does > it require downloading the webpage first and converting it into > docsent Yes, it converted the web page to docsent. > format to feed into summariser module? In that case it would eb too hard to > use it in the real time with web. NO? It depends on the input size and on your time requirements. > > any idea or suggestion? > > Shanu