Thanks! Let us just see if anyone else have good ideas. :-)
Type: Posts; User: Checker1977
Thanks! Let us just see if anyone else have good ideas. :-)
Thanks. I understand the solution is not trivial. I just want to know whether there are any existing solution which could be used extract keywords from a specific page, like open source?
I have no...
I mean the semantics meaning of a web page. For example, when you browse MSN money pages, you got keywords like financial, stock, debts, layoff, auto industry, something like this.
Just like when...
Any reference, code/paper/turotials are fine. My purpose is just to extract keywords from a web page. Another job is to category the web page -- for example, identify it as financial web page or...
Sorry, I do not think you answered what I asked. :-)
I am asking what technologies could be used to extract keywords and category information from web page, not how to get the web page (including...
I am going to write a Demo to extract keyword and category of a web page, does anyone know any open sources/samples/documents/book for me to start with?
BTW: I need to deal with some...
In RFC2616, there are content-encoding, transfer-encoding and accept-encoding. I am not sure which one applies to the encoding of response html. Any ideas?
Which RFC?
I only know from http header meta field, it could be assigned html encoding. I am not sure whether there are any other ways to assign html encoding? Like HTTP response header? Thanks!
The problem is I did not find from which paragraph of the Html Spec 5, the conclusion "it will be treated like it had a body around it" is reached. Any ideas?
But how does standard say whether content not wrapped in header and also not wrapped in body belongs to header or body?
Hi CB, do you mean section "4.4.7 The header element"?
http://www.w3.org/html/wg/html5/#the-header-element
I read it a couple of times but can not find the logics you mentioned,
1. header...
Thanks CB. I did read Html 5 Spec today.
http://www.w3.org/html/wg/html5/#the-header-element
I can not find out where you mentioned both header and body tags are optional. Could you let me know...
I did some search, do you mean here?
http://www.w3.org/TR/REC-html40/
But where is your 3 conclusions comes from? I read related Http header/body part, but no such 3 rules below.
...
CB, the only thing I read is RFC2616, Http standard. What do you mean HTML standard??
CB and others, I have tried in browser, I have 3 browsers, IE8, Firefox 3 and the Google Chrome. I also have powershell at hand if you call it is a browser which could retrive web content.
My...
It is Tech Board. We can discuss anything weird.
If a page does not have header and body element, in the RFC standard, we should treat the information as header or as body?
- I tested with IE and IE treats as body;
- I read from Http 1.1 RFC,...
CB, you are hiding the issue, not answering! :-)
For the noframe, here is what W3C said,
http://www.w3.org/TR/REC-html40/present/frames.html#h-16.4.1
"The NOFRAMES element specifies content...
CB, I think NOFRAME element is only used when browser is not supporting frame feature. Normally we should use frameset and frame, is that correct understanding?
Hi,
I am studying the differences between FRAME and IFRAME of Html. I read up a coupld of documents but still not clear, including W3C ones. I think the major differences is,
- page contains...
Thanks. So bad to see we can not use some customized data type. In order to make parameter list shorted, I wrap all parameters of some function into a single struct, and define the struct also into...
I was designing a COM in-process server. Previously it works with C++ client and works fine. But now some other guys have ideas to expand the coverage to VB and JScript clients. I am short of...
Thanks man, my code hangs at UploadData. You mean I stepped into this function to see assembly code?
Good idea! I have used tcpdump before. BTW: I just want to make sure there is no profile tool provided by VS or any Windows built-in tools which could provide "better" analysis performance. If they...