VB程序员博客
要求 1 最好不用webbrowser,实在没别的办法也凑合了.
2 最重要的:不是取源码 是页面上的文字.
3 进程要同步的.
急!!!!@V@
先谢!!
如果webbrowser凑合了。取 webbrowser.doucument.body.innerText
我上面说的不很清楚:
因为页面上有很多数据在源文件上看不到
是js刷出来的 我用了很多办法都没得到解决办法
曾经想用webbrowser生成静态html后在保存 但这个办法效率太低
因为有很多图片什么的要显示 webbrowser有段时间一直是busy
而 webbrowser.Document又如何saveAS?
如何因为这个问题再抓包分析….
那我宁可牺牲时间来用webbrowser
<
是js刷出来的 我用了很多办法都没得到解决办法
ajax??
我记得有个对象是html的组件,用winsock+那个也可以执行一下脚本
但是这个涉及这些脚本的具体实现方案了,还有是不是也引用了外部脚本等等,这个不一定的…..
如果精益求精的考虑是这样的,不过这个方案也未必比webbrowser要优化
思路:
1. 用inet取得网页源代码
2. 用正则分析此源代码,得到想要文字
http://q.stock.sohu.com/cn/600250/index.shtml
帮我分析下吧
我需要这个网页上的几个部分
1 南纺股份
(600250)7.434.21%+0.302009-11-25 15:00均 价: 7.19现手: 51总金额: 4638最高: 7.44涨停: 7.84昨收: 7.13市盈率: 92.18总手: 64497换手率: 2.49%最低: 7.01跌停: 6.42今开: 7.06
—————————————
2 个股概况(这部分 源文件里有)
—————————————
3 所属板块
—————————————–
4 1日买卖量对比(这部分 源文件里有)
———————————————-
5 最好是能把flash需要刷的数据提取出来 能在flash控件里显示
//5 最好是能把flash需要刷的数据提取出来 能在flash控件里显示
没懂…看到flash了,你需要提出这个数据还要干什么?
<
楼上的liuxiaoyi666,哪儿都有你的身影呀,呵呵,看来真的想再得个星星
我有个从新浪读股票数据的代码,和你交换如何 ^_^!
给我两个Q币,或者是帮我的QQ宠买点药和食物.偶的小可爱快挂了
可以取到以下数据
Public Enum 数据
股票名字 = 0
今日开盘价 = 1
昨日收盘价 = 2
当前价格 = 3
今日最高价 = 4
今日最低价 = 5
买一竞买价 = 6
卖一竞卖价 = 7
成交的股票数 = 8
成交金额 = 9
买一股 = 10
买一价 = 11
买二股 = 12
买二价 = 13
买三股 = 14
买三价 = 15
买四股 = 16
买四价 = 17
买五股 = 18
买五价 = 19
卖一股 = 20
卖一价 = 21
卖二股 = 22
卖二价 = 23
卖三股 = 24
卖三价 = 25
卖四股 = 26
卖四价 = 27
卖五股 = 28
卖五价 = 29
日期 = 30
时间 = 31
End Enum
<script>document.domain='sohu.com';PEAK_ODIA=parent.PEAK_ODIA; </script> <script>PEAK_ODIA(['price_A1',['cn_600250','南纺股份','7.43','4.21%','+0.30','']]) </script> <script>PEAK_ODIA(['price_A2',['7.19','92.18','51','64497','4638','2.49%','7.44','7.01','7.84','6.42','7.13','7.06','18.60','2.98']]) </script> <script>PEAK_ODIA(['quote_m_r',"['14:58','7.43','7.1883','490','36'],['14:59','7.43','7.1902','487','36'],['15:00','7.43','7.1904','51','4']"]) </script> <script>PEAK_ODIA(['quote_k_r',"[20091125','7.06','7.43','7.44','7.01','64497','4638','2.49%','+0.30','4.21%']"]) </script> <script>PEAK_ODIA(['quote_wk_r',"['20091125','7.50','7.43','7.95','7.01','287389.67','21618','419.98%','13.00%','0.88','20091123']"]) </script> <script>PEAK_ODIA(['quote_mk_r',"['20091125','6.40','7.43','7.95','6.15','1106784.87','79121','644.70%','4.21%','0.30','20091102']"]) </script> <script>PEAK_ODIA(['index',['zs_000001','上证指数','3290.16','+66.64','/zs/000001/index.shtml'],['zs_399001','深证成指','13814.55','+361.27','/zs/399001/index.shtml'],['zs_399101','中小板综','5666.04','+187.10','/zs/399101/index.shtml']]) </script> <script>PEAK_ODIA(['perform',['4%','74','7.43','19','7.44','288','7.45','432','7.46','64','7.47','30','7.42','125','7.41','463','7.40','307','7.38','1','7.37','10','34511','29986']]) </script> <script>PEAK_ODIA(['dealdetail',['15:00','-7.43','4%','51','4'],['14:59','-7.43','4.21%','136','10'],['14:59','-7.43','4.21%','51','4'],['14:59','7.44','4.35%','62','5'],['14:59','-7.43','4.21%','30','2'],['14:59','-7.43','4.21%','123','9'],['14:59','-7.43','4.21%','85','6'],['14:58','7.43','4.21%','157','12'],['14:58','7.43','4.21%','7','0.52'],['14:58','7.43','4.21%','127','9'],['14:58','7.43','4.21%','97','7'],['14:58','-7.42','4.07%','47','3'],['14:58','-7.43','4.21%','55','4']]) </script> <script>PEAK_ODIA(['pricedetail',['7.37','428','32','27%'],['7.38','1388','102','100%'],['7.40','1334','99','99%'],['7.41','219','16','12%'],['7.42','345','26','69%'],['7.43','1026','76','38%'],['7.44','62','5','100%']]) </script> <script>PEAK_ODIA(['sector',['8','创投板块','2.75%','/cn/bk_8.shtml'],['23','纺织服装','3.44%','/cn/bk_23.shtml'],['46','券商重仓','3.42%','/cn/bk_46.shtml'],['64','外贸板块','3.62%','/cn/bk_64.shtml'],['119','低价','3.47%','/cn/bk_119.shtml'],['370','长三角','3.47%','/cn/bk_370.shtml'],['451','江苏板块','3.88%','/cn/bk_451.shtml']]) </script> <script>PEAK_ODIA(['changein5m',['cn_600826','兰生股份','16.71','2.82%','/cn/600826/index.shtml'],['cn_600969','郴电国际','13.25','2.07%','/cn/600969/index.shtml'],['cn_600019','宝钢股份','8.93','2.06%','/cn/600019/index.shtml'],['cn_600765','中航重机','23.12','1.71%','/cn/600765/index.shtml'],['cn_600594','益佰制药','18.27','1.61%','/cn/600594/index.shtml'],['cn_600557','康缘药业','21.31','1.52%','/cn/600557/index.shtml'],['cn_600340','*ST国祥','13.58','1.48%','/cn/600340/index.shtml'],['cn_600750','江中药业','23.10','1.36%','/cn/600750/index.shtml'],['cn_600323','南海发展','11.29','1.34%','/cn/600323/index.shtml'],['cn_600135','乐凯胶片','9.31','1.09%','/cn/600135/index.shtml']]) </script> <script>PEAK_ODIA(['time',['2009','11','25','15','00','59']]) </script>
用抓包工具分析一下,你说的是这个东西吧??
<
liuxiaoyi666:
好象不是这个东西 yahoo上看的比较清楚看到某时刻的数据和他们变量名 但不清楚是怎么被flash播放器读进去的
楼主,你的要求达不到,别浪费时间了。
你找个抓包工具….这算是ajax,这部分数据是时时抓取的,你样flash的数据也能取出来了….
不过你需要不停的读
falsh放弃了
其他的有办法取道数据吗?
<
<
<
http://s2.biz.itc.cn/cn/script/hq-1.3.5.js
所属板块的信息在这个文件里呢….不过不用看了,这个混淆过了,很难破译了…加上一大堆的unicode
编码….还是webbrowser吧….
webbrowser.document.getElementById("BIZ_IS_sector").innerHTML
楼主可以参考一下
http://topic.csdn.net/u/20091030/19/d94e3052-945e-4cae-8463-c9af347e1b34.html
<
<
<
<
<
飘过,帮顶
<