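I have been unable to resolve this problem with Flume log collection, and I cannot tell whether it is caused by the Hadoop cluster or by Flume itself.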

on_1_201408262011 to /data/flume/event_log/impression_washington_1_201408262011.COMPLETED

2014-08-27 10:24:49,844 (pool-5-thread-1) [INFO – org.apache.flume.client.avro.ReliableSpoolingFileEventReader.rollCurrentFile(ReliableSpoolingFileEventReader.java:308)] Preparing to move file /data/flume/event_log/impression_washington_1_201408262012 to /data/flume/event_log/impression_washington_1_201408262012.COMPLETED

2014-08-27 10:24:49,976 (pool-5-thread-1) [INFO – org.apache.flume.client.avro.ReliableSpoolingFileEventReader.rollCurrentFile(ReliableSpoolingFileEventReader.java:308)] Preparing to move file /data/flume/event_log/impression_washington_1_201408262013 to /data/flume/event_log/impression_washington_1_201408262013.COMPLETED

2014-08-27 10:24:50,107 (pool-5-thread-1) [INFO – org.apache.flume.client.avro.ReliableSpoolingFileEventReader.rollCurrentFile(ReliableSpoolingFileEventReader.java:308)] Preparing to move file /data/flume/event_log/impression_washington_1_201408262014 to /data/flume/event_log/impression_washington_1_201408262014.COMPLETED

2014-08-27 10:24:50,242 (pool-5-thread-1) [INFO – org.apache.flume.client.avro.ReliableSpoolingFileEventReader.rollCurrentFile(ReliableSpoolingFileEventReader.java:308)] Preparing to move file /data/flume/event_log/impression_washington_1_201408262015 to /data/flume/event_log/impression_washington_1_201408262015.COMPLETED

2014-08-27 10:24:50,333 (SinkRunner-PollingRunner-DefaultSinkProcessor) [ERROR – org.apache.flume.sink.hdfs.AbstractHDFSWriter.isUnderReplicated(AbstractHDFSWriter.java:82)] Unexpected error while checking replication factor

java.lang.reflect.InvocationTargetException

    at sun.reflect.GeneratedMethodAccessor7.invoke(Unknown Source)

    at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)

    at java.lang.reflect.Method.invoke(Method.java:606)

    at org.apache.flume.sink.hdfs.AbstractHDFSWriter.getNumCurrentReplicas(AbstractHDFSWriter.java:147)

    at org.apache.flume.sink.hdfs.AbstractHDFSWriter.isUnderReplicated(AbstractHDFSWriter.java:68)

    at org.apache.flume.sink.hdfs.BucketWriter.shouldRotate(BucketWriter.java:452)

    at org.apache.flume.sink.hdfs.BucketWriter.append(BucketWriter.java:387)

    at org.apache.flume.sink.hdfs.HDFSEventSink.process(HDFSEventSink.java:392)

    at org.apache.flume.sink.DefaultSinkProcessor.process(DefaultSinkProcessor.java:68)

    at org.apache.flume.SinkRunner$PollingRunner.run(SinkRunner.java:147)

    at java.lang.Thread.run(Thread.java:745)

Caused by: org.apache.hadoop.ipc.RemoteException(java.io.IOException): File /flume/14082710/test-.1409105924501.tmp could only be replicated to 0 nodes instead of minReplication (=1).  There are 2 datanode(s) running and 2 node(s) are excluded in this operation.

    at org.apache.hadoop.hdfs.server.blockmanagement.BlockManager.chooseTarget(BlockManager.java:1384)

    at org.apache.hadoop.hdfs.server.namenode.FSNamesystem.getAdditionalBlock(FSNamesystem.java:2477)

    at org.apache.hadoop.hdfs.server.namenode.NameNodeRpcServer.addBlock(NameNodeRpcServer.java:555)

    at org.apache.hadoop.hdfs.protocolPB.ClientNamenodeProtocolServerSideTranslatorPB.addBlock(ClientNamenodeProtocolServerSideTranslatorPB.java:387)

    at org.apache.hadoop.hdfs.protocol.proto.ClientNamenodeProtocolProtos$ClientNamenodeProtocol$2.callBlockingMethod(ClientNamenodeProtocolProtos.java:59582)

    at org.apache.hadoop.ipc.ProtobufRpcEngine$Server$ProtoBufRpcInvoker.call(ProtobufRpcEngine.java:585)

    at org.apache.hadoop.ipc.RPC$Server.call(RPC.java:928)

    at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:2048)

    at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:2044)

    at java.security.AccessController.doPrivileged(Native Method)

    at javax.security.auth.Subject.doAs(Subject.java:415)

    at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1491)

    at org.apache.hadoop.ipc.Server$Handler.run(Server.java:2042)

    at org.apache.hadoop.ipc.Client.call(Client.java:1347)

    at org.apache.hadoop.ipc.Client.call(Client.java:1300)

    at org.apache.hadoop.ipc.ProtobufRpcEngine$Invoker.invoke(ProtobufRpcEngine.java:206)

    at com.sun.proxy.$Proxy13.addBlock(Unknown Source)

    at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)

    at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)

    at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)

    at java.lang.reflect.Method.invoke(Method.java:606)

    at org.apache.hadoop.io.retry.RetryInvocationHandler.invokeMethod(RetryInvocationHandler.java:186)

    at org.apache.hadoop.io.retry.RetryInvocationHandler.invoke(RetryInvocationHandler.java:102)

    at com.sun.proxy.$Proxy13.addBlock(Unknown Source)

    at org.apache.hadoop.hdfs.protocolPB.ClientNamenodeProtocolTranslatorPB.addBlock(ClientNamenodeProtocolTranslatorPB.java:330)

    at org.apache.hadoop.hdfs.DFSOutputStream$DataStreamer.locateFollowingBlock(DFSOutputStream.java:1226)

    at org.apache.hadoop.hdfs.DFSOutputStream$DataStreamer.nextBlockOutputStream(DFSOutputStream.java:1078)

    at org.apache.hadoop.hdfs.DFSOutputStream$DataStreamer.run(DFSOutputStream.java:514)

2014-08-27 10:24:50,334 (SinkRunner-PollingRunner-DefaultSinkProcessor) [WARN – org.apache.flume.sink.hdfs.BucketWriter.append(BucketWriter.java:424)] Caught IOException writing to HDFSWriter (File /flume/14082710/test-.1409105924501.tmp could only be replicated to 0 nodes instead of minReplication (=1).  There are 2 datanode(s) running and 2 node(s) are excluded in this operation.

    at org.apache.hadoop.hdfs.server.blockmanagement.BlockManager.chooseTarget(BlockManager.java:1384)

    at org.apache.hadoop.hdfs.server.namenode.FSNamesystem.getAdditionalBlock(FSNamesystem.java:2477)

    at org.apache.hadoop.hdfs.server.namenode.NameNodeRpcServer.addBlock(NameNodeRpcServer.java:555)

    at org.apache.hadoop.hdfs.protocolPB.ClientNamenodeProtocolServerSideTranslatorPB.addBlock(ClientNamenodeProtocolServerSideTranslatorPB.java:387)

    at org.apache.hadoop.hdfs.protocol.proto.ClientNamenodeProtocolProtos$ClientNamenodeProtocol$2.callBlockingMethod(ClientNamenodeProtocolProtos.java:59582)

    at org.apache.hadoop.ipc.ProtobufRpcEngine$Server$ProtoBufRpcInvoker.call(ProtobufRpcEngine.java:585)

    at org.apache.hadoop.ipc.RPC$Server.call(RPC.java:928)

    at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:2048)

    at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:2044)

    at java.security.AccessController.doPrivileged(Native Method)

    at javax.security.auth.Subject.doAs(Subject.java:415)

    at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1491)

    at org.apache.hadoop.ipc.Server$Handler.run(Server.java:2042)

). Closing file (hdfs://zhangxin@14.18.203.70:9000/flume/14082710/test-.1409105924501.tmp) and rethrowing exception.

2014-08-27 10:24:50,335 (SinkRunner-PollingRunner-DefaultSinkProcessor) [WARN – org.apache.flume.sink.hdfs.BucketWriter.append(BucketWriter.java:430)] Caught IOException while closing file (hdfs://zhangxin@14.18.203.70:9000/flume/14082710/test-.1409105924501.tmp). Exception follows.

org.apache.hadoop.ipc.RemoteException(java.io.IOException): File /flume/14082710/test-.1409105924501.tmp could only be replicated to 0 nodes instead of minReplication (=1).  There are 2 datanode(s) running and 2 node(s) are excluded in this operation.

    at org.apache.hadoop.hdfs.server.blockmanagement.BlockManager.chooseTarget(BlockManager.java:1384)

    at org.apache.hadoop.hdfs.server.namenode.FSNamesystem.getAdditionalBlock(FSNamesystem.java:2477)

    at org.apache.hadoop.hdfs.server.namenode.NameNodeRpcServer.addBlock(NameNodeRpcServer.java:555)

    at org.apache.hadoop.hdfs.protocolPB.ClientNamenodeProtocolServerSideTranslatorPB.addBlock(ClientNamenodeProtocolServerSideTranslatorPB.java:387)

    at org.apache.hadoop.hdfs.protocol.proto.ClientNamenodeProtocolProtos$ClientNamenodeProtocol$2.callBlockingMethod(ClientNamenodeProtocolProtos.java:59582)

    at org.apache.hadoop.ipc.ProtobufRpcEngine$Server$ProtoBufRpcInvoker.call(ProtobufRpcEngine.java:585)

    at org.apache.hadoop.ipc.RPC$Server.call(RPC.java:928)

    at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:2048)

    at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:2044)

    at java.security.AccessController.doPrivileged(Native Method)

    at javax.security.auth.Subject.doAs(Subject.java:415)

    at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1491)

    at org.apache.hadoop.ipc.Server$Handler.run(Server.java:2042)

    at org.apache.hadoop.ipc.Client.call(Client.java:1347)

    at org.apache.hadoop.ipc.Client.call(Client.java:1300)

    at org.apache.hadoop.ipc.ProtobufRpcEngine$Invoker.invoke(ProtobufRpcEngine.java:206)

    at com.sun.proxy.$Proxy13.addBlock(Unknown Source)

    at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)

    at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)

    at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)

    at java.lang.reflect.Method.invoke(Method.java:606)

    at org.apache.hadoop.io.retry.RetryInvocationHandler.invokeMethod(RetryInvocationHandler.java:186)

    at org.apache.hadoop.io.retry.RetryInvocationHandler.invoke(RetryInvocationHandler.java:102)

    at com.sun.proxy.$Proxy13.addBlock(Unknown Source)

    at org.apache.hadoop.hdfs.protocolPB.ClientNamenodeProtocolTranslatorPB.addBlock(ClientNamenodeProtocolTranslatorPB.java:330)

    at org.apache.hadoop.hdfs.DFSOutputStream$DataStreamer.locateFollowingBlock(DFSOutputStream.java:1226)

    at org.apache.hadoop.hdfs.DFSOutputStream$DataStreamer.nextBlockOutputStream(DFSOutputStream.java:1078)

    at org.apache.hadoop.hdfs.DFSOutputStream$DataStreamer.run(DFSOutputStream.java:514)

2014-08-27 10:24:50,335 (SinkRunner-PollingRunner-DefaultSinkProcessor) [WARN – org.apache.flume.sink.hdfs.HDFSEventSink.process(HDFSEventSink.java:418)] HDFS IO error

org.apache.hadoop.ipc.RemoteException(java.io.IOException): File /flume/14082710/test-.1409105924501.tmp could only be replicated to 0 nodes instead of minReplication (=1).  There are 2 datanode(s) running and 2 node(s) are excluded in this operation.

    at org.apache.hadoop.hdfs.server.blockmanagement.BlockManager.chooseTarget(BlockManager.java:1384)

    at org.apache.hadoop.hdfs.server.namenode.FSNamesystem.getAdditionalBlock(FSNamesystem.java:2477)

    at org.apache.hadoop.hdfs.server.namenode.NameNodeRpcServer.addBlock(NameNodeRpcServer.java:555)

    at org.apache.hadoop.hdfs.protocolPB.ClientNamenodeProtocolServerSideTranslatorPB.addBlock(ClientNamenodeProtocolServerSideTranslatorPB.java:387)

    at org.apache.hadoop.hdfs.protocol.proto.ClientNamenodeProtocolProtos$ClientNamenodeProtocol$2.callBlockingMethod(ClientNamenodeProtocolProtos.java:59582)

    at org.apache.hadoop.ipc.ProtobufRpcEngine$Server$ProtoBufRpcInvoker.call(ProtobufRpcEngine.java:585)

    at org.apache.hadoop.ipc.RPC$Server.call(RPC.java:928)

    at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:2048)

    at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:2044)

    at java.security.AccessController.doPrivileged(Native Method)

    at javax.security.auth.Subject.doAs(Subject.java:415)

    at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1491)

    at org.apache.hadoop.ipc.Server$Handler.run(Server.java:2042)

    at org.apache.hadoop.ipc.Client.call(Client.java:1347)

    at org.apache.hadoop.ipc.Client.call(Client.java:1300)

    at org.apache.hadoop.ipc.ProtobufRpcEngine$Invoker.invoke(ProtobufRpcEngine.java:206)

    at com.sun.proxy.$Proxy13.addBlock(Unknown Source)

    at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)

    at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)

    at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)

    at java.lang.reflect.Method.invoke(Method.java:606)

    at org.apache.hadoop.io.retry.RetryInvocationHandler.invokeMethod(RetryInvocationHandler.java:186)

    at org.apache.hadoop.io.retry.RetryInvocationHandler.invoke(RetryInvocationHandler.java:102)

    at com.sun.proxy.$Proxy13.addBlock(Unknown Source)

    at org.apache.hadoop.hdfs.protocolPB.ClientNamenodeProtocolTranslatorPB.addBlock(ClientNamenodeProtocolTranslatorPB.java:330)

    at org.apache.hadoop.hdfs.DFSOutputStream$DataStreamer.locateFollowingBlock(DFSOutputStream.java:1226)

    at org.apache.hadoop.hdfs.DFSOutputStream$DataStreamer.nextBlockOutputStream(DFSOutputStream.java:1078)

    at org.apache.hadoop.hdfs.DFSOutputStream$DataStreamer.run(DFSOutputStream.java:514)

2014-08-27 10:24:50,377 (pool-5-thread-1) [INFO – org.apache.flume.client.avro.ReliableSpoolingFileEventReader.rollCurrentFile(ReliableSpoolingFileEventReader.java:308)] Preparing to move file /data/flume/event_log/impression_washington_1_201408262016 to /data/flume/event_log/impression_washington_1_201408262016.COMPLETED

2014-08-27 10:24:50,510 (pool-5-thread-1) [INFO – org.apache.flume.client.avro.ReliableSpoolingFileEventReader.rollCurrentFile(ReliableSpoolingFileEventReader.java:308)] Preparing to move file /data/flume/event_log/impression_washington_1_201408262017 to /data/flume/event_log/impression_washington_1_201408262017.COMPLETED

2014-08-27 10:24:50,642 (pool-5-thread-1) [INFO – org.apache.flume.client.avro.ReliableSpoolingFileEventReader.rollCurrentFile(ReliableSpoolingFileEventReader.java:308)] Preparing to move file /data/flume/event_log/impression_washington_1_201408262018 to /data/flume/event_log/impression_washington_1_201408262018.COMPLETED

2014-08-27 10:24:50,774 (pool-5-thread-1) [INFO – org.apache.flume.client.avro.ReliableSpoolingFileEventReader.rollCurrentFile(ReliableSpoolingFileEventReader.java:308)] Preparing to move file /data/flume/event_log/impression_washington_1_201408262019 to /data/flume/event_log/impression_washington_1_201408262019.COMPLETED

2014-08-27 10:24:50,908 (pool-5-thread-1) [INFO – org.apache.flume.client.avro.ReliableSpoolingFileEventReader.rollCurrentFile(ReliableSpoolingFileEventReader.java:308)] Preparing to move file /data/flume/event_log/impression_washington_1_201408262020 to /data/flume/event_log/impression_washington_1_201408262020.COMPLETED

2014-08-27 10:24:51,040 (pool-5-thread-1) [INFO – org.apache.flume.client.avro.ReliableSpoolingFileEventReader.rollCurrentFile(ReliableSpoolingFileEventReader.java:308)] Preparing to move file /data/flume/event_log/impression_washington_1_201408262021 to /data/flume/event_log/impression_washington_1_201408262021.COMPLETED

2014-08-27 10:24:51,171 (pool-5-thread-1) [INFO – org.apache.flume.client.avro.ReliableSpoolingFileEventReader.rollCurrentFile(ReliableSpoolingFileEventReader.java:308)] Preparing to move file /data/flume/event_log/impression_washington_1_201408262022 to /data/flume/event_log/impression_washington_1_201408262022.COMPLETED

2014-08-27 10:24:51,303 (pool-5-thread-1) [INFO – org.apache.flume.client.avro.ReliableSpoolingFileEventReader.rollCurrentFile(ReliableSpoolingFileEventReader.java:308)] Preparing to move file /data/flume/event_log/impression_washington_1_201408262023 to /data/flume/event_log/impression_washington_1_201408262023.COMPLETED

2014-08-27 10:24:51,436 (pool-5-thread-1) [INFO – org.apache.flume.client.avro.ReliableSpoolingFileEventReader.rollCurrentFile(ReliableSpoolingFileEventReader.java:308)] Preparing to move file /data/flume/event_log/impression_washington_1_201408262024 to /data/flume/event_log/impression_washington_1_201408262024.COMPLETED

2014-08-27 10:24:51,571 (pool-5-thread-1) [INFO – org.apache.flume.client.avro.ReliableSpoolingFileEventReader.rollCurrentFile(ReliableSpoolingFileEventReader.java:308)] Preparing to move file /data/flume/event_log/impression_washington_1_201408262025 to /data/flume/event_log/impression_washington_1_201408262025.COMPLETED

2014-08-27 10:24:51,716 (pool-5-thread-1) [INFO – org.apache.flume.client.avro.ReliableSpoolingFileEventReader.rollCurrentFile(ReliableSpoolingFileEventReader.java:308)] Preparing to move file /data/flume/event_log/impression_washington_1_201408262026 to /data/flume/event_log/impression_washington_1_201408262026.COMPLETED

2014-08-27 10:24:51,846 (pool-5-thread-1) [INFO – org.apache.flume.client.avro.ReliableSpoolingFileEventReader.rollCurrentFile(ReliableSpoolingFileEventReader.java:308)] Preparing to move file /data/flume/event_log/impression_washington_1_201408262027 to /data/flume/event_log/impression_washington_1_201408262027.COMPLETED

2014-08-27 10:24:51,978 (pool-5-thread-1) [INFO – org.apache.flume.client.avro.ReliableSpoolingFileEventReader.rollCurrentFile(ReliableSpoolingFileEventReader.java:308)] Preparing to move file /data/flume/event_log/impression_washington_1_201408262028 to /data/flume/event_log/impression_washington_1_201408262028.COMPLETED
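
The same NameNode message repeats throughout the log above: the file "could only be replicated to 0 nodes", with 2 DataNodes running and 2 nodes excluded. Since every running DataNode was excluded for this write, one plausible reading is that the failure lies on the HDFS side, or in the network path between the Flume host and the DataNodes, rather than in the spooling source, which keeps rolling files normally. That is an inference; the log does not confirm it. Below is a minimal Python sketch that pulls those counts out of a log line so the condition can be spotted in a long log. The function name `summarize_placement_error` and the `all_excluded` flag are hypothetical helpers introduced here for illustration; they are not part of Flume or Hadoop.

```python
import re

# Matches the NameNode block-placement failure text exactly as it appears in the log above.
PLACEMENT_ERROR = re.compile(
    r"could only be replicated to (?P<placed>\d+) nodes instead of "
    r"minReplication \(=(?P<min_repl>\d+)\)\.\s+There are (?P<running>\d+) "
    r"datanode\(s\) running and (?P<excluded>\d+) node\(s\) are excluded"
)


def summarize_placement_error(line):
    """Return the counts from a placement-failure line, or None if the line does not match.

    Hypothetical helper: 'all_excluded' is our own flag, set when every running
    DataNode was excluded from the write.
    """
    match = PLACEMENT_ERROR.search(line)
    if match is None:
        return None
    info = {name: int(value) for name, value in match.groupdict().items()}
    info["all_excluded"] = info["running"] > 0 and info["excluded"] >= info["running"]
    return info


if __name__ == "__main__":
    sample = (
        "File /flume/14082710/test-.1409105924501.tmp could only be replicated "
        "to 0 nodes instead of minReplication (=1).  There are 2 datanode(s) "
        "running and 2 node(s) are excluded in this operation."
    )
    print(summarize_placement_error(sample))
```

If `all_excluded` is true, as it is for the lines in this log, it may be worth checking the DataNodes themselves before changing the Flume configuration. For example, `hdfs dfsadmin -report` shows each DataNode's remaining capacity, and it is also worth confirming that the Flume host can reach the DataNode data-transfer ports.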


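Copyright notice: This is an original article by xinxin_zhang, licensed under CC 4.0 BY-SA. When reposting, please include a link to the original source and this notice.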