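I am currently working on getting phone numbers parsed into a PDF form. I am using the gem pdf-form, which in turn uses pdftk. When I request a form from the FormStack service, the phone number comes back as JSON. I created a DICTIONARY that maps the field ids in the PDF form to the field ids in the FormStack JSON response, so that I can iterate over them.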
DICTIONARY = {
  'Applicant Name'               => '66563757', # Name
  'DATE OF BIRTH mmddyyyy'       => '66563844', # Date of Birth
  ##
  'TELEPHONE NUMBER area code 1' => '66563943', # Primary Phone - Area code (000)
  'TELEPHONE NUMBER first 3 1'   => '66563943', # Primary Phone - (000) 000
  'TELEPHONE NUMBER Last 4 1'    => '66563943', # Primary Phone - (000) 000-0000
  'CELLPHONE NUMBER area code 1' => '66563947', # Secondary Phone - Area code (000)
  'CELLPHONE NUMBER first 3 1'   => '66563947', # Secondary Phone - (000) 000
  'CELLPHONE NUMBER Last 4 1'    => '66563947', # Secondary Phone - (000) 000-0000
  'TELEPHONE NUMBER area code 2' => '66564485', # Phone - Area code (000)
  'TELEPHONE NUMBER first 3 2'   => '66564485', # Phone - first 3 - (000) 000
  'TELEPHONE NUMBER Last 4 2'    => '66564485', # Phone - last 4 - (000) 000-0000
}
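As you can see, each phone number is split into a 3-digit area code, the next 3 digits, and the last 4 digits. I created a library to parse the phone number into the PDF form fields. The values in the DICTIONARY hash are the numeric field ids in FormStack. USER_DATA is what gets passed to initialize(user_submission_data) when the form is filled out; it is the user's data.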
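To make the three-way split concrete, here is a minimal sketch of it in isolation. The helper name split_phone_number and the "(201) 785-9896" input format are assumptions for illustration; the question only shows the parsed result, not the raw FormStack value. Unlike a bare match(...).captures, this version returns nil rather than raising when the input does not contain exactly ten digits.

```ruby
# Hypothetical helper, not part of pdf-form or FormStack.
# Strips every non-digit, then splits a 10-digit number into
# area code, next three digits, and last four digits.
def split_phone_number(raw)
  digits = raw.to_s.gsub(/\D+/, "")
  # &. returns nil instead of raising NoMethodError when there is no match
  digits.match(/\A(\d{3})(\d{3})(\d{4})\z/)&.captures
end

p split_phone_number("(201) 785-9896") # => ["201", "785", "9896"]
p split_phone_number("785-9896")       # => nil (only 7 digits)
```

Whether a nil should be skipped or reported depends on how strictly the form input is validated upstream.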
class PdfScrie < FillablePdfForm
  def initialize(user_submission_data)
    @user_submission_data = user_submission_data
    super()
  end

  private

  DICTIONARY = {
  }

  APPLICANT_ADDRESS_FORM_FIELD_ID = "66563934"
  TENANT_REPRESENTATIVE_ADDRESS_FORM_FIELD_ID = "66564487"
  TELEPHONE_NUMBER_1_id = "66563943"
  CELLPHONE_NUMBER_1_id = "66563947"

  ADDRESS_FIELD_IDS = [
    APPLICANT_ADDRESS_FORM_FIELD_ID,
    TENANT_REPRESENTATIVE_ADDRESS_FORM_FIELD_ID,
  ].freeze

  PHONE_NUMBER_FIELD_IDS = [
    TELEPHONE_NUMBER_1_id,
    CELLPHONE_NUMBER_1_id
  ].freeze

  def fill_out
    form_fields.each do |field|
      id = DICTIONARY[field]
      @user_submission_data
        .select { |fd| fd[:field] == id }
        .each do |field_data|
          if address_field?(field_data[:field])
            break_address_into_state_city_zipcode(field_data[:value], field)
          elsif phone_number?(field_data[:field])
            parse_phone_number(field_data[:value])
            fill(field, @phone_number_sections.shift)
          else
            fill(field, field_data[:value])
          end
        end
    end
  end

  def address_field?(field)
    ADDRESS_FIELD_IDS.include?(field.to_s)
  end

  def break_address_into_state_city_zipcode(address, field)
    address_by_section = FormStack::Form.parse_formstack_nested_attrs(address)
    address_by_section.each do |section, value|
      fill(field, value) if form_field_has_section?(field, section)
    end
  end

  def form_field_has_section?(form_field_name, address_section)
    form_field_name.include? address_section.upcase
  end

  def parse_phone_number(phone_number)
    @phone_number_sections ||= phone_number.gsub(/\D+/, "").match(/(...)(...)(....)/).captures
  end

  def phone_number?(field)
    PHONE_NUMBER_FIELD_IDS.include?(field.to_s)
  end
end
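However, when iterating over PHONE_NUMBER_FIELD_IDS, only the first phone number gets parsed. When I run binding.pry inside the fill_out method, I can see that the number really is the one I want to parse.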
[2] pry(#<PdfScrie>)> parse_phone_number(field_data[:value])
=> ["201", "785", "9896"]
[3] pry(#<PdfScrie>)> fill(field, @phone_number_sections.shift)
=> "201"
But CELLPHONE_NUMBER_1_id is not parsed, and I do not understand why.
This is the fill method from the other library, which is what writes the value into the PDF.
def fill(key, value)
  attributes[ key.to_s ] = value
end

def pdftk
  @pdftk ||= PdfForms.new()
end
Posted on 2018-09-19 06:28:08
Assuming I am reading your code correctly, the problem is that an empty array is not falsy.
> foo = []
# => []
> foo ||= [1]
# => []
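So once the first phone number has been processed, the second phone number is never put into @phone_number_sections to be processed. By then shift has emptied the array, but [] is still truthy, so ||= does not reassign it.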
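The behaviour can be reproduced outside the PDF code with a small stand-in. This is only a sketch: the class BuggyPhoneSections and the second number "(555) 123-4567" are made up for illustration, while the parsing line mirrors the one in the question.

```ruby
# Stand-in for the parsing state in the question (not the real PdfScrie class).
class BuggyPhoneSections
  def parse_phone_number(phone_number)
    # ||= only assigns when the variable is nil or false.
    # An empty array is truthy, so this never re-parses.
    @sections ||= phone_number.gsub(/\D+/, "").match(/(...)(...)(....)/).captures
  end

  # Mimics one pass through the phone branch of fill_out:
  # parse, then take the next section for the current PDF field.
  def next_section(phone_number)
    parse_phone_number(phone_number)
    @sections.shift
  end
end

sections = BuggyPhoneSections.new
first  = Array.new(3) { sections.next_section("(201) 785-9896") }
second = Array.new(3) { sections.next_section("(555) 123-4567") }

p first   # => ["201", "785", "9896"]
p second  # => [nil, nil, nil]
```

The three fields of the first number drain the array, and the three fields of the second number then receive nil.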
What you probably need to do is append the new captures to the sections, like this:
def parse_phone_number(phone_number)
  @phone_number_sections ||= []
  @phone_number_sections += phone_number.gsub(/\D+/, "").match(/(...)(...)(....)/).captures
end
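Alternatively, since it looks like the same number may get added more than once (parse_phone_number runs once for each of the three PDF fields that map to the same FormStack id), you may want to do something like this: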
def parse_phone_number(phone_number)
  if @phone_number_sections.nil? || @phone_number_sections.empty?
    @phone_number_sections = phone_number.gsub(/\D+/, "").match(/(...)(...)(....)/).captures
  end
end
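As a rough check of the refill-when-empty variant, the same kind of stand-in can be run with two numbers. The class RefillingPhoneSections and the second number are hypothetical; only the nil-or-empty test comes from the answer above.

```ruby
# Stand-in that re-parses only when the previous number has been fully consumed.
class RefillingPhoneSections
  def parse_phone_number(phone_number)
    if @sections.nil? || @sections.empty?
      @sections = phone_number.gsub(/\D+/, "").match(/(...)(...)(....)/).captures
    end
  end

  # One pass through the phone branch: parse if needed, then take a section.
  def next_section(phone_number)
    parse_phone_number(phone_number)
    @sections.shift
  end
end

sections = RefillingPhoneSections.new
first  = Array.new(3) { sections.next_section("(201) 785-9896") }
second = Array.new(3) { sections.next_section("(555) 123-4567") }

p first   # => ["201", "785", "9896"]
p second  # => ["555", "123", "4567"]
```

This relies on the three fields of one number being visited back to back, which matches the order of the DICTIONARY shown in the question. If the PDF returned its fields in a different order, sections from different numbers could interleave.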
https://stackoverflow.com/questions/52394303